Best practices for backing up a Hadoop cluster
Before backing up a Hadoop cluster, consider the following:
To backup an entire Hadoop file system provide "/" as the backup selection and ensure that "/" is snapshot enabled.
Before you execute a backup job, ensure for a successful ping response from the backup hosts to hostname (FQDN) of all the nodes.
Update the firewall settings so that the backup hosts can communicate with the Hadoop cluster.
Ensure that the local time on the HDFS nodes and the backup host are synchronized with the NTP server.
Ensure that you have valid certificates for a Hadoop cluster that is enabled with SSL (HTTPS).