Similar to the JobTracker High Availability configuration for MR1, Hue now supports more than one Resource Manager, so it keeps working when the active Resource Manager goes down. This is available in master as of today, and in Hue 3.7 and CDH5.1.
Hue automatically picks up the active Resource Manager, even after a failover. This is possible because:
- When submitting Oozie jobs, the logical name of the Resource Manager is used instead of the hostname of the current Resource Manager
- Job Browser will automatically look for the active Resource Manager API if needed
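
To make the second point more concrete, here is a minimal sketch of how a client could find the active Resource Manager among several candidates. This is not Hue's actual code: the function names `fetch_ha_state` and `pick_active_rm` are made up for illustration, and the sketch assumes that each Resource Manager answers `/ws/v1/cluster/info` with a `clusterInfo.haState` field, as the YARN REST API does on HA-enabled clusters.

```python
import json
from urllib.request import urlopen


def fetch_ha_state(rm_url, timeout=5):
    """Return the haState reported by one Resource Manager, or None if unreachable.

    Assumption: the Resource Manager exposes the YARN cluster info endpoint.
    """
    try:
        with urlopen(rm_url.rstrip("/") + "/ws/v1/cluster/info", timeout=timeout) as resp:
            info = json.load(resp)
    except (OSError, ValueError):
        # Connection refused, timeout, HTTP error, or a body that is not JSON
        return None
    return info.get("clusterInfo", {}).get("haState")


def pick_active_rm(rm_urls, fetch=fetch_ha_state):
    """Return the first URL whose Resource Manager reports ACTIVE, else None."""
    for url in rm_urls:
        if fetch(url) == "ACTIVE":
            return url
    return None
```

Calling `pick_active_rm(["http://gethue-1.com:8088", "http://gethue-2.com:8088"])` would probe each host in turn and return whichever one currently reports itself as active. The `fetch` parameter is only there so the lookup can be exercised without a live cluster.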
Here is an example configuration for the [[yarn_clusters]] section in hue.ini:

    [hadoop]
      # Configuration for YARN (MR2)
      # ------------------------
      [[yarn_clusters]]
        [[[default]]]
          # Whether to submit jobs to this cluster
          submit_to=True
          # Name used when submitting jobs
          logical_name=ha-rm
          # URL of the ResourceManager API
          resourcemanager_api_url=http://gethue-1.com:8088
          # URL of the ProxyServer API
          proxy_api_url=http://gethue-1.com:8088
          # URL of the HistoryServer API
          history_server_api_url=http://gethue-1.com:19888
        [[[ha]]]
          # Enter the host on which you are running the failover Resource Manager
          resourcemanager_api_url=http://gethue-2.com:8088
          logical_name=ha-rm
          submit_to=True

We hope that multi-Resource Manager support will make your life with Hadoop easier!
As usual, feel free to send feedback on the hue-user list or @gethue!