YARN Resource Manager High Availability (HA) in MR2

Published on 18 June 2014 - 1 minute read - Last modified on 06 March 2021

Similar to the JobTracker High Availability configuration for MR1, Hue supports more than one Resource Manager, so it keeps working when the active Resource Manager goes down. This is available in master as of this post, and in Hue 3.7 or CDH5.1.

Hue automatically picks up the active Resource Manager, even after a failover. This is possible because:

  • When submitting Oozie jobs, the logical name of the Resource Manager is used instead of the hostname of the current Resource Manager
  • Job Browser will automatically look for the active Resource Manager API if needed
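
The lookup described in the second bullet can be sketched roughly as follows. This is a minimal illustration, not Hue's actual implementation: the helper names `fetch_ha_state` and `pick_active_rm` are hypothetical, and the sketch assumes each Resource Manager answers the YARN REST endpoint `/ws/v1/cluster/info` with a `clusterInfo.haState` field set to `ACTIVE` or `STANDBY`.

```python
import json
from urllib.request import urlopen


def fetch_ha_state(base_url, timeout=5):
    """Return the haState a Resource Manager reports, or None if it is unreachable.

    Assumption: the RM exposes /ws/v1/cluster/info with a clusterInfo.haState field.
    """
    url = base_url.rstrip("/") + "/ws/v1/cluster/info"
    try:
        with urlopen(url, timeout=timeout) as resp:
            info = json.load(resp)
    except (OSError, ValueError):
        # Connection refused, timeout, HTTP error, or a non-JSON body.
        return None
    return info.get("clusterInfo", {}).get("haState")


def pick_active_rm(candidate_urls, get_state=fetch_ha_state):
    """Return the first candidate URL whose state is ACTIVE, or None if there is none."""
    for url in candidate_urls:
        if get_state(url) == "ACTIVE":
            return url
    return None
```

With the two hosts used in this post, a call such as `pick_active_rm(["http://gethue-1.com:8088", "http://gethue-2.com:8088"])` would return whichever Resource Manager currently reports itself as active. Passing `get_state` as a parameter keeps the selection logic easy to exercise without a live cluster.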

Here is an example configuration for the [[yarn_clusters]] section in hue.ini:

    [hadoop]

      # Configuration for YARN (MR2)
      # ------------------------
      [[yarn_clusters]]

        [[[default]]]
          # Whether to submit jobs to this cluster
          submit_to=True

          # Name used when submitting jobs
          logical_name=ha-rm

          # URL of the ResourceManager API
          resourcemanager_api_url=http://gethue-1.com:8088

          # URL of the ProxyServer API
          proxy_api_url=http://gethue-1.com:8088

          # URL of the HistoryServer API
          history_server_api_url=http://gethue-1.com:19888

        [[[ha]]]
          # Enter the host on which you are running the failover Resource Manager
          resourcemanager_api_url=http://gethue-2.com:8088
          logical_name=ha-rm
          submit_to=True

We hope that support for multiple Resource Managers will make your life with Hadoop easier!

As usual, feel free to send feedback on the hue-user list or @gethue!

