Articles & News

10 April 2019

How to enable new user and create Hive tables on a Kerberized secure cluster with Apache Sentry

Granting a new user the proper permissions on a secure cluster can be tricky, so let’s walk through the steps to enable any new user for table creation on a Kerberized cluster. Depending on your cluster size, creating the user and group on each node can be tedious. Here we use pssh (Parallel ssh) for this task. 1. Install the tool and prepare a file which contains all your hosts. For Mac users:…

3 minutes read - Administration / Querying / Version 4

01 April 2019

Hive on Tez integrations improvements

We’ve made some improvements when using Apache Tez as the query engine of the SQL Editor: When running a query, the job id will now show up in the query log. Clicking the id will show the job in the mini job browser. Tez does not update its progress in the log, but if you’ve opened the mini job browser, Hue will be able to update the job’s progress right in the editor.…

1 minute read - Querying / Version 4

28 March 2019

Hue 4.4 and its improvements are out!

Hi Big Data Explorers, The Hue Team is glad to thank all the contributors and release Hue 4.4! The focus of this release was to improve self-service SQL troubleshooting and stability. This release comes with 450 commits and 80+ bug fixes! For all the changes, check out the release notes. Go grab the tarball or source, and give it a spin! And for a quick try, ’docker pull gethue/4.…

3 minutes read - Querying / Version 4 / Release

26 March 2019

Quick Task: Document Count Check

When the Hue database has too many entries in certain tables, performance suffers. The Hue config check now helps superusers find this issue. Log in as a superuser and go to “Hue Administration”; a warning like the following is displayed in the quick start wizard when the tables have too many entries. Warning: Hue database Document2 has too many entries which may cause performance issue, please run command line tool to clean up.…

1 minute read - Administration / Version 4

26 March 2019

Quick Task: Fixing “ImportError: No module named google_compute_engine” when building Hue

When building Hue on a Google Compute Engine machine, you might hit this issue: ImportError: No module named google_compute_engine with this full trace: creating 'dist/kombu-4.3.0-py2.7.egg' and adding 'build/bdist.linux-x86_64/egg' to it removing 'build/bdist.linux-x86_64/egg' (and everything under it) - Building egg for boto-2.46.1 Traceback (most recent call last): File "", line 1, in File "/home/romain/hue/build/env/local/lib/python2.7/site-packages/setuptools/", line 253, in run_setup raise File "/usr/lib/python2.7/", line 35, in __exit__ self.gen.throw(type, value, traceback) File "/home/romain/hue/build/env/local/lib/python2.7/site-packages/setuptools/", line 195, in setup_context yield File "…

2 minutes read - Administration / Version 4

26 March 2019

Quick Task: Restrict Number of Concurrent Sessions Per User

Hue administrators can restrict the number of concurrent sessions per user. The default value is 0, meaning no restriction: a user can have as many simultaneous Hue sessions, i.e. logins, as they wish. For security purposes, this can be restricted, and concurrent_user_session_limit is then normally set to 1. [desktop] [[session]] concurrent_user_session_limit=1 When concurrent_user_session_limit is set to 1, any session in excess of 1, i.e. on a different machine or browser, is removed, eldest first.…
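
To make the "removed by eldest" rule concrete, here is a minimal Python sketch of how such a limit could behave. It is an illustration only, not Hue's actual implementation: the function name `enforce_session_limit` and the `(session_id, login_time)` tuples are hypothetical.

```python
def enforce_session_limit(sessions, limit):
    """Split sessions into (kept, removed) for one user.

    sessions: list of (session_id, login_time) tuples (hypothetical shape).
    limit:    maximum number of concurrent sessions; 0 means no restriction.
    """
    if limit <= 0:
        # A limit of 0 means unlimited: keep every session.
        return list(sessions), []

    # Order sessions from eldest to newest by login time.
    ordered = sorted(sessions, key=lambda s: s[1])

    # Everything beyond the limit is removed, eldest first.
    excess = max(0, len(ordered) - limit)
    return ordered[excess:], ordered[:excess]


sessions = [("laptop", 1), ("phone", 3), ("desktop", 2)]
kept, removed = enforce_session_limit(sessions, 1)
print("kept:", kept)
print("removed:", removed)
```

With a limit of 1, only the most recent login survives in this sketch; with a limit of 0, nothing is removed.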

1 minute read - Administration

22 March 2019

Quick Task: How to query Apache Druid analytic database

Self-service exploratory analytics is one of the most common use cases among Hue users. While deeply integrated with Apache Impala and Apache Hive, Hue also lets you take advantage of its smart editor and assistants with any database. In this tutorial, let’s see how to query Apache Druid. Apache Druid is an “OLAP style” database. If not already running, it is easy to get Druid downloaded and started. In our case we will just query the provided Wikipedia data sample.…

2 minutes read - Querying / Version 4

22 March 2019

Querying & Exploring the Instacart dataset Part 1: Ingesting the data

Self-service exploratory analytics is one of the most common use cases among Hue users. In this tutorial, let’s see how to get started on the analysis. We will use the free Instacart dataset and start with the Importer feature. Getting the data This step was made particularly easy by Instacart. Just go to their dataset page of 3 million orders and download the 200 MB. Making it queryable The next step is not always trivial.…

2 minutes read - Browsing / Querying / Version 4

18 March 2019

Quick Task: How to count the documents of a user via the Shell?

How to count the documents of a user? Sometimes, it is convenient to administer Hue directly via the command line. While investigating a slowdown, we discovered that the demo user had more than 85,000 documents! This was a quick way to validate this and delete the extra ones. On the command line: ./build/env/bin/hue shell If using Cloudera Manager, as a root user launch the shell.…

2 minutes read - Administration

12 March 2019

Hue in Docker

Containers offer a modern way to isolate and run applications. This post is the first one of a series showing how to run Hue as a service. Here, we will explore how to build, run and configure a Hue server image with Docker. For impatient people, the source is available at tools/docker. Get the docker image Just pull the latest from the Internet or build it yourself from the Hue repository.…

2 minutes read - Administration

More recent stories

25 December 2019
A more collaborating Datawarehousing Experience with SQL query sharing via links or gists
05 December 2019
Hue 4.6 and its improvements are out!
13 November 2019
Visually surfacing SQL information like Primary Keys, Foreign Keys, Views and Complex Types