- Version 4
- All Categories
- Browsing (3)
- Community (1)
- Development (59)
- Flink sql (4)
- Hive (1)
- Impala (1)
- Ksqldb (3)
- Phoenix (3)
- Query (8)
- Release (32)
- Spark sql (3)
- Trino (1)
- Tutorial (19)
- Version 4 (78)
- Version 4.10 (11)
- Version 4.11 (11)
- Version 4.9 (7)
- Version 5.0 (2)
16 April 2019
2x Faster page load time with the new bundling of JavaScript files
Fast page loads are notorious for providing a much nicer user experience. Who likes to wait or have a slow display of the page? In the continuous offer to render Hue snappier and at the same time modernize and improve the developer experience, JavaScript files are now bundled together and are importer only when needed (under the cover this is done by leveraging https://webpack.js.org/). This is a lot of requests not happening anymore when opening Hue the first time!…
1 minute read - Version 412 April 2019
Documentation revamp: making Hue easier to install, use and develop
Making Hue easier to install, use and develop with a modern documentation. While all the efforts on the improving the core SQL Experience are important, not communicating clearly about it is not very efficient. In consequence, the documentation website was rewritten and its content is now being continuously refreshed. Before Can be seen in older version: http://cloudera.github.io/hue/docs-4.3.0/ After Can be seen in latest: https://docs.gethue.com/ Table of content It was unified in the left navigation, and we now see the four sections within the same page:…
2 minutes read - Version 410 April 2019
How to enable new user and create Hive tables on a Kerberized secure cluster with Apache Sentry
It can be tricky to grant a new user proper permissions on a secure cluster, let's walk through the steps to enable any new user for table creation on a kerberized cluster. Depends on your cluster size, creating user and group on each node can be tedious. Here we use pssh (Parallel ssh) for this task. 1. Install the tool and prepare a file which contains all your hosts. For Mac user:…
3 minutes read - Version 401 April 2019
Hive on Tez integrations improvements
We’ve made some improvements when using Apache TEZ as the query engine of the SQL Editor: When running a query, the job id will now show up in the query log. Pressing the id will show the job in the mini job browser. TEZ, does not update its progress in the log, but if you’ve opened the mini job browser, Hue will be able to update the job’s progress right in the editor.…
1 minute read - Version 428 March 2019
Hue 4.4 and its improvements are out!
Hi Big Data Explorers, The Hue Team is glad to thanks all the contributors and release Hue 4.4! The focus of this release was to improve the self service SQL troubleshooting and stability. This release comes with 450 commits and 80+ bug fixes! For all the changes, check out the release notes. Go grab the tarball or source, and give it a spin! And for a quick try, ‘docker pull gethue/4.…
3 minutes read - Version 4 / Release26 March 2019
Quick Task: Document Count Check
When Hue database has too many entries in certain tables, it will cause performance issue. Now Hue config check will help superuser to find this issue. Login as superuser and go to “Hue Administration”, this sample screenshot will be displayed in the quick start wizard when the tables have too many entries. Warning: Hue database Document2 has too many entries which may cause performance issue, please run command line tool to clean up.…
1 minute read - Version 426 March 2019
Quick Task: Fixing “ImportError: No module named google_compute_engine” when building Hue
When building Hue on a Google Compute Engine machine, you might it this issue: ImportError: No module named google_compute_engine with this full trace: creating 'dist/kombu-4.3.0-py2.7.egg' and adding 'build/bdist.linux-x86_64/egg' to it removing 'build/bdist.linux-x86_64/egg' (and everything under it) - Building egg for boto-2.46.1 Traceback (most recent call last): File "", line 1, in File "/home/romain/hue/build/env/local/lib/python2.7/site-packages/setuptools/sandbox.py", line 253, in run_setup raise File "/usr/lib/python2.7/contextlib.py", line 35, in __exit__ self.gen.throw(type, value, traceback) File "/home/romain/hue/build/env/local/lib/python2.7/site-packages/setuptools/sandbox.py", line 195, in setup_context yield File "…
2 minutes read - Version 422 March 2019
Quick Task: How to query Apache Druid analytic database
Self-service exploratory analytics is one of the most common use cases of the Hue users. While deeply integrated with Apache Impala and Apache Hive, Hue also lets you take advantage of its smart editor and assistants with any databases. In this tutorial, let's see how to query Apache Druid. Apache Druid is an “OLAP style” database. If not already running, it is easy to get Druid downloaded and started. In our case we will just query the provided Wikipedia data sample.…
2 minutes read - Version 422 March 2019
Querying & Exploring the Instacart dataset Part 1: Ingesting the data
Self-service exploratory analytics is one of the most common use cases of the Hue users. In this tutorial, let's see how to get started on the analysis. We will use the free Instacart dataset and start with the Importer feature. Getting the data This steps was made particularly easy by Instacart. Just go on their dataset page of 3 million orders and download the 200 MBs. Making it queryable Next step is not always trivial.…
2 minutes read - Version 411 March 2019
Self service Impala SQL query troubleshooting
Greetings Data Explorers! We’ve been looking for ways to help debug performance issues with your Impala queries. Let’s preface by saying that this new feature is a work in progress and we’re looking to add more with the help of your feedback. After finding data in the Catalog and using the Query Assistant, end users might wonder why their queries are taking a lot of time to execute. Build up on top of the Impala profiler, this new feature educates them and surface up more information so that they can be more productive by themselves.…
7 minutes read - Version 4