Articles & News

08 November 2013

Season II: 8. How to transfer data from Hadoop with Sqoop 2

Note: Sqoop2 is now replaced by https://gethue.com/importing-data-from-traditional-databases-into-hdfshive-in-just-a-few-clicks/

Apache Sqoop is a great tool for moving data (in files or databases) in or out of Hadoop. In Hue 3, a new app was added to make Sqoop2 easier to use. In this final episode of season 2 of the Hadoop Tutorial series (the previous one was about Search), let’s see how simple it becomes to export our Yelp results into a MySQL table!…

2 minutes read - Tutorial

04 November 2013

Season II: 7. How to index and search Yelp data with Solr

In the previous episode we saw how to use Pig and Hive with HBase. This time, let’s see how to make our Yelp data searchable by indexing it and building a customizable UI with the Hue Search app. Indexing data into Solr: This tutorial is based on SolrCloud. Here is a step-by-step guide to its installation and a list of required packages: solr-server, solr-mapreduce, search. The next step is deploying and configuring SolrCloud.…

3 minutes read - Tutorial

24 October 2013

Hive Query editor with HiveServer2 and Sentry

Hue provides a Web interface for submitting Hive queries. Hue had its own server for servicing Hive queries, called Beeswax. The more sophisticated and robust service, Apache HiveServer2, is supported as of Hue 2.5. Thanks to the HiveServer2 integration, Hue benefits from Sentry (How to configure Sentry video). In addition to the security provided, Hue’s interface becomes more consistent. For example, a user without permissions on a database or table won’t see it in the query editor or in the Metastore app.…

4 minutes read

23 October 2013

Better file formats for Impala and quick SQL in Hadoop

Impala File Formats: Using the best file format is crucial for getting great performance. This is one reason why JSON is not supported in the Impala application. Indeed, parsing or retrieving the whole text record even for a single field would hurt performance badly. Impala recommends a series of alternative formats. We show here how to create a Hive table in Avro format containing JSON data and a table in the new Parquet format.…

2 minutes read

21 October 2013

Season II: 6. Use Pig and Hive with HBase

The HBase app is an elegant way to visualize and search a lot of data. Apache HBase tables can be tricky to update as they require a lower-level API. A good alternative for simplifying data management or access is to use Apache Pig or Hive. In this post we are going to show how to load our Yelp data from the Oozie Bundles episode into HBase with Hive. Then we will use the HBase Browser to visualize it and Pig to compute some statistics.…

3 minutes read - Tutorial

20 October 2013

Stockholm HUG: Hue: The Hadoop UI

Slides: "Hue: The Hadoop UI - Stockholm HUG" from gethue. Presented by Enrico Berti (Cloudera) at Stockholm HUG @ Spotify, Oct 2013.

1 minute read

14 October 2013

Season II: 5. Bundle Oozie coordinators with Hue

Hue provides a great Oozie UI for using Oozie without typing any XML. In Tutorial 3, we demonstrated how to use an Oozie coordinator for scheduling a daily top 10 of restaurants. Now let’s imagine that we also want to compute a top 10 and 100. How can we do this? One solution is to use Oozie bundles. Workflow and Coordinator updates: Bundles are a way to group coordinators together into a set.…

3 minutes read - Tutorial

10 October 2013

How to change or reset a forgotten password?

Hue is the web interface that improves the Apache Hadoop user experience. It’s a Django-driven application and manages users accordingly. In this tutorial, we’ll be exploring the different options available for altering passwords in Hue using the default authentication backend (AllowFirstUserBackend). User Interface: Users can change their passwords via the “” -> “Edit Profile” found in the top-right corner of Hue. If a user cannot remember their password, the Hue administrator can change it for them via the user manager.…

3 minutes read - Development

04 October 2013

Move data in & out your Hadoop cluster with the Sqoop UI

Hi Ice Cream lovers, Hue, the open source Big Data UI, has a brand new application for transferring data between relational databases and Hadoop. This new application is driven by Sqoop 2 and has several user experience improvements to boot. Sqoop is a batch data migration tool for transferring data between traditional databases and Hadoop. The first version of Sqoop is a heavy client that drives and oversees data transfer via MapReduce.…

3 minutes read

More recent stories

03 May 2023
Discover the power of Apache Ozone using the Hue File Browser
23 January 2023
Hue 4.11 and its new dialects and features are out!