Articles & News

01 October 2013

Group Synchronization Backends in Hue

Hueis the turn-key solution for Apache Hadoop. It hides the complexity of the ecosystem including HDFS, Oozie, MapReduce, etc. Hue provides authentication and integrates with SAML, LDAP, and other systems. A new feature added in Hue is the ability to synchronize groups with a third party authority provider. In this blog post, we’ll be covering the basics of creating a Group Synchronization Backend.   The Design The purpose of the group synchronization backends are to keep Hue’s internal group lists fresh.…

3 minutes read - Development

27 September 2013

Season II: 4. Fast SQL with the Impala Query Editor

In the previous episodes, we presented how to schedule repetitive worflows on the grid with Oozie Coordinator. Let’s now look at a fast way to query some data with Impala. Hue, the Hadoop UI, has been supporting Impala closely since its first version and brings fast interactive queries within your browser. If not familiar with Impala, we recommend you to check the documentation of the fastest SQL engine for Hadoop. Impala App Most of Hive SQL is compatible with Impala and we are going to compare the queries of episode one in both Hive and Impala applications.…

3 minutes read - Tutorial

25 September 2013

SSO with Hue: new SAML backend

A new SAML 2.0 backend is under development to provide a better authentication experience in Hue. Rather than signing in using Hue credentials, single sign-on (SSO) authentication can be achieved with this new backend. The Basics In SAML 2.0, there are 2 basic components: the Service Provider (SP) and Identity Provider (IdP). The typical flow from SP to IdP is made obvious in following image. SAML architecture from http://en.…

4 minutes read - Development

23 September 2013

Hue 3 and the new Sqoop and ZooKeeper apps are out!

Hi Big Data Surfers, The Hue team is pleased to bring you Hue 3.0, available as a tarball{.trackLink} release. Please see the release notes for more information and the documentation. Hue 3.0 brings a browser for ZooKeeper and 245 commits. The tech stack was upgraded and a new phase of redesign was performed in order to improve the UX. Notable Features Sqoop With the Sqoop 2 application data from databases can be easily exported or imported into HDFS in a scalable way.…

2 minutes read - Release

18 September 2013

Season II: 3. Schedule Hive queries with Oozie coordinators

In the previous episode we saw how to create an Hive action in an Oozie workflow. These workflows can then be repeated automatically with an Oozie coordinator. This post describes how to schedule Hadoop jobs (e.g. run this job everyday at midnight). Oozie Coordinators Our goal: compute the 10 coolest restaurants of the day everyday for 1 month:  From episode 2, now have a workflow ready to be ran everyday.…

2 minutes read - Tutorial

11 September 2013

Season II: 2. Execute Hive queries and schedule them with Oozie

In the previous episode, we saw how to to transfer some file data into Apache Hadoop. In order to interrogate easily the data, the next step is to create some Hive tables. This will enable quick interaction with high level languages like SQL and Pig.   We experiment with the SQL queries, then parameterize them and insert them into a workflow in order to run them together in parallel.…

5 minutes read - Tutorial

05 September 2013

Season II: 1. Prepare the data for analysis with Pig and Python UDF

Welcome to season 2 of the Hue video series. In this new chapter we are going to demonstrate how Hue can simplify Hadoop usage and lets you focus on the business and less about the underlying technology. In a real life scenario, we will use various Hadoop tools within the Hue UI and explore some data and extract some competitive advantage insights from it.   Let’s go surf the Big Data wave, directly from your Browser!…

3 minutes read - Tutorial

23 August 2013

The Web UI for HBase: HBase Browser

In this post, we’ll take a look at the new HBase Browser App added in Hue 2.5.  If you want to learn how to create various tables in HBase, go look at episode 1! Prerequisites before starting Hue: Have Hue built or installed Have HBase and Thrift Service 1 initiated (Thrift can be configured through Cloudera Manager or manually). Look at the HBase service configuration and check to see if the “Enable HBase Thrift Server Framed Transport” property is enabled.…

5 minutes read -

22 August 2013

NoSql 2013, Hue: Starting Point for Exploration and Real Time Interaction with Hadoop

NoSql 2013, Hue: Starting Point for Exploration and Real Time Interaction with Hadoop Hue: Starting Point for Exploration and Real Time Interaction with Hadoop Thursday, August 22, 2013 11:15 AM - 11:45 AM Level: Technical - Introductory Apache Hadoop opens up many data crunching possibilities to the enterprise but also brings a lot of complexity: job and query management, XML configurations, file operations… take place on the command line or in high learning curve BI tools.…

1 minute read -

22 August 2013

Y! HUG August 2013, Hue: the Hadoop UI

Y! HUG August 2013, Hue: the Hadoop UI Hue: the Hadoop UI with tutorials. Session II: Hue: the UI for Apache Hadoop Hue is an open source, Web-based interface that makes Apache Hadoop easier to use. Hue’s target is the Hadoop user experience and lets users focus on quick data processing. Hue is a mature Web project that integrates into a single UI the Hadoop components and their main satellite projects.…

2 minutes read -

More recent stories

03 May 2023
Discover the power of Apache Ozone using the Hue File Browser
Read More
23 January 2023
Hue 4.11 and its new dialects and features are out!
Read More