- All Categories
- Browsing (3)
- Community (1)
- Development (59)
- Flink sql (4)
- Hive (1)
- Impala (1)
- Ksqldb (3)
- Phoenix (3)
- Query (8)
- Release (32)
- Spark sql (3)
- Trino (1)
- Tutorial (19)
- Version 4 (78)
- Version 4.10 (11)
- Version 4.11 (11)
- Version 4.9 (7)
- Version 5.0 (2)
04 October 2013
Move data in & out your Hadoop cluster with the Sqoop UI
Hi Ice Cream lovers, Hue, the open source Big Data UI, has a brand new application that enables transferring data between relational databases and Hadoop. This new application is driven by Sqoop 2 and has several user experience improvements to boot. Sqoop is a batch data migration tool for transferring data between traditional databases and Hadoop. The first version of Sqoop is a heavy client that drives and oversees data transfer via MapReduce.…
3 minutes read -01 October 2013
Group Synchronization Backends in Hue
Hueis the turn-key solution for Apache Hadoop. It hides the complexity of the ecosystem including HDFS, Oozie, MapReduce, etc. Hue provides authentication and integrates with SAML, LDAP, and other systems. A new feature added in Hue is the ability to synchronize groups with a third party authority provider. In this blog post, we’ll be covering the basics of creating a Group Synchronization Backend. The Design The purpose of the group synchronization backends are to keep Hue’s internal group lists fresh.…
3 minutes read - Development27 September 2013
Season II: 4. Fast SQL with the Impala Query Editor
In the previous episodes, we presented how to schedule repetitive worflows on the grid with Oozie Coordinator. Let’s now look at a fast way to query some data with Impala. Hue, the Hadoop UI, has been supporting Impala closely since its first version and brings fast interactive queries within your browser. If not familiar with Impala, we recommend you to check the documentation of the fastest SQL engine for Hadoop. Impala App Most of Hive SQL is compatible with Impala and we are going to compare the queries of episode one in both Hive and Impala applications.…
3 minutes read - Tutorial25 September 2013
SSO with Hue: new SAML backend
A new SAML 2.0 backend is under development to provide a better authentication experience in Hue. Rather than signing in using Hue credentials, single sign-on (SSO) authentication can be achieved with this new backend. The Basics In SAML 2.0, there are 2 basic components: the Service Provider (SP) and Identity Provider (IdP). The typical flow from SP to IdP is made obvious in following image. SAML architecture from http://en.…
4 minutes read - Development23 September 2013
Hue 3 and the new Sqoop and ZooKeeper apps are out!
Hi Big Data Surfers, The Hue team is pleased to bring you Hue 3.0, available as a tarball{.trackLink} release. Please see the release notes for more information and the documentation. Hue 3.0 brings a browser for ZooKeeper and 245 commits. The tech stack was upgraded and a new phase of redesign was performed in order to improve the UX. Notable Features Sqoop With the Sqoop 2 application data from databases can be easily exported or imported into HDFS in a scalable way.…
2 minutes read - Release18 September 2013
Season II: 3. Schedule Hive queries with Oozie coordinators
In the previous episode we saw how to create an Hive action in an Oozie workflow. These workflows can then be repeated automatically with an Oozie coordinator. This post describes how to schedule Hadoop jobs (e.g. run this job everyday at midnight). Oozie Coordinators Our goal: compute the 10 coolest restaurants of the day everyday for 1 month: From episode 2, now have a workflow ready to be ran everyday.…
2 minutes read - Tutorial11 September 2013
Season II: 2. Execute Hive queries and schedule them with Oozie
In the previous episode, we saw how to to transfer some file data into Apache Hadoop. In order to interrogate easily the data, the next step is to create some Hive tables. This will enable quick interaction with high level languages like SQL and Pig. We experiment with the SQL queries, then parameterize them and insert them into a workflow in order to run them together in parallel.…
5 minutes read - Tutorial05 September 2013
Season II: 1. Prepare the data for analysis with Pig and Python UDF
Welcome to season 2 of the Hue video series. In this new chapter we are going to demonstrate how Hue can simplify Hadoop usage and lets you focus on the business and less about the underlying technology. In a real life scenario, we will use various Hadoop tools within the Hue UI and explore some data and extract some competitive advantage insights from it. Let’s go surf the Big Data wave, directly from your Browser!…
3 minutes read - Tutorial23 August 2013
The Web UI for HBase: HBase Browser
In this post, we’ll take a look at the new HBase Browser App added in Hue 2.5. If you want to learn how to create various tables in HBase, go look at episode 1! Prerequisites before starting Hue: Have Hue built or installed Have HBase and Thrift Service 1 initiated (Thrift can be configured through Cloudera Manager or manually). Look at the HBase service configuration and check to see if the “Enable HBase Thrift Server Framed Transport” property is enabled.…
5 minutes read -22 August 2013
NoSql 2013, Hue: Starting Point for Exploration and Real Time Interaction with Hadoop
NoSql 2013, Hue: Starting Point for Exploration and Real Time Interaction with Hadoop Hue: Starting Point for Exploration and Real Time Interaction with Hadoop Thursday, August 22, 2013 11:15 AM - 11:45 AM Level: Technical - Introductory Apache Hadoop opens up many data crunching possibilities to the enterprise but also brings a lot of complexity: job and query management, XML configurations, file operations… take place on the command line or in high learning curve BI tools.…
1 minute read -