- All Categories
- Browsing (3)
- Community (1)
- Development (59)
- Flink sql (4)
- Hive (1)
- Impala (1)
- Ksqldb (3)
- Phoenix (3)
- Query (8)
- Release (32)
- Spark sql (3)
- Trino (1)
- Tutorial (19)
- Version 4 (78)
- Version 4.10 (11)
- Version 4.11 (11)
- Version 4.9 (7)
- Version 5.0 (2)
04 October 2015
Hadoop World Spark Meetup: Interactive Spark in your Browser
Apache Spark Lightning Night - New York Hue now offers a Notebook application for interactively processing, visualizing and sharing data. Through a new Spark REST Job Server, Spark Python and Scala shells are available as well as Streaming. Those are ideal for doing some quick big data crunching from anywhere or a Web browser! This talk details the architecture of the REST API and Notebook UI as well as their integration with the Hadoop ecosystem.…
1 minute read -25 September 2015
Bay Area bike share analysis with the Hadoop Notebook and Spark & SQL
In a previous post, we demonstrated how to use Hue's Search app to seamlessly index and visualize trip data from Bay Area Bike Share and use Spark to supplement that analysis by adding weather data to our dashboard. In this tutorial, we'll use the Notebook app to study deeper the peak usage of the Bay Area Bike Share (BABS) system. To start, download the latest data set from ( http://www.bayareabikeshare.com/datachallenge</a></del> the original website doesn't have the data anymore) https://github.…
6 minutes read -24 September 2015
How to use the Livy Spark REST Job Server API for doing some interactive Spark with curl
Livy is an open source REST interface for using Spark from anywhere. Note: Livy is not supported in CDH, only in the upstream Hue community. It supports executing snippets of code or programs in a Spark Context that runs locally or in YARN. This makes it ideal for building applications or Notebooks that can interact with Spark in real time. For example, it is currently used for powering the Spark snippets of the Hadoop Notebook in Hue.…
5 minutes read - Development10 September 2015
LDAP or PAM pass-through authentication with Hive or Impala and Impersonation
Hue is a server between users logged in their browsers and the respective Hadoop services. Consequently, Hue is seen as a single ‘hue’ user by the other servers. Impersonation is used in order to still apply the permissions of the real logged-in user. For example when a user ‘bob’ submits a query, Hue also sends the username of this user and HiveServer2 will use ‘bob’ and not ‘hue’ as the owner of the query.…
2 minutes read -09 September 2015
Storing passwords in file script rather than in hue.ini configuration
This article details how to store passwords in a script launched from the OS rather than have clear text passwords in the hue*.ini files. Some passwords go in Hue ini configuration file making them easily visible to Hue admin user or by users of cluster management software. You can use the password_script feature to prevent passwords from being visible. Prior to 3.8, Hue only supported clear text passwords in all the Hue configuration files.…
4 minutes read -02 September 2015
Mini how-to: disabling some apps from showing up
In the Hue ini configuration file, in the [desktop] section, you can enter the names of the app to hide: [desktop] \# Comma separated list of apps to not load at server startup. app_blacklist=beeswax,impala,security,filebrowser,jobbrowser,rdbms,jobsub,pig,hbase,sqoop,zookeeper,metastore,spark,oozie,indexer And the main menu will go from to You can see a live demo of enabling only the Search App. Note Some apps might currently still depend on another app and you will get an error if you disable them.…
1 minute read -28 August 2015
Mini task: Configure Hue with a Proxy
We explained how to run Hue with NGINX serving the static files or under Apache. If you use another proxy, you might need to set these options: [desktop] \# Enable X-Forwarded-Host header if the load balancer requires it. use_x_forwarded_host=false \# Support for HTTPS termination at the load-balancer level with SECURE_PROXY_SSL_HEADER. secure_proxy_ssl_header=false
1 minute read -24 August 2015
Hue 3.9 with some general overall improvements is out!
Hi Big Data Aficionados, The Hue Team is glad to thanks all the contributors and release Hue 3.9! The focus of this release was to improve the experience everywhere (no new app were added) and the stability. More than 700 commits on top of 3.8 are in and some apps like the Notebook Editor and Spark Job Server got a serious lift! Go grab the tarball release and give it a spin!…
4 minutes read - Release24 August 2015
Improved HBase cell editor with history in Hue 3.9
For Hue 3.9 we've made some improvements to the HBase Browser that will make your life easier when editing cell contents. In the full editor the cell history is now shown on the right-hand side and selecting an old entry will allow you to revert the current contents to that specific version. You can also look at historical entries while editing without loosing the current edited value, the current version is always shown on top of the list.…
1 minute read -24 August 2015
Hadoop / Spark Notebook and Livy REST Job Server improvements!
The Notebook application as well as the REST Spark Job Server are being revamped. These two components goals are to let users execute Spark in their browser or from anywhere. They are still in beta but next version of Hue will have them graduate. Here are a list of the improvements and a video demo: Revamp of the snippets of the Notebook UI Support for Spark 1.3, 1.4, 1.5 Impersonation with YARN Support for R shell Support for submitting jars or python apps How to play with it?…
2 minutes read -