Articles & News

10 January 2021

SQL Querying a log stream and outputting Calculations to another stream

Initially published on https://medium.com/data-querying/live-sql-querying-live-logs-and-sending-live-updates-easily-e6297150cf92 Log analysis tutorial from an Apache Kafka data stream via Flink SQL, ksqlDB & Hue Editor. Real time queries on streams on data is a modern way to perform powerful analyses as demoed in the previous post. This time we will see a more personalized scenario by querying our own logs generated in the Web Query Editor. First, thank you to the community for all the improvements on the open source projects mentioned below, with in particular Flink Version 1.…

3 minutes read - Querying / Tutorial / Version 4.9

31 December 2020

A Spark SQL Editor via Hue and the Spark SQL Server

Write and Execute some Spark SQL quickly in your own Web Editor. Initially published on https://medium.com/data-querying/a-sparksql-editor-via-hue-and-the-spark-sql-server-f82e72bbdfc7 Apache Spark is popular for wrangling/preparing data, especially when embedding some SQL snippets to keep the data manipulation programs declarative and simpler. One good news is that the SQL syntax is very similar to Apache Hive so the very powerful Hive autocomplete of Hue works very well. Here we will describe how to integrate with the Spark SQL Thrift Server interface that might be already available in your stack.…

3 minutes read - Querying / Tutorial / Version 4.9

17 November 2020

Easy Querying of live Kafka data in a Big Table like HBase with SQL

Easy Querying of live Kafka data in a Big Table like HBase with SQL Initially published on https://medium.com/data-querying/phoenix-brings-sql-to-hbase-and-let-you-query-kafka-data-streams-8fd2edda1401 Apache HBase is a massive key value database in the Big Table family. It excels in random read/write and is distributed. The Hue Query Assistant is a versatile SQL compose Web application with a goal of making database querying easy and ubiquitous within organizations. In this post we will demo the recent integration of Apache Phoenix which provides a SQL interfacing layer to HBase, hence making it easy to query.…

4 minutes read - Querying / Tutorial / Version 4.9

20 October 2020

Tutorial on querying live streams of data with Flink SQL

Real time data querying is becoming a modern standard. Who wants to wait until the next day or week when needing to take decision now? Apache Flink SQL is an engine now offering SQL on bounded/unbounded streams of data. The streams can come from various sources and here we picked the popular Apache Kafka. This tutorial is based on the great Flink SQL demo Building an End-to-End Streaming Application but focuses on the end user querying experience.…

4 minutes read - Querying / Tutorial / Version 4.9

20 October 2020

Tutorial on querying live streams of data with ksql (Kafka SQL)

Real time data querying is becoming a modern standard. Who wants to wait until the next day or week when needing to take decision now? The stream of data comes from an Apache Kafka topic which can be queried via ksqlDB. Components To keep things simple, all the pieces have been put together in a “one-click” Docker Compose project which contains: ksqlDB from the ksqlDB quickstart A Hue Editor already configured with the ksqlDB Editor One-line setup For fetching the configurations and starting everything:…

3 minutes read - Querying / Tutorial / Version 4.9

30 September 2020

Quick checking Hue's SQL Editor connections to Databases in Kubernetes

Let's see one way to quickly check why Hue can't connect to a SQL Database in the Kubernetes world. We previously documented how to run Hue in Kubernetes or Docker. The concept is to use the Database specific command shell on the command line to try to send some queries. Apache Hive is the Data Warehouse used as example (but this would work the same with psql, mysql, …) or any other Database Hue can connect to.…

2 minutes read - Administration / Version 4.9

23 September 2020

Hue 4.8 and its improvements are out!

Hi Data Explorers, The Hue Team is glad to thanks all the contributors and release Hue 4.8!  This release brings all these improvements on top of 4.7: SQL Editor Initial support of Apache Flink, Phoenix, SparkSql SQL dialects Parser refactoring to make Apache Calcite grammar improvements easier Entity Relationship Diagram of SQL tables with Foreigh Keys as links to other tables Improved collaboration with Query sharing and SQL Gist API New components framework to make pieces clean and reusable ERD Diagram of SQL tables Parser: Reuse the JavaScript module to tokenize SQL queries REST API: invoke actions like ‘Execute SQL query’, ‘List or download files on S3’ How to login with Apache Knox in a secure cluster Development Docker / Kubernetes / Grafana config were refreshed Continuous Integration: Python lint, commit messages checks, JavaScript licenses Onboarding experience simplified Preview Editor v2, Connectors, Python 3 are in beta It has more than 600+ commits and 200+ bug fixes!…

2 minutes read - Version 4.8 / Release

15 September 2020

SQL Querying Improvements: Phoenix, Flink, SparkSql, ERD Table...

Hi Data Crunchers, Are you looking at executing your SQL queries more easily? Here is a series of various querying improvements coming in the next release of Hue! New Databases Hue is getting a more polished experience with Apache Phoenix, Apacke Flink SQL and Apache Spark SQL (via Apache Livy). Apache Phoenix Apache Phoenix makes it easy to query the Apache HBase database via SQL. Now the integration is fully working out of the box and several corner cases (e.…

4 minutes read - Version 4.8 / Querying

14 September 2020

REST API for sending SQL queries and Browsing files

Hi Data App Builders, Are you looking at executing some SQL queries of Browsing S3 files programatically? (so that it can be automated for example) The Hue development flow continues to mature (Docker Quick Start, improved CI, shareable Web Components…) and is now getting more help on how to reuse its API. Concept The REST API is not properly public yet and can (will) be simplified in the current work in progress HUE-1450.…

7 minutes read - Version 4.8 / Development

22 August 2020

Hue in Kubernetes

Let’s see how to automate the deployment further and run this into a container orchestration like Kubernetes.  We previously documented the Hue Docker image that provides the latest Hue web server into a “box”. Hue ships with a Helm chart to make it really easy to get started. Helm is a package manager for Kubernetes. More advanced, directly use the yaml config files. In a Shell, after installing Helm, just execute these three instructions to boot a live Hue with its database:…

1 minute read - Administration / Version 4.8

More recent stories

10 January 2021
SQL Querying a log stream and outputting Calculations to another stream
Read More
31 December 2020
A Spark SQL Editor via Hue and the Spark SQL Server
Read More
17 November 2020
Easy Querying of live Kafka data in a Big Table like HBase with SQL
Read More