Bay Area BikeShare Data Analysis with Search and Spark Notebook

In this tutorial, we use public data from Bay Area BikeShare to visualize bike trip patterns and their riders, in order to better understand how the platform is used. Hue provides a Dynamic Search dashboard as well as the new Spark Notebook for enriching the data.

We recommend starting with the Trip dataset and indexing it into Solr. For the impatient, we provide a subset of trips ready to be indexed, as well as the weather data to be processed later with Spark. The Search Dashboard is available for download, and the Notebook can be downloaded and imported with Hue 3.9 or simply copied and pasted.


This demo, combined with real-time Spark Streaming, has been presented at conferences such as Hadoop Summit and Big Data Day LA.

Happy Biking!




Example of an interactive dashboard created by drag and drop


As usual, feel free to comment on the hue-user list or @gethue!



A quick way to index the data with Solr:

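bin/solr create_collection -c bikes

# $u is not defined in the original post. Assumption: a local Solr on the
# default port 8983, with $u pointing at the update endpoint of "bikes".
u='http://localhost:8983/solr/bikes/update?commit=true'

curl "$u" --data-binary @/home/test/index_data.csv -H 'Content-type:text/csv'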
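
After indexing, it can be useful to check that the documents actually landed in the collection. The following is a minimal sketch, not part of the original demo: it assumes Solr is reachable at localhost on the default port 8983 and that the collection is named bikes, and the helper names select_url and count_docs are hypothetical. It queries the standard select handler with rows=0, so only the match count (numFound) is returned rather than the documents themselves.

```shell
# Assumed values (not from the original post): adjust host, port and collection.
SOLR_URL='http://localhost:8983/solr'
COLLECTION='bikes'

# Hypothetical helper: builds a match-all query URL that returns no rows,
# so the JSON response only carries the total count in "numFound".
select_url() {
  printf '%s/%s/select?q=*:*&rows=0&wt=json' "$SOLR_URL" "$COLLECTION"
}

# Hypothetical helper: sends the query and prints Solr's raw JSON response.
count_docs() {
  curl -s "$(select_url)"
}
```

Running count_docs once the curl upload has finished should print a JSON response whose numFound field matches the number of rows in the CSV file, not counting the header line.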


  1. Varun 2 years ago

    I’m using HDP 2.5 and Hue 3.11, and I’m deploying a Hue Solr dashboard project.
    1. Is there any way to add labels or custom labels to a bar chart?
    2. Is there any way to raise the limit above the top 10 for all widgets? E.g., the Marker Map shows only 10 counts.
    3. Is there any way to add external plugins to a Hue dashboard?

  2. ada 2 years ago

    I clicked "add new dashboard" in hue/search, but it returns the error below:
    HTTPConnectionPool(host='name1', port=8983): Max retries exceeded with url: /solr/admin/cores? (Caused by NewConnectionError(': Failed to establish a new connection: [Errno 111] Connection refused',))
