***************************
Visualize Data with Tableau
***************************
`Tableau `_ is a commercial tool for exploring and visualizing
tabular data.
In addition to making visualizations, Tableau makes it easy to select, filter, group, and describe
your data in tables. This can be easier than the same operations in a Python REPL with ``pyspark``
or ``pandas``.
This page describes various ways to connect Tableau to dsgrid data after you've installed Tableau
Desktop on your local computer.
Install Tableau
===============
Licenses are available to NREL employees. Go to theSOURCE, then IT Service Portal, Service
Catalog, search for Tableau, and submit a ticket to get ``Tableau Creator`` installed (IT will
install ``Tableau Desktop``).
Parquet files on a local computer
=================================
This can be accomplished by connecting Tableau to DuckDB.
1. Copy the Parquet files to your computer.
2. Install `DuckDB `_. You want the ``Command line``
``Environment``.
3. Install a JDBC driver and connect Tableau to DuckDB by following `DuckDB's
`_ instructions.
4. Create a view of your data as noted `here
`_. You can also import
your data from Parquet files to a DuckDB database file if you prefer.
5. Use Tableau with your DuckDB data source.
Parquet files on an HPC
========================
This can be accomplished by connecting Tableau to a Spark cluster on the HPC.
Follow the
`Spark-on-HPC instructions `_
CSV files on a local computer
=============================
1. Export the dsgrid data in Parquet files to CSV.
2. Load the CSV files directly in Tableau.