*************************** Visualize Data with Tableau *************************** `Tableau `_ is a commercial tool for exploring and visualizing tabular data. In addition to making visualizations, Tableau makes it easy to select, filter, group, and describe your data in tables. This can be easier than the same operations in a Python REPL with ``pyspark`` or ``pandas``. This page describes various ways to connect Tableau to dsgrid data after you've installed Tableau Desktop on your local computer. Install Tableau =============== Licenses are available to NREL employees. Go to theSOURCE, then IT Service Portal, Service Catalog, search for Tableau, and submit a ticket to get ``Tableau Creator`` installed (IT will install ``Tableau Desktop``). Parquet files on a local computer ================================= This can be accomplished by connecting Tableau to DuckDB. 1. Copy the Parquet files to your computer. 2. Install `DuckDB `_. You want the ``Command line`` ``Environment``. 3. Install a JDBC driver and connect Tableau to DuckDB by following `DuckDB's `_ instructions. 4. Create a view of your data as noted `here `_. You can also import your data from Parquet files to a DuckDB database file if you prefer. 5. Use Tableau with your DuckDB data source. Parquet files on an HPC ======================== This can be accomplished by connecting Tableau to a Spark cluster on the HPC. Follow the `Spark-on-HPC instructions `_ CSV files on a local computer ============================= 1. Export the dsgrid data in Parquet files to CSV. 2. Load the CSV files directly in Tableau.