Meritshot Tutorials

  1. Home
  2. »
  3. Creating and Refreshing Extracts in Tableau

Tableau Tutorial

Creating and Refreshing Extracts in Tableau

Data extracts in Tableau are subsets of your original data source that are saved as a .hyper or .tde file. Extracts allow you to work with large datasets more efficiently by improving performance, enabling offline analysis, and supporting advanced data features. Creating and refreshing extracts are essential processes that keep your data analysis up-to-date and fast.

Creating Extracts

  1. Connect to Data Source:
    • Open Tableau and connect to your desired data source (e.g., Excel, SQL Server, etc.).
    • After connecting, ensure your data is clean and well-structured for analysis.
  2. Choose the Extract Option:
    • Once your data source is connected, go to the “Data” menu and select your data source.
    • Click on “Extract Data” to start the process. This option is usually found under the data source’s name.
  3. Configure Extract Options:
    • Filter Data: You can filter the data to include only the relevant rows needed for your analysis. For example, you might extract data for the last two years instead of the entire dataset.
    • Aggregate Data: If you only need summary information, you can aggregate the data during the extract creation. This reduces the size of the extract and improves performance.
    • Roll-up Dates: Choose to roll up date fields to specific levels (e.g., year, quarter) to further reduce the extract size.
  4. Save and Create Extract:
    • After configuring the options, click “Extract.” Tableau will create the extract file (.hyper or .tde) and save it locally on your computer.
    • You can now use this extract as your data source in Tableau, which will be faster to load and process than the original live connection.

Refreshing Extracts

  1. Why Refresh Extracts?:
    • Refreshing extracts updates your data with the latest changes from the original data source. This ensures your analysis reflects the most current information without having to recreate the entire extract.
  2. Manual Refresh:
    • To manually refresh an extract, go to the “Data” menu in Tableau, select your data source, and click on “Extract” -> “Refresh.” This pulls in the latest data from the original source and updates the extract file.
  3. Scheduled Refresh:
    • If you publish your workbook to Tableau Server or Tableau Online, you can schedule regular refreshes for your extracts. This is particularly useful for automated, up-to-date reporting.
    • To set up a scheduled refresh, navigate to the published workbook on Tableau Server or Tableau Online, and set the refresh frequency (e.g., daily, weekly).

Benefits of Using Extracts

  1. Improved Performance:
    • Extracts load faster and are more responsive than live data connections, especially with large datasets. This is crucial when working with complex calculations or multiple visualizations.
  2. Offline Analysis:
    • Since extracts are saved locally, you can work with your data even when you’re not connected to the original data source. This enables analysis in offline environments.
  3. Data Consistency:
    • Extracts allow you to work with a consistent snapshot of your data, ensuring that your analysis is based on a fixed dataset. This is useful when you need to ensure that all stakeholders are viewing the same data.
  4. Advanced Features:
    • Some Tableau features, like using Tableau Prep for data cleaning and transformation, are optimized for extracts. This allows for more advanced data manipulation and preparation before analysis.

Frequently Asked Questions

  1. Q: Can I create an extract for a specific subset of my data?

A: Yes, you can apply filters when creating an extract to include only the relevant data you need. This helps reduce the size of the extract and improves performance.

  1. Q: What’s the difference between a .hyper and a .tde file?

A: .hyper files are the newer format for Tableau extracts, offering better performance and support for larger datasets compared to the older .tde format.

  1. Q: How often should I refresh my extracts?

A: The frequency of refresh depends on how often your data changes and how critical it is to have up-to-date information. Common refresh schedules include daily or weekly updates.

  1. Q: Can I automate the refresh of extracts?

A: Yes, you can schedule automated refreshes on Tableau Server or Tableau Online, ensuring your data stays up-to-date without manual intervention.

  1. Q: What happens if I forget to refresh my extract?

A: If you don’t refresh your extract, your analysis will be based on outdated data, which could lead to inaccurate insights and decisions. Regular refreshes are important to maintain data accuracy.