Query Diagnosis and Troubleshooting

Overview

If a query is not running as fast as desired, there are several areas that can be investigated for bottlenecks or restrictions. This blog post describes a comprehensive approach to debugging and optimizing queries.

The sections that follow cover areas to investigate and techniques for diagnosing performance issues.

Query EXPLAIN

The EXPLAIN plan for a query shows how the data will be accessed, and can reveal opportunities to improve query performance by changing index configurations, joins, query structure, and more, before the query is ever run. From the plan, you may see that one or more of these refinements could improve performance:

  • Change a primary index to better support the query pattern (see the sketch after this list).
  • Add more filters to the WHERE and JOIN clauses, even if they seem redundant.
    • Additional criteria give Firebolt more flexibility to approach the query differently and filter as aggressively as possible. This is especially true in outer joins.
  • Create aggregating indexes.
  • Change the query to use deep filters.
  • Use hashing (CITY_HASH) on JOIN or WHERE columns.
    • Large strings can hurt performance; in general, prefer working with numbers wherever possible. If a column holds large strings (more than 20 characters), the values can be converted to a hash (either at load time or at query time), and the compact hash number can then be used in place of the large string in as many operations as possible.
    • CITY_HASH can also be used to generate a single hash out of compound columns, reducing multiple join keys to one:

        FROM table_a a INNER JOIN table_b b
          ON CITY_HASH(a.a, a.b, a.c, a.d) = CITY_HASH(b.a, b.b, b.c, b.d)
  • Materialize CTEs.
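A minimal sketch of a few of these refinements, assuming a hypothetical events fact table (all table and column names here are illustrative, and the aggregating-index syntax differs across engine versions - some require CREATE AND GENERATE AGGREGATING INDEX):

-- A primary index chosen to match the most common filter columns;
-- changing a primary index requires recreating the table.
CREATE FACT TABLE events (
    event_date  DATE,
    event_type  TEXT,
    user_key    TEXT
) PRIMARY INDEX event_date, event_type;

-- Inspect the plan before running the query.
EXPLAIN
SELECT event_type, count(*)
FROM events
WHERE event_date >= '2022-04-01'
GROUP BY event_type;

-- An aggregating index that precomputes the same aggregation.
CREATE AGGREGATING INDEX events_by_type_agg ON events (
    event_date,
    event_type,
    count(*)
);

-- Hash a long string key once at load time, so later joins and filters
-- can use the compact number instead of the string. events_hashed is a
-- hypothetical target table with columns (user_key_hash, event_type, event_date).
INSERT INTO events_hashed
SELECT
    CITY_HASH(user_key) AS user_key_hash,
    event_type,
    event_date
FROM events;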

Query statistics in query history

Query history contains information about what resources were used and how much time was spent in the various steps of query execution. This information can provide insight into where a query spent its time.

Some key metrics for a query can be retrieved from query history using a query like this:

SELECT
    start_time
    , end_time
    , status
    , duration_usec/1000000 as duration_secs
    , time_in_queue_ms
    , query_optimization_us/1000 as q_opt_ms
    , query_id
    , query_text
    , query_text_normalized_hash
    , error_message
    , round(cpu_usage_us/1000) as cpu_usage_ms
    , round(cpu_delay_us/1000) as cpu_delay_ms
    , round(total_ram_consumed/1000000000,3) as total_ram_gb
    , round(scanned_bytes/1000000000,3) as scanned_gb
    , round(scanned_bytes_cache/1000000000,3) as scanned_cache_gb
    , round(scanned_bytes_storage/1000000000,3) as scanned_storage_gb
    , scanned_rows
    , inserted_rows
    , user_id
FROM information_schema.query_history
WHERE LOWER(query_text) not like '%query_history%'
AND LOWER(query_text) not like '%running_queries%'
AND lower(query_text) like '%<an identifying text pattern for the query>%'
AND status != 'STARTED_EXECUTION'
AND start_time BETWEEN '2022-04-21 15:49:00' and '2022-04-21 15:51:00'  --UTC time range for query
ORDER BY start_time

Some of the columns most important to performance are:

  • round(cpu_delay_us/1000) as cpu_delay_ms
    • A high number here means the query waited for CPU resources. It can indicate that other queries were running at the same time and the CPU was 100% utilized for part of the query's execution.
  • round(scanned_bytes_storage/1000000000,3) as scanned_storage_gb
    • A high number here indicates that the data was not warm when the query was run. The data needed by the query was not available in memory or SSD cache, and had to be retrieved from F3, which is slower than the other two options.
  • scanned_rows
    • A high number here means that a lot of rows were not filtered out prior to retrieval. This could indicate the need for better filters on the query, adjustments to the primary index, or creation of join indexes or aggregating indexes.
  • query_optimization_us/1000 as q_opt_ms
    • A high number here indicates a complex query that takes time to compile and be prepared for execution. This could indicate the need for intermediate tables or materialized CTEs.
  • round(total_ram_consumed/1000000000,3) as total_ram_gb
    • A high number here indicates the query required a lot of memory, possibly due to aggregations or ordering data. This can reduce the memory available for concurrent queries and result in overall slower queries. It could indicate the need for better filters on the query, adjustments to the primary index, or creation of join indexes or aggregating indexes. A sketch that surfaces these symptoms across recent queries follows this list.
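To see which of these symptoms a workload is showing, the same columns can be pulled and ranked directly from query history. A minimal sketch (the time range is a placeholder, and the ORDER BY column can be swapped for any of the metrics above):

SELECT
    query_id
    , round(cpu_delay_us/1000) as cpu_delay_ms
    , round(scanned_bytes_storage/1000000000,3) as scanned_storage_gb
    , scanned_rows
    , query_optimization_us/1000 as q_opt_ms
    , round(total_ram_consumed/1000000000,3) as total_ram_gb
FROM information_schema.query_history
WHERE status != 'STARTED_EXECUTION'
AND start_time > '2022-04-21 00:00:00'
ORDER BY cpu_delay_ms DESC
LIMIT 20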

Query execution history

If a query has been executed over a long period of time and recently seems to be running slower, it can be useful to widen the time range examined in query history - perhaps to a week or a month. This can be done by searching for a text pattern in the query, or by adding query_text_normalized_hash to the WHERE clause (once you have identified the value for the query you are investigating).
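For example, once the hash for the query is known, a sketch like this tracks its key metrics by day (the hash value and time range are placeholders):

SELECT
    date_trunc('day', start_time) as day
    , count(*) as executions
    , round(avg(duration_usec/1000)) as avg_dur_ms
    , round(avg(scanned_rows)) as avg_scanned_rows
    , round(avg(total_ram_consumed/1000000000),3) as avg_ram_gb
    , round(avg(scanned_bytes_storage/1000000000),3) as avg_storage_gb
FROM information_schema.query_history
WHERE query_text_normalized_hash = '<hash value for the query>'
AND status != 'STARTED_EXECUTION'
AND start_time > '2022-03-21 00:00:00'  -- e.g. the past month
GROUP BY date_trunc('day', start_time)
ORDER BY 1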

Once the query execution information is retrieved, look for changes over time:

  • Changes in scanned_rows or total_ram_gb indicate that more data is being read than in the past, or that particular executions of the query are especially "heavy". It may be necessary to increase the size of the engine, or to consider whether new indexes or query filters would make a difference now that there is more data than in the past.
  • Changes in scanned_storage_gb can indicate that more data is being read and the SSD cache needs to be increased. It could also indicate that the warm-up method or warm-up queries are not bringing the needed data into cache when the engine is started.

Query context

Other queries running concurrently on the engine can impact a query, so it is useful to know how many other queries were executing at the same time. This can be done by showing query activity by minute for a period before and after the query execution being analyzed:

/*** use if engine version is 3.7 or later ***/
select
    minute
    , count(*) as query_count
    , round(avg(duration_usec/1000)) as avg_dur_ms
    , round(avg(time_in_queue_ms)) as avg_queue_ms
    , round(avg(query_optimization_us/1000)) as avg_q_opt_ms
    , round(avg(scanned_rows),0) as avg_read_rows
    , round(avg(inserted_rows),0) as avg_written_rows
    , round(avg(scanned_storage_mb),0) as avg_bytes_file_mb
    , round(avg(total_ram_mb)) as avg_mem_usage_mb
    , round(avg(cpu_usage_ms),0) as avg_cpu_usage_ms
    , round(avg(cpu_delay_ms)) as avg_cpu_delay_ms
    , sum(scanned_rows) as tot_rows_read
    , sum(select_count) as select_count
    , sum(insert_count) as insert_count
    , sum(cold_query_count) as cold_query_count
    , sum(ddl_count) as ddl_count
    , sum(err_count) as err_count
from (
    select
        date_trunc('minute', qh.start_time) as minute
        , query_id
        , query_text
        , status
        , duration_usec
        , time_in_queue_ms
        , query_optimization_us
        , case
            when lower(query_text) like '%insert into%' then 1
            else 0
          end insert_count
        , case
            when (lower(query_text) like '%select%'
                and lower(query_text) not like '%insert into%') then 1
            else 0
          end select_count
        , case
            when lower(query_text) like '%create dimension%'
                or lower(query_text) like '%create fact%'
                or lower(query_text) like '%drop table%' then 1
            else 0
          end ddl_count
        , case
            when scanned_bytes_storage > 0 then 1
            else 0
          end cold_query_count
        , case
            when error_message != '' then 1
            else 0
          end err_count
        , round(cpu_usage_us/1000) as cpu_usage_ms
        , round(cpu_delay_us/1000) as cpu_delay_ms
        , round(total_ram_consumed/1000000,3) as total_ram_mb
        , round(scanned_bytes/1000000,3) as scanned_mb
        , round(scanned_bytes_cache/1000000,3) as scanned_cache_mb
        , round(scanned_bytes_storage/1000000,3) as scanned_storage_mb
        , scanned_rows
        , inserted_rows
    from information_schema.query_history qh
    where lower(query_text) not like '%query_history%'
    and lower(query_text) not like '%running_queries%'
    and lower(query_text) not like '%show_indexes%'
    and lower(query_text) not like '%show_tables%'
    and status != 'STARTED_EXECUTION'
    and start_time between '2022-09-01 20:30:00' and '2022-09-05 20:50:00'
)
group by minute
order by minute
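If the slowdown is happening right now, currently executing queries can also be inspected. A minimal sketch using information_schema.running_queries (the columns available vary by version):

select
    query_id,
    start_time,
    status,
    query_text
from information_schema.running_queries
order by start_time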