Start Now Login
Announcing Query Groups – Intelligent Query Classification

Announcing Query Groups – Intelligent Query Classification

Query Groups is a powerful feature which intelligently classifies and ranks query workloads on your cluster. Query Groups can answer questions like:

How it Works

Queries are grouped together using a proprietary algorithm, and ranked by volume, execution time, and queue time. More metrics will be added in the future.

All queries in a “query group” share SQL structure and operate on the same tables.

Example – Find the queries causing a query spike

At 8:17am on Aug. 7 the below cluster experienced an 8x spike in queries. Typically, this type of event is caused by a handful of new queries which suddenly increased their volume. How do you find which queries? Who ran them?

Query groups can quickly determine which queries are responsible.

Click on the new Query Groups page in the left nav. Groups are sorted by Rank by default. In this case, we want to re-sort by “Rank Change”. Sorting by rank change will order the list of query groups by the ‘fastest movers’. So you’ll quickly see the groups which moved up the ranks in the past week.

Sure enough, we see a handful of query groups which suddenly started running. Clicking into the first one, we can isolate the exact queries.

The same procedure could be used to determine the queries that spike in latency or queue time.

What’s Next

We will expand the ‘grouping’ concept in the future to add:

Related content
3 Things to Avoid When Setting Up an Amazon Redshift Cluster Apache Spark vs. Amazon Redshift: Which is better for big data? Amazon Redshift Spectrum: Diving into the Data Lake! What Causes "Serializable Isolation Violation Errors" in Amazon Redshift? A Quick Guide to Using Short Query Acceleration and WLM for Amazon Redshift for Faster Queries What is TensorFlow? An Intro to The Most Popular Machine Learning Framework Titans of Data with Mirko Novakovic - How Containers are Giving Rise to New Data Services Why We Built intermix.io - “APM for Data” 4 Simple Steps To Set-up Your WLM in Amazon Redshift For Better Workload Scalability World-class Data Engineering with Amazon Redshift - Training Announcing App Tracing - Monitoring Your Data Apps With intermix.io Have Your Postgres Cake with Amazon Redshift and eat it, too. 4 Real World Use Cases for Amazon Redshift 3 Steps for Fixing Slow Looker Dashboards with Amazon Redshift Zero Downtime Elasticsearch Migrations Titans of Data with Florian Leibert – CEO Mesosphere Improve Amazon Redshift COPY performance:  Don’t ANALYZE on every COPY Building a Better Data Pipeline - The Importance of Being Idempotent The Future of Machine Learning in the Browser with TensorFlow.js Gradient Boosting Libraries — A Comparison Crowdsourcing Weather Data With Amazon Redshift The Future of Apache Airflow Top 14 Performance Tuning Techniques for Amazon Redshift Product Update: An Easy Way To Find The Cause of Disk Usage Spikes in Amazon Redshift How We Reduced Our Amazon Redshift Cost by 28%
Ready to start seeing into your data infrastructure?
Get started with a 14-day free trial, with access to the full platform

No Credit Card Required