Start Now login

intermix Blog

Best practices and lessons learned for cloud ETL and data engineering.

How We Reduced Our Amazon Redshift Cost by 29%

7 min READ
September 18th 2018
Scale down, if you dare Amazon Redshift makes it easy to scale. Need more computing power or disk space? Simply add a few nodes with a click or two in the AWS Console and after a couple of hours of Read-Only mode your data warehouse will be ready to crunch your valuable and growing dataset […]
Stefan Gromoll Stefan Gromoll

Product Update: An Easy Way To Find The Cause of Disk Usage Spikes in Amazon Redshift

1 min READ
September 12th 2018
In August we worked on a few new features in intermix.io that make your life as a data engineer easier when working with Amazon Redshift. For example, “Disk full” errors can be one of the performance issues you may encounter. And so we’ve made it easier to find the cause of disk usage spikes in […]
Paul Lappas Paul Lappas

Top 14 Performance Tuning Techniques for Amazon Redshift

19 min READ
August 31st 2018
Introduction Amazon Redshift is a data warehouse that makes it fast, simple and cost-effective to analyze petabytes of data across your data warehouse and data lake. Amazon Redshift can deliver 10x the performance of other data warehouses by using a combination of machine learning, massively parallel processing (MPP), and columnar storage on SSD disks. But […]
Paul Lappas Paul Lappas

The Future of Apache Airflow

5 min READ
August 20th 2018
This is a guest blog post by Pete DeJoy. Pete is a Product Specialist at Astronomer, where he helps companies adopt Airflow.  Apache Airflow has come a long way since it was first started as an internal project within Airbnb back in 2014 thanks to the core contributors’ fantastic work in creating a very engaged […]
Pete DeJoy Pete DeJoy

Announcing Query Groups – Intelligent Query Classification

1 min READ
August 10th 2018
Query Groups is a powerful feature which intelligently classifies and ranks query workloads on your cluster. Query Groups can answer questions like: my cluster just experienced a sudden increase in latency – which queries are causing this? which queries are consuming the most amount of cluster resources? which queries are slowly increasing in latency? Have […]
Paul Lappas Paul Lappas
1 2 3 6