intermix Blog

Best practices and lessons learned for cloud ETL and data engineering.

Struggling with slow queries & locked tables? Here’s how to configure your Redshift cluster for performance

12 min READ
May 27th 2019
When customers start using intermix.io for the first time, they can see the set-up and configuration of their Amazon Redshift cluster in the context of their queries and workflows. At that point, customers experience one common reaction:“Knowing what we know now, how would we set up our Redshift cluster […]
Lars Kamp Lars Kamp

AWS Redshift Architecture: Clusters & Nodes & Data Apps, oh my!

7 min READ
February 23rd 2019
In this post, we’ll lay out the 5 major components of Amazon Redshift’s architecture.Data applicationsClustersLeader nodesCompute nodesRedshift SpectrumUnderstanding the components and how they work is fundamental for building a data platform with Redshift. In the post, we’ll provide tips and refer […]
Lars Kamp Lars Kamp

Zero Downtime Elasticsearch Migrations

6 min READ
July 12th 2018
IntroductionAt intermix.io, Elasticsearch is a final destination for data that is processed through our data pipeline. Data gets loaded from Amazon Redshift into Elasticsearch via an indexing job. Elasticsearch data then gets served to the intermix.io dashboard to data engineers, giving them a view […]
Stefan Gromoll Stefan Gromoll

Are you paying too much for Amazon Redshift? 4 steps to reduce your costs.

7 min READ
October 14th 0202
Amazon Redshift’s pricing can be a bit confusing, since you can choose to pay hourly based on your node usage (on demand), by the number of bytes scanned (spectrum), by the time spent over your free daily credits (concurrency scaling), or by committing to an annual plan (reserved instance).And as you s […]
Mark Smallcombe Mark Smallcombe
12