Postgres to ClickHouse: Data Modeling Tips
PeerDB makes it fast and simple to replicate data from Postgres to ClickHouse. A common question from PeerDB users is how to model their data in ClickHouse after the replication process to maximize the benefits of ClickHouse.
This question arises because ClickHouse and Postgres differ in data modeling, as each is a purpose-built database highly optimized for its specific workload -Postgres is a transactional (OLTP) database, while ClickHouse is an analytical (OLAP) columnar database. This guide walks you through essential data modeling concepts in ClickHouse for users coming from the Postgres world.
ReplacingMergeTree table engine
PeerDB maps PostgreSQL tables to ClickHouse using the ReplacingMergeTree engine. ClickHouse performs best with append-only workloads and does not recommend frequent UPDATEs. This is where ReplacingMergeTree is particularly powerful.
ReplacingMergeTree
supports workloads that involve both data ingestion and modifications. Each table is append-only, with user updates ingested as versioned INSERTs. The ReplacingMergeTree engine manages deduplication (merging) of rows in the background. This is one of the key factors that enables ClickHouse to deliver exceptional real-time ingestion performance.
In PeerDB, both INSERTs and UPDATEs from Postgres are captured as new rows with different versions (using _peerdb_version
) in ClickHouse. The ReplacingMergeTree
table engine periodically handles deduplication in the background using the Ordering Key (ORDER BY columns), retaining only the row with the latest _peerdb_version
. DELETEs from PostgreSQL are propagated as new rows marked as deleted (using the _peerdb_is_deleted
column). The snippet below shows the target table definition for the public_goals
table in ClickHouse.
You might still see duplicates for rows—how should you handle them?
ReplacingMergeTree clears out duplicates asynchronously in the background but doesn’t guarantee the absence of duplicates. So, when you query the data, you might still see duplicates for the same row or primary key but with different versions. This is expected. To remove duplicates, you have a couple of approaches:
Use FINAL in your queries
ClickHouse has a unique modifier called FINAL, which performs de-duplication (merging of rows) at query time. This de-duplication occurs after filtering (WHERE clause) but before aggregations (GROUP BY).
A historical concern has been that FINAL can slow down query performance. While it does impact query performance to some extent, recent releases of ClickHouse have introduced significant improvements to enhance FINAL query performance. So, don’t hesitate to use the FINAL clause and evaluate how your queries perform. Below is an example of how to use the FINAL clause:
Use argMax to deduplicate rows at query time
In ClickHouse, argMax is a powerful function for deduplicating rows dynamically during query execution. This is particularly useful when you need to retain the most recent or relevant record based on a versioning or timestamp column.
For instance, if you’re working with a table like peerdb.public_goals
, where id is the primary key and _peerdb_version
tracks versions, you can use argMax to select the row with the highest _peerdb_version
for each id
. This approach allows you to efficiently remove duplicates without altering the underlying data. You can then run your aggregations as a subquery over this deduplicated result set for further analysis. Below query is an example of using argMax
Use WINDOW FUNCTIONS
You can use ClickHouse’s window functions to achieve similar deduplication by selecting the row with the highest _peerdb_version
within each id partition. Here’s an example:
Use Views to simplify deduplication
Encapsulate deduplication in a view to make it simple for BI tools to query the most up-to-date data. For example, use a window function in the view to keep only the latest version of each row:
Nullable Columns
If you’re coming from the Postgres world, one surprising aspect of ClickHouse is that it doesn’t store NULL values for columns unless you explicitly wrap the column types in Nullable
. For example, instead of storing NULL for dates, ClickHouse stores 1970-01-01
as the default value, which might be unexpected. This behavior is due to the fact that storing NULLs can impact query performance in ClickHouse, as it’s a columnar database. Hence, ClickHouse requires users to explicitly define Nullable
types.
In PeerDB, we’ve introduced a setting called PEERDB_NULLABLE
, which, when set to true
, automatically detects nullable columns in Postgres and marks them as Nullable
in ClickHouse during the replication process. This means you don’t need to manually define Nullable
types during replication. You can read more about this feature in the following PR.
Data Types
ClickHouse offers a wide variety of data types, ranging from numbers, text, timestamps, dates, and arrays to the recently introduced JSON type. Many of the data types in Postgres can be natively stored in ClickHouse without much modification.
As a reference, here goes the data type matrix we use at PeerDB when replicating data from Postgres to ClickHouse.
The Ordering Key
What is an Ordering Key?
Choosing the right ordering key is crucial for query performance in ClickHouse. Defined by the ORDER BY
clause when creating a table, the ordering key functions similarly to a index in Postgres but is optimized for analytics. Unlike Postgres, which uses a B-tree index with entries pointing to each row, ClickHouse uses Sparse Indexing:
-
Data is sorted based on Ordering Key: The ordering key ensures that data on disk is sorted according to the specified columns. This allows for better compression, as correlated values are stored together.
-
Ordering Key also creates a sparse index: The ordering key also creates a sparse index, storing only ranges of columns, with each entry pointing to a group of sorted rows. This keeps the index small, allowing ClickHouse to quickly identify relevant groups of rows using a binary search and execute queries efficiently. You can read more about this here.
You can think of ordering keys as similar to BRIN indexes in Postgres, but in ClickHouse, the data is automatically sorted based on the ordering key via asynchronous merging of parts, so you don’t need to handle sorting during data ingestion.
Choosing an appropriate Ordering Key
When selecting an ordering key, base your choice on the columns most frequently used in your query filters. Prioritize columns that are commonly used in WHERE clauses, and order them in ascending sequence of cardinality—starting with columns that have the fewest distinct values. This approach optimizes data compression and query performance. For a deeper understanding of this topic, refer to the detailed guide here.
PRIMARY KEY vs Ordering Key
If you observe the table definition of public_goals
, it has a PRIMARY KEY
. You might be wondering how the PRIMARY KEY
differs from the Ordering Key. Let us understand how they differ:
-
PRIMARY KEY
, if specified, defines the columns in the sparse index, while the columns in theORDER BY
clause determine how the data is sorted on disk. They are also used for deduplicating data by theReplacingMergeTree
. -
If the
PRIMARY KEY
isn’t specified, the Ordering Key automatically becomes thePRIMARY KEY
and defines the columns in the sparse index.
NOTE: Columns in the PRIMARY KEY
should always be prefixed in the Ordering Key. This ensures that the index aligns with the physical data order, maximizing query performance by minimizing unnecessary data scans.
An example where PRIMARY KEY
could differ from Ordering Key
An example where you might have different PRIMARY KEY
and ORDER BY
columns is when your queries are primarily filtered on customer_id
rather than id
. In this case, you can define the PRIMARY KEY
on just customer_id
and the ORDER BY
on customer_id, id
. This approach ensures a smaller, more efficient sparse index for querying, while data deduplication occurs on id
, ensuring no data is lost.
NOTE: Unlike in Postgres, where the PRIMARY KEY
is a B-tree index that guarantees uniqueness, in ClickHouse, it does not ensure uniqueness. Instead, it defines the columns that should be part of the sparse index.
Modifying the Ordering Key
Choosing the right ordering key is crucial for query performance in ClickHouse, as it acts as an index when querying data. By default, PeerDB uses the PostgreSQL PRIMARY KEY
to define the ordering key in ClickHouse tables, but you can change it using the following methods:
Use materialized views
You can use materialized views to create a new table with a different ordering key suitable for your workload. Include the primary key columns at the end of the ordering key to ensure proper deduplication, as ReplacingMergeTree uses the ORDER BY clause for deduplication, and including the primary key ensures that no data is lost.
NOTE: After creating the materialized view, be sure to follow the steps described in the previous section on handling duplicates to ensure proper deduplication during query time.
Predefine target tables with the desired Ordering Key
To change the ordering key, you can predefine new tables with your desired Ordering Key and then swap them with the existing tables. Here’s how you can do it:
1. Create a Dummy Mirror: Create a dummy mirror in PeerDB to generate the default tables with the correct metadata columns and data types.
2. Create a New Table with the Desired Ordering Key: Use the table created by PeerDB to define a new table with your desired ordering key. Include the primary key columns at the end of the ordering key to ensure proper deduplication. Here is an example:
3. Drop the Old Table:
4. Rename the New Table: Rename the new table to the actual table
5. Start MIRROR to Point to the New Table: Configure the mirror to point to the actual table. PeerDB uses CREATE TABLE IF NOT EXISTS
behind the scenes and continues to ingest data into the new table.
Handling DELETEs
As mentioned, DELETEs from PostgreSQL are propagated as new rows marked as deleted (using the _peerdb_is_deleted
column). To exclude deleted rows from your queries, you can create row-level policies in ClickHouse based on the _peerdb_is_deleted
column. Here’s an example:
This policy ensures that only rows where _peerdb_is_deleted
is 0 are visible when querying the table.
How to handle updates for primary keys?
To propagate primary key updates into ClickHouse, enable PEERDB_CLICKHOUSE_ENABLE_PRIMARY_UPDATE
setting on your mirror. This instructs the PeerDB normalize step to generate a “delete + insert” pair for changed primary keys, using a sign column (in the ReplacingMergeTree
) and a version column to remove the old record before inserting the new one, ensuring that your ClickHouse tables accurately reflect the latest state from the source system. It is highly uncommon for primary keys to be updated in Postgres.
Conclusion
I hope you enjoyed reading the guide. We aimed to cover the most common data-modeling challenges you might encounter when migrating from PostgreSQL to ClickHouse. In the next blog, I plan to dive into more advanced topics, such as joins, writing efficient SQL queries, and so on. If you want to give PeerDB and ClickHouse a try to start replicating data from Postgres to ClickHouse, please check out the links below or reach out to us directly!