You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Data is usually not modified after being appended, especially after a period of 24 hours. It might make sense to run background jobs to compact all similar rows into an aggregate table that is partitioned by hours.
While we might need minute-by-minute granularity for our timestamps for the first 24 hours, I think it is acceptable to reduce the granularity we store past 24 hours into hourly chunks instead. However, we will have to limit the API for any queries requesting for minute granularity data past the 24 hour mark. The benefits are that it would significantly improve performance and storage usage for larger sites, as we would no longer need to store as much duplicate data and process as many rows.
DuckDB does not support aggregate tables, thus we'll have to run our own compaction jobs, which aren't too heavy of an operation. For reference, here are the documentation for StarRocks and ClickHouse natively supported aggregated tables.
The text was updated successfully, but these errors were encountered:
Data is usually not modified after being appended, especially after a period of 24 hours. It might make sense to run background jobs to compact all similar rows into an aggregate table that is partitioned by hours.
While we might need minute-by-minute granularity for our timestamps for the first 24 hours, I think it is acceptable to reduce the granularity we store past 24 hours into hourly chunks instead. However, we will have to limit the API for any queries requesting for minute granularity data past the 24 hour mark. The benefits are that it would significantly improve performance and storage usage for larger sites, as we would no longer need to store as much duplicate data and process as many rows.
DuckDB does not support aggregate tables, thus we'll have to run our own compaction jobs, which aren't too heavy of an operation. For reference, here are the documentation for StarRocks and ClickHouse natively supported aggregated tables.
The text was updated successfully, but these errors were encountered: