|
| 1 | +## FAQ {#faq} |
| 2 | + |
| 3 | +### Compute {#compute} |
| 4 | + |
| 5 | +#### Can I create multiple services in this single EKS cluster? {#can-i-create-multiple-services-in-this-single-eks-cluster} |
| 6 | + |
| 7 | +Yes. The infrastructure only needs to be provisioned once for every AWS account and region combination. |
| 8 | + |
| 9 | +### Which regions do you support for BYOC? {#which-regions-do-you-support-for-byoc} |
| 10 | + |
| 11 | +BYOC supports the same set of [regions](/cloud/reference/supported-regions#aws-regions ) as ClickHouse Cloud. |
| 12 | + |
| 13 | +#### Will there be some resource overhead? What are the resources needed to run services other than ClickHouse instances? {#will-there-be-some-resource-overhead-what-are-the-resources-needed-to-run-services-other-than-clickhouse-instances} |
| 14 | + |
| 15 | +Besides Clickhouse instances (ClickHouse servers and ClickHouse Keeper), we run services such as `clickhouse-operator`, `aws-cluster-autoscaler`, Istio etc. and our monitoring stack. |
| 16 | + |
| 17 | +Currently we have 3 m5.xlarge nodes (one for each AZ) in a dedicated node group to run those workloads. |
| 18 | + |
| 19 | +### Network and Security {#network-and-security} |
| 20 | + |
| 21 | +#### Can we revoke permissions set up during installation after setup is complete? {#can-we-revoke-permissions-set-up-during-installation-after-setup-is-complete} |
| 22 | + |
| 23 | +This is currently not possible. |
| 24 | + |
| 25 | +#### Have you considered some future security controls for ClickHouse engineers to access customer infra for troubleshooting? {#have-you-considered-some-future-security-controls-for-clickhouse-engineers-to-access-customer-infra-for-troubleshooting} |
| 26 | + |
| 27 | +Yes. Implementing a customer controlled mechanism where customers can approve engineers' access to the cluster is on our roadmap. At the moment, engineers must go through our internal escalation process to gain just-in-time access to the cluster. This is logged and audited by our security team. |
| 28 | + |
| 29 | +#### What is the size of the VPC IP range created? {#what-is-the-size-of-the-vpc-ip-range-created} |
| 30 | + |
| 31 | +By default we use `10.0.0.0/16` for BYOC VPC. We recommend reserving at least /22 for potential future scaling, |
| 32 | +but if you prefer to limit the size, it is possible to use /23 if it is likely that you will be limited |
| 33 | +to 30 server pods. |
| 34 | + |
| 35 | +#### Can I decide maintenance frequency {#can-i-decide-maintenance-frequency} |
| 36 | + |
| 37 | +Contact support to schedule maintenance windows. Please expect a minimum of a weekly update schedule. |
| 38 | + |
| 39 | +## Observability {#observability} |
| 40 | + |
| 41 | +### Built-in Monitoring Tools {#built-in-monitoring-tools} |
| 42 | + |
| 43 | +#### Observability Dashboard {#observability-dashboard} |
| 44 | + |
| 45 | +ClickHouse Cloud includes an advanced observability dashboard that displays metrics such as memory usage, query rates, and I/O. This can be accessed in the **Monitoring** section of ClickHouse Cloud web console interface. |
| 46 | + |
| 47 | +<br /> |
| 48 | + |
| 49 | +<Image img={byoc3} size="lg" alt="Observability dashboard" border /> |
| 50 | + |
| 51 | +<br /> |
| 52 | + |
| 53 | +#### Advanced Dashboard {#advanced-dashboard} |
| 54 | + |
| 55 | +You can customize a dashboard using metrics from system tables like `system.metrics`, `system.events`, and `system.asynchronous_metrics` and more to monitor server performance and resource utilization in detail. |
| 56 | + |
| 57 | +<br /> |
| 58 | + |
| 59 | +<Image img={byoc4} size="lg" alt="Advanced dashboard" border /> |
| 60 | + |
| 61 | +<br /> |
| 62 | + |
| 63 | +#### Prometheus Integration {#prometheus-integration} |
| 64 | + |
| 65 | +ClickHouse Cloud provides a Prometheus endpoint that you can use to scrape metrics for monitoring. This allows for integration with tools like Grafana and Datadog for visualization. |
| 66 | + |
| 67 | +**Sample request via https endpoint /metrics_all** |
| 68 | + |
| 69 | +```bash |
| 70 | +curl --user <username>:<password> https://i6ro4qarho.mhp0y4dmph.us-west-2.aws.byoc.clickhouse.cloud:8443/metrics_all |
| 71 | +``` |
| 72 | + |
| 73 | +**Sample Response** |
| 74 | + |
| 75 | +```bash |
| 76 | +# HELP ClickHouse_CustomMetric_StorageSystemTablesS3DiskBytes The amount of bytes stored on disk `s3disk` in system database |
| 77 | +# TYPE ClickHouse_CustomMetric_StorageSystemTablesS3DiskBytes gauge |
| 78 | +ClickHouse_CustomMetric_StorageSystemTablesS3DiskBytes{hostname="c-jet-ax-16-server-43d5baj-0"} 62660929 |
| 79 | +# HELP ClickHouse_CustomMetric_NumberOfBrokenDetachedParts The number of broken detached parts |
| 80 | +# TYPE ClickHouse_CustomMetric_NumberOfBrokenDetachedParts gauge |
| 81 | +ClickHouse_CustomMetric_NumberOfBrokenDetachedParts{hostname="c-jet-ax-16-server-43d5baj-0"} 0 |
| 82 | +# HELP ClickHouse_CustomMetric_LostPartCount The age of the oldest mutation (in seconds) |
| 83 | +# TYPE ClickHouse_CustomMetric_LostPartCount gauge |
| 84 | +ClickHouse_CustomMetric_LostPartCount{hostname="c-jet-ax-16-server-43d5baj-0"} 0 |
| 85 | +# HELP ClickHouse_CustomMetric_NumberOfWarnings The number of warnings issued by the server. It usually indicates about possible misconfiguration |
| 86 | +# TYPE ClickHouse_CustomMetric_NumberOfWarnings gauge |
| 87 | +ClickHouse_CustomMetric_NumberOfWarnings{hostname="c-jet-ax-16-server-43d5baj-0"} 2 |
| 88 | +# HELP ClickHouseErrorMetric_FILE_DOESNT_EXIST FILE_DOESNT_EXIST |
| 89 | +# TYPE ClickHouseErrorMetric_FILE_DOESNT_EXIST counter |
| 90 | +ClickHouseErrorMetric_FILE_DOESNT_EXIST{hostname="c-jet-ax-16-server-43d5baj-0",table="system.errors"} 1 |
| 91 | +# HELP ClickHouseErrorMetric_UNKNOWN_ACCESS_TYPE UNKNOWN_ACCESS_TYPE |
| 92 | +# TYPE ClickHouseErrorMetric_UNKNOWN_ACCESS_TYPE counter |
| 93 | +ClickHouseErrorMetric_UNKNOWN_ACCESS_TYPE{hostname="c-jet-ax-16-server-43d5baj-0",table="system.errors"} 8 |
| 94 | +# HELP ClickHouse_CustomMetric_TotalNumberOfErrors The total number of errors on server since the last restart |
| 95 | +# TYPE ClickHouse_CustomMetric_TotalNumberOfErrors gauge |
| 96 | +ClickHouse_CustomMetric_TotalNumberOfErrors{hostname="c-jet-ax-16-server-43d5baj-0"} 9 |
| 97 | +``` |
| 98 | + |
| 99 | +**Authentication** |
| 100 | + |
| 101 | +A ClickHouse username and password pair can be used for authentication. We recommend creating a dedicated user with minimal permissions for scraping metrics. At minimum, a `READ` permission is required on the `system.custom_metrics` table across replicas. For example: |
| 102 | + |
| 103 | +```sql |
| 104 | +GRANT REMOTE ON *.* TO scraping_user |
| 105 | +GRANT SELECT ON system.custom_metrics TO scraping_user |
| 106 | +``` |
| 107 | + |
| 108 | +**Configuring Prometheus** |
| 109 | + |
| 110 | +An example configuration is shown below. The `targets` endpoint is the same one used for accessing the ClickHouse service. |
| 111 | + |
| 112 | +```bash |
| 113 | +global: |
| 114 | + scrape_interval: 15s |
| 115 | + |
| 116 | +scrape_configs: |
| 117 | + - job_name: "prometheus" |
| 118 | + static_configs: |
| 119 | + - targets: ["localhost:9090"] |
| 120 | + - job_name: "clickhouse" |
| 121 | + static_configs: |
| 122 | + - targets: ["<subdomain1>.<subdomain2>.aws.byoc.clickhouse.cloud:8443"] |
| 123 | + scheme: https |
| 124 | + metrics_path: "/metrics_all" |
| 125 | + basic_auth: |
| 126 | + username: <KEY_ID> |
| 127 | + password: <KEY_SECRET> |
| 128 | + honor_labels: true |
| 129 | +``` |
| 130 | + |
| 131 | +Please also see [this blog post](https://clickhouse.com/blog/clickhouse-cloud-now-supports-prometheus-monitoring) and the [Prometheus setup docs for ClickHouse](/integrations/prometheus). |
0 commit comments