Added helm chart for observability #21

Open · wants to merge 1 commit into base: master
Conversation

MSpryszynski (Collaborator)

No description provided.

@balis balis requested a review from groundnuty June 14, 2025 19:48
- name: node_cpu_usage
interval: 5s
rules:
- record: node_cpu_usage_percent
Contributor

@balis I would recommend 1-2 sentences per metric explaining what it does, so the next people have a less steep learning curve
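For example, such a per-rule comment could look like the sketch below. Note that the `expr` shown is an assumed illustration of a typical node CPU usage rule, not necessarily the expression used in this chart:

```yaml
groups:
  - name: node_cpu_usage
    interval: 5s
    rules:
      # node_cpu_usage_percent: share of CPU time each node spends in
      # non-idle modes over the last minute, as a percentage (0-100).
      # NOTE: this expr is a hypothetical example for illustration only.
      - record: node_cpu_usage_percent
        expr: 100 * (1 - avg by (instance) (rate(node_cpu_seconds_total{mode="idle"}[1m])))
```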

replicas: 1

config:
opensearch.yml: |
Contributor

@balis this looks very worrying, we need to talk about this

action: keep

processors:
batch: { }
Contributor

@balis are you sure this has to be defined as empty?

Collaborator Author

via https://github.com/open-telemetry/opentelemetry-collector/tree/main/processor/batchprocessor

The following configuration options can be modified:

- `send_batch_size` (default = 8192): number of spans, metric data points, or log records after which a batch will be sent regardless of the timeout. `send_batch_size` acts as a trigger and does not affect the size of the batch. If you need to enforce batch size limits sent to the next component in the pipeline, see `send_batch_max_size`.
- `timeout` (default = 200ms): time duration after which a batch will be sent regardless of size. If set to zero, `send_batch_size` is ignored, as data will be sent immediately, subject only to `send_batch_max_size`.
- `send_batch_max_size` (default = 0): the upper limit of the batch size; 0 means no upper limit. This property ensures that larger batches are split into smaller units. It must be greater than or equal to `send_batch_size`.
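For reference, a sketch of what the empty `batch: { }` effectively expands to, using the defaults quoted above:

```yaml
processors:
  batch:
    send_batch_size: 8192   # flush trigger, in items (defaults quoted above)
    timeout: 200ms          # flush after this duration regardless of size
    send_batch_max_size: 0  # 0 = no upper limit on batch size
```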

@@ -173,6 +173,8 @@ hyperflow-engine:
value: "${enableTracing}"
- name: HF_VAR_ENABLE_OTEL
value: "${enableOtel}"
- name: HF_VAR_OPT_URL
value: "http://hf-obs-opentelemetry-collector"
Contributor

@balis I don't like the fact that this is defined both here and here: c12659e#diff-7800e510fef5761baa4ff5930e280adbc39c087c52583ca395d8aa5d38c86dc6R69
We should talk about why it is in 2 places.

- name: HF_VAR_ENABLE_OTEL
value: "1"
- name: HF_VAR_OPT_URL
value: "http://hf-obs-opentelemetry-collector"
Contributor

@balis or even 3 :-)

Collaborator Author

Removed one of them
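One way to avoid the remaining duplication is to keep a single source of truth in `values.yaml` and reference it from each template. This is a hypothetical sketch; the key names below are assumptions, not this chart's actual values layout:

```yaml
# values.yaml (hypothetical key name, for illustration)
observability:
  otelCollectorUrl: "http://hf-obs-opentelemetry-collector"

# then, in each template that sets the env var:
#   - name: HF_VAR_OPT_URL
#     value: {{ .Values.observability.otelCollectorUrl | quote }}
```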

2 participants