Skip to content

cloudflare_worker_errors_count{status="exceededMemory"} disappears #197

@MarcoSaba

Description

@MarcoSaba

We observed that the metric:

cloudflare_worker_errors_count{status="exceededMemory"}

was previously exported correctly, then suddenly disappeared for multiple days, and later reappeared again without any configuration changes.

This seems to correlate with a Cloudflare API error logged by the exporter.

We had alerts based on:

increase(cloudflare_worker_errors_count{status="exceededMemory"}[15m])

but suddenly the alert started returning no data. When checking directly in Prometheus / Thanos:

count(cloudflare_worker_errors_count{status="exceededMemory"}) returned no data.

However: count by (status) (cloudflare_worker_errors_count) returned:

{status="clientDisconnected"} 1
{status="success"} 1

So the metric family still existed, but the exceededMemory label value disappeared completely. We checked the exporter directly:

curl http://localhost:8080/metrics | grep cloudflare_worker_errors_count

Output:

# HELP cloudflare_worker_errors_count Number of errors by script name
# TYPE cloudflare_worker_errors_count counter
cloudflare_worker_errors_count{account="kestra",script_name="docs",status="clientDisconnected"} 0
cloudflare_worker_errors_count{account="kestra",script_name="docs",status="success"} 0

The status="exceededMemory" series was not present at all. So this is not a Prometheus issue — the exporter was not exposing it anymore.

We also have a suspicious log entry:

level=fatal msg="Error fetching zones: could not read response body: unexpected EOF"

After this error the exporter continued running and logging Filtering zone: ...

But it never exported exceededMemory again. The metric only reappeared days later, without any configuration change.

Observations:

  • The exporter did not crash permanently
  • It did not log additional errors
  • No warning was logged about missing worker error statuses

Environment:
Helm chart: cloudflare-exporter-0.2.3
Image tag: 0.2.3
Deployed via ArgoCD
Kubernetes (GKE)
LOG_LEVEL=debug
Cloudflare zones configured via CF_ZONES

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions