Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Contour uses more than 5Gib and gets OOM killed #6763

Open
netic-rkm opened this issue Nov 14, 2024 · 2 comments
Open

Contour uses more than 5Gib and gets OOM killed #6763

netic-rkm opened this issue Nov 14, 2024 · 2 comments
Labels
kind/bug Categorizes issue or PR as related to a bug. lifecycle/needs-triage Indicates that an issue needs to be triaged by a project contributor. lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale.

Comments

@netic-rkm
Copy link

What steps did you take and what happened:
Contour sometime get OOM killed multiple times an hour.

What did you expect to happen:
For it to not get OOM killed.

Anything else you would like to add:

We are seeing Contour is getting OOM killed across multiple cluster, in the beginning we just added more memory to the pod, but now that we have allocated 5Gib to the pod it feels like there is a bigger problem.

Not sure what you need for further debugging, but here is the metrics for cached objects, one strange thing is that it is caching all objects in the cluster instead of only relevant objects.

contour_dag_cache_object{kind="ConfigMap"} 48 contour_dag_cache_object{kind="Gateway"} 1 contour_dag_cache_object{kind="GatewayClass"} 0 contour_dag_cache_object{kind="HTTPProxy"} 169 contour_dag_cache_object{kind="HTTPRoute"} 1 contour_dag_cache_object{kind="Ingress"} 99 contour_dag_cache_object{kind="Namespace"} 48 contour_dag_cache_object{kind="Secret"} 1080 contour_dag_cache_object{kind="Service"} 402

Environment:

  • Contour version: 1.29.1
  • Kubernetes version: (use kubectl version): 1.31.2
  • Kubernetes installer & version: v1.29.7
  • Cloud provider or hardware configuration: 40 nodes with 6cpu 16Gib ram
  • OS (e.g. from /etc/os-release): Ubuntu 22.04.4 LTS
@netic-rkm netic-rkm added kind/bug Categorizes issue or PR as related to a bug. lifecycle/needs-triage Indicates that an issue needs to be triaged by a project contributor. labels Nov 14, 2024
Copy link

Hey @netic-rkm! Thanks for opening your first issue. We appreciate your contribution and welcome you to our community! We are glad to have you here and to have your input on Contour. You can also join us on our mailing list and in our channel in the Kubernetes Slack Workspace

Copy link

The Contour project currently lacks enough contributors to adequately respond to all Issues.

This bot triages Issues according to the following rules:

  • After 60d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, the Issue is closed

You can:

  • Mark this Issue as fresh by commenting
  • Close this Issue
  • Offer to help out with triage

Please send feedback to the #contour channel in the Kubernetes Slack

@github-actions github-actions bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug. lifecycle/needs-triage Indicates that an issue needs to be triaged by a project contributor. lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale.
Projects
None yet
Development

No branches or pull requests

1 participant