Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

IP is not released after deleting IPClaim #694

Open
haijianyang opened this issue Sep 24, 2024 · 4 comments
Open

IP is not released after deleting IPClaim #694

haijianyang opened this issue Sep 24, 2024 · 4 comments
Assignees
Labels
kind/bug Categorizes issue or PR as related to a bug. triage/accepted Indicates an issue is ready to be actively worked on.
Milestone

Comments

@haijianyang
Copy link

What steps did you take and what happened:

Create an IPClaim and then delete it immediately. It may happen that the IPClaim is deleted, but the IPAddress associated with the IPClaim still exists.

Here is IPAM logs:

2024-09-23T09:51:08.093162287+08:00 stderr F I0923 09:51:08.092992       1 ippool_manager.go:336] "controllers/IPPool/IPPool-controller: Getting address" metal3-ippool="default/myippool" Claim="default-myippoolqamn-jzm958-wvkz2-0"
2024-09-23T09:51:08.093188235+08:00 stderr F I0923 09:51:08.093072       1 ippool_manager.go:346] "controllers/IPPool/IPPool-controller: Address allocated" metal3-ippool="default/myippool" Claim="default-myippool-qamn-jzm958-wvkz2-0" address="10.60.70.160"
2024-09-23T09:51:08.136815948+08:00 stderr F E0923 09:51:08.133958       1 ippool_manager.go:221] "controllers/IPPool/IPPool-controller: failed to Patch IPClaim" err="ipclaims.ipam.metal3.io \"default-myippool-qamn-jzm958-wvkz2-0\" not found" metal3-ippool="default/myippool"

What did you expect to happen:
The IP should be released after deleting IPClaim, otherwise the IP resources will be leaked.

Anything else you would like to add:
I found that createAddress first sets IPClaimFinalizer for IPClaim in the same reconcile, then creates IPAddress, and finally saves IPClaimFinalizer. So if IPClaim is deleted before saving IPClaimFinalizer, there should be a situation where IPAddress is not deleted in cascade.

// https://github.com/metal3-io/ip-address-manager/blob/ce1b21f332bab3af1d52d87f7e6b95e7311c4e85/ipam/ippool_manager.go#L318

func (m *IPPoolManager) createAddress(ctx context.Context,
	addressClaim *ipamv1.IPClaim, addresses map[ipamv1.IPAddressStr]string,
) (map[ipamv1.IPAddressStr]string, error) {
	if !Contains(addressClaim.Finalizers, ipamv1.IPClaimFinalizer) {
		addressClaim.Finalizers = append(addressClaim.Finalizers,
			ipamv1.IPClaimFinalizer,
		)
	}

	// other codes

	// Create the IPAddress object. If we get a conflict (that will set
	// HasRequeueAfterError), then requeue to retrigger the reconciliation with
	// the new state
	if err := createObject(ctx, m.client, addressObject); err != nil {
		var reqAfter *RequeueAfterError
		if ok := errors.As(err, &reqAfter); !ok {
			addressClaim.Status.ErrorMessage = pointer.String("Failed to create associated IPAddress object")
		}
		return addresses, err
	}

	// other codes

	return addresses, nil
}

There are two possible solutions:

  1. First set and save IPClaimFinalizer for IPClaim, then create IPAddress.
  2. Keep the current way of using IPClaimFinalizer and clean up the IPAddress of the deleted IPClaim.

Environment:

  • Cluster-api version: v1.6.x
  • CAPM3 version:
  • IPAM version: v1.6.3
  • Minikube version:
  • environment (metal3-dev-env or other): other
  • Kubernetes version: (use kubectl version): v1.25.6

/kind bug

@metal3-io-bot metal3-io-bot added kind/bug Categorizes issue or PR as related to a bug. needs-triage Indicates an issue lacks a `triage/foo` label and requires one. labels Sep 24, 2024
@tuminoid
Copy link
Member

@kashifest @peppi-lotta

@peppi-lotta
Copy link
Member

peppi-lotta commented Sep 24, 2024

This is good catch and well defined issue! The project doesn't have a test for this kind of situation where ipclaim gets deleted immediately after creation and I imagine this doesn't come up often when running commands by hand.

The finalizer gets removed from the IPClaim in the deleteAddress function regardless if an IPAddress object is found or not. Clearly the IPclaim gets deleted before the IPAddress object is done creating so I think the first suggestion might not actually fix this issue even though it could make the time window where this issue can occur smaller. A function that cleans "orphan" IPAddress objects would work a 100%.

@Rozzii
Copy link
Member

Rozzii commented Sep 25, 2024

/triage accepted

@metal3-io-bot metal3-io-bot added triage/accepted Indicates an issue is ready to be actively worked on. and removed needs-triage Indicates an issue lacks a `triage/foo` label and requires one. labels Sep 25, 2024
@adilGhaffarDev
Copy link
Member

/assign
@haijianyang I was able to reproduce the issue, but it only occurs when kubectl apply and kubectl delete are executed within a very short time frame (within a second). I'll open a PR with a fix, but could you provide more details on the specific scenarios where this behavior might cause problems?
I don't expect this to occur in typical scenarios.

@Rozzii Rozzii moved this from Backlog to CAPM3 WIP in Metal3 - Roadmap Nov 6, 2024
@Rozzii Rozzii added this to the IPAM - v1.9 milestone Nov 6, 2024
@tuminoid tuminoid modified the milestones: IPAM - v1.9, IPAM - v1.10 Dec 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug. triage/accepted Indicates an issue is ready to be actively worked on.
Projects
Status: CAPM3 WIP
Development

No branches or pull requests

6 participants