Fix race condition in Lambda+LocalServer causing NIOAsyncWriter fatal error (Bug #635) by sebsto · Pull Request #636 · awslabs/swift-aws-lambda-runtime

sebsto · 2026-01-19T15:54:16Z

On fast machines, the local Lambda server crashes with:

Fatal error: Deinited NIOAsyncWriter without calling finish()

This occurs in NIOAsyncChannelHandler.channelActive() when child connection channels are created.

Root Cause

This is a known issue with NIO's async server channel API (see swift-nio#2637).

The fundamental problem:

The async bind() API creates NIOAsyncChannel instances for incoming connections
These channels are yielded through an async stream to the server loop
When the serving task is cancelled (or completes), the async stream iteration stops
Any channels that were accepted but not yet read from the stream are dropped
These unread channels never have executeThenClose() called on them
Their NIOAsyncWriter is deallocated without finish() being called → fatal error

Why graceful shutdown doesn't help:

Even closing the server channel gracefully doesn't eliminate the race - there's a timing window where:

A connection is accepted and queued in the async stream
The server task is cancelled or completes
The queued channel is never read and gets dropped

IMHO, this is an inherent limitation of the async bind() API when combined with task cancellation.

Solution

I stopped using the async bind() API entirely. Instead, I use the traditional callback-based childChannelInitializer:

Create NIOAsyncChannel directly in childChannelInitializer (synchronous context)
Immediately spawn a Task.detached to handle the connection
Each connection is handled independently, not through a cancellable async stream
Detached tasks are not affected by task group cancellation
Every channel has executeThenClose() called immediately, preventing the writer from being dropped

This approach avoids the async stream entirely, eliminating the race condition.

Changes

Replaced async bind() with traditional childChannelInitializer
Each connection spawns a Task.detached that immediately calls executeThenClose()
Removed the connection iteration loop (no longer needed)
Server task now simply waits for the channel to close
Simplified shutdown logic since there's no async stream to drain

Trade-offs

Uses Task.detached (unstructured concurrency) to bridge NIO's event-loop world with Swift concurrency
This is necessary until NIO provides a new bootstrap API that properly handles cancellation
Each connection is handled independently rather than through structured concurrency

Testing

Tested on fast machines where the race condition was reliably reproducible. The crash no longer occurs.

References

swift-nio#2637 - Known issue with async server channels and cancellation
Comment from NIO maintainer - Recommends avoiding cancellation or using callback-based API

Fixes #635

… error This fixes a known issue with NIO's async server channel API where cancellation can cause accepted connections to be dropped before being read from the async stream, resulting in NIOAsyncWriter being deallocated without finish() being called. The fix replaces the async bind() API with the traditional callback-based childChannelInitializer, handling each connection immediately in a Task.detached to avoid the async stream cancellation race. Fixes #635

Copilot

Pull request overview

This PR addresses a race condition in the Lambda local server that causes a fatal error: "Deinited NIOAsyncWriter without calling finish()". The fix switches from NIO's async bind() API to the traditional callback-based childChannelInitializer approach, eliminating the problematic async stream that could drop unread channels during cancellation.

Changes:

Replaced async bind() with callback-based childChannelInitializer that spawns detached tasks
Simplified server shutdown logic by removing the connection iteration loop
Removed withTaskCancellationHandler wrapper from handleConnection since detached tasks handle cancellation differently

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sources/AWSLambdaRuntime/HTTPServer/Lambda+LocalServer.swift

Copilot · 2026-01-19T16:45:31Z

Sources/AWSLambdaRuntime/HTTPServer/Lambda+LocalServer.swift

+                    Task.detached {
+                        await server.handleConnection(channel: asyncChannel, logger: logger)
+                    }


Using Task.detached means connection handling tasks are not tracked or awaited. When the server shuts down, there's no mechanism to wait for these detached tasks to complete. While this solves the NIOAsyncWriter race condition, it could lead to abrupt connection terminations during shutdown. Consider documenting this limitation or tracking these tasks in a collection if graceful connection shutdown is important for local testing scenarios.

Sources/AWSLambdaRuntime/HTTPServer/Lambda+LocalServer.swift

… error (Bug #635) (#636) On fast machines, the local Lambda server crashes with: ``` Fatal error: Deinited NIOAsyncWriter without calling finish() ``` This occurs in `NIOAsyncChannelHandler.channelActive()` when child connection channels are created. ## Root Cause This is a known issue with NIO's async server channel API (see [swift-nio#2637](apple/swift-nio#2637)). **The fundamental problem:** 1. The async `bind()` API creates `NIOAsyncChannel` instances for incoming connections 2. These channels are yielded through an async stream to the server loop 3. When the serving task is cancelled (or completes), the async stream iteration stops 4. Any channels that were accepted but not yet read from the stream are dropped 5. These unread channels never have `executeThenClose()` called on them 6. Their `NIOAsyncWriter` is deallocated without `finish()` being called → fatal error **Why graceful shutdown doesn't help:** Even closing the server channel gracefully doesn't eliminate the race - there's a timing window where: - A connection is accepted and queued in the async stream - The server task is cancelled or completes - The queued channel is never read and gets dropped IMHO, this is an inherent limitation of the `async bind()` API when combined with task cancellation. ## Solution I stopped using the `async bind()` API entirely. Instead, I use the traditional callback-based `childChannelInitializer`: 1. Create `NIOAsyncChannel` directly in `childChannelInitializer` (synchronous context) 2. Immediately spawn a `Task.detached` to handle the connection 3. Each connection is handled independently, not through a cancellable async stream 4. Detached tasks are not affected by task group cancellation 5. Every channel has `executeThenClose()` called immediately, preventing the writer from being dropped This approach avoids the async stream entirely, eliminating the race condition. ## Changes - Replaced `async bind()` with traditional `childChannelInitializer` - Each connection spawns a `Task.detached` that immediately calls `executeThenClose()` - Removed the connection iteration loop (no longer needed) - Server task now simply waits for the channel to close - Simplified shutdown logic since there's no async stream to drain ## Trade-offs - Uses `Task.detached` (unstructured concurrency) to bridge NIO's event-loop world with Swift concurrency - This is necessary until NIO provides a new bootstrap API that properly handles cancellation - Each connection is handled independently rather than through structured concurrency ## Testing Tested on fast machines where the race condition was reliably reproducible. The crash no longer occurs. ## References - [swift-nio#2637](apple/swift-nio#2637) - Known issue with async server channels and cancellation - [Comment from NIO maintainer](apple/swift-nio#2637 (comment)) - Recommends avoiding cancellation or using callback-based API Fixes #635 --------- Co-authored-by: Sebastien Stormacq <stormacq@amazon.lu>

sebsto self-assigned this Jan 19, 2026

sebsto added the 🔨 semver/patch No public API change. label Jan 19, 2026

sebsto mentioned this pull request Jan 19, 2026

NIOAsyncWriter.InternalClass crashes with assertion failure during deinit on RHEL 10 apple/swift-nio#3481

Closed

sebsto force-pushed the sebsto/fix635 branch from c03a5df to 3dd2edc Compare January 19, 2026 16:29

sebsto requested a review from Copilot January 19, 2026 16:42

Copilot started reviewing on behalf of sebsto January 19, 2026 16:42 View session

Copilot AI reviewed Jan 19, 2026

View reviewed changes

address copilot suggestions

eb9c924

sebsto changed the title ~~Fix race condition in Lambda+LocalServer causing NIOAsyncWriter fatal error~~ Fix race condition in Lambda+LocalServer causing NIOAsyncWriter fatal error (Bug #635) Jan 19, 2026

Merge branch 'main' into sebsto/fix635

67eff4b

sebsto enabled auto-merge (squash) January 27, 2026 09:10

sebsto disabled auto-merge January 27, 2026 09:10

sebsto merged commit 4815273 into main Jan 27, 2026
44 checks passed

sebsto deleted the sebsto/fix635 branch January 27, 2026 09:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Fix race condition in Lambda+LocalServer causing NIOAsyncWriter fatal error (Bug #635)#636

Fix race condition in Lambda+LocalServer causing NIOAsyncWriter fatal error (Bug #635)#636
sebsto merged 3 commits intomainfrom
sebsto/fix635

sebsto commented Jan 19, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI Jan 19, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

sebsto commented Jan 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Root Cause

Solution

Changes

Trade-offs

Testing

References

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI Jan 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

sebsto commented Jan 19, 2026 •

edited

Loading