Fix concurrency issues due to preconcurrency imports and in MLTensor extension by naykutguven · Pull Request #430 · argmaxinc/WhisperKit

naykutguven · 2026-02-24T20:19:50Z

Swift 6 concurrency fixes for `MLTensor` extensions + async sampler integration

Summary

This PR resolves Swift 6 concurrency issues around MLTensor helper APIs by removing semaphore-based bridging and making tensor conversion helpers natively async. It propagates the async API to token sampling call sites and adds targeted unit tests for coverage.

What changed

Updated MLTensor public helpers to async in Sources/WhisperKit/Utilities/Extensions+Public.swift:
- asIntArray() -> [Int] -> asIntArray() async -> [Int]
- asFloatArray() -> [Float] -> asFloatArray() async -> [Float]
- asMLMultiArray() -> MLMultiArray -> asMLMultiArray() async -> MLMultiArray
Removed DispatchSemaphore + Task blocking patterns in those helpers and replaced them with direct async shapedArray(of:) usage.
Updated token sampling APIs in Sources/WhisperKit/Core/Text/TokenSampler.swift:
- TokenSampling.update(...) is now async
- GreedyTokenSampler.sampleWithMLTensor(...) is now async
- Call sites now await tensor conversions.
Updated decoder call sites in Sources/WhisperKit/Core/TextDecoder.swift to await tokenSampler.update(...).
Added @preconcurrency import for framework interoperability:
- Sources/WhisperKit/Utilities/Extensions+Public.swift: CoreML
- Sources/WhisperKit/Core/Audio/AudioProcessor.swift: AVFoundation
Added new test suite Tests/WhisperKitTests/MLTensorExtensionsTests.swift covering:
- asIntArray
- asFloatArray for Float32, FloatType, and Int32
- asMLMultiArray round-trips for FloatType and Int32

Breaking changes (Before / After)

1) `TokenSampling.update(...)` is now async

Before

public protocol TokenSampling {
    func update(tokens: [Int], logits: MLMultiArray, logProbs: [Float]) -> SamplingResult
}

let sampleResult = tokenSampler.update(tokens: currentTokens, logits: logits, logProbs: logProbs)

After

public protocol TokenSampling {
    func update(tokens: [Int], logits: MLMultiArray, logProbs: [Float]) async -> SamplingResult
}

let sampleResult = await tokenSampler.update(tokens: currentTokens, logits: logits, logProbs: logProbs)

2) `MLTensor` helper methods are now async

Before

let ids = tensor.asIntArray()
let probs = tensor.asFloatArray()
let multiArray = tensor.asMLMultiArray()

After

let ids = await tensor.asIntArray()
let probs = await tensor.asFloatArray()
let multiArray = await tensor.asMLMultiArray()

3) Conforming sampler implementations must be async

Before

public func update(tokens: [Int], logits: MLMultiArray, logProbs: [Float]) -> SamplingResult

After

public func update(tokens: [Int], logits: MLMultiArray, logProbs: [Float]) async -> SamplingResult

Risk

API migration risk for external callers due async signature changes.
Runtime behavior is intended to remain equivalent for supported scalar types.
@preconcurrency is intentionally used as an interoperability bridge with current SDK annotations.

Copilot

Pull request overview

This PR updates WhisperKit’s CoreML MLTensor helper APIs and token sampling flow to be Swift 6 concurrency-safe by making tensor conversion helpers and sampling updates natively async, removing semaphore-based blocking, and adding tests for the new async helpers.

Changes:

Converted MLTensor helpers (asIntArray, asFloatArray, asMLMultiArray) to async and removed semaphore/Task blocking bridging.
Propagated async sampling through TokenSampling.update(...), GreedyTokenSampler.sampleWithMLTensor(...), and TextDecoder call sites.
Added @preconcurrency import for CoreML/AVFoundation interoperability and introduced a new async test suite for the tensor helpers.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
Tests/WhisperKitTests/MLTensorExtensionsTests.swift	Adds async unit tests covering the new async `MLTensor` conversion helpers.
Sources/WhisperKit/Utilities/Extensions+Public.swift	Makes `MLTensor` helper conversions async; adds `@preconcurrency import CoreML`.
Sources/WhisperKit/Core/TextDecoder.swift	Updates decoding flow to `await` the now-async sampler update.
Sources/WhisperKit/Core/Text/TokenSampler.swift	Makes sampling update async and integrates async tensor conversion calls.
Sources/WhisperKit/Core/Audio/AudioProcessor.swift	Switches to `@preconcurrency import AVFoundation` for concurrency annotation compatibility.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sources/WhisperKit/Utilities/Extensions+Public.swift

Sources/WhisperKit/Core/Text/TokenSampler.swift

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

naykutguven added 2 commits February 24, 2026 20:53

Mark CoreML and AVFoundation @preconcurrency

c201da3

Fix concurrency errors in MLTensor extension

6512a39

Copilot AI review requested due to automatic review settings February 24, 2026 20:19

Copilot started reviewing on behalf of naykutguven February 24, 2026 20:20 View session

Copilot AI reviewed Feb 24, 2026

View reviewed changes

Sources/WhisperKit/Utilities/Extensions+Public.swift Outdated Show resolved Hide resolved

Sources/WhisperKit/Core/Text/TokenSampler.swift Show resolved Hide resolved

Fix typo

16cd252

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

naykutguven force-pushed the mltensor-fix branch from b77c9c4 to 5268003 Compare February 25, 2026 10:30

Fix type inference issue

16946dc

naykutguven force-pushed the mltensor-fix branch from 5268003 to 16946dc Compare February 25, 2026 11:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix concurrency issues due to preconcurrency imports and in MLTensor extension#430

Fix concurrency issues due to preconcurrency imports and in MLTensor extension#430
naykutguven wants to merge 4 commits intoargmaxinc:swift-6from
naykutguven:mltensor-fix

naykutguven commented Feb 24, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

naykutguven commented Feb 24, 2026

Swift 6 concurrency fixes for MLTensor extensions + async sampler integration

Summary

What changed

Breaking changes (Before / After)

1) TokenSampling.update(...) is now async

2) MLTensor helper methods are now async

3) Conforming sampler implementations must be async

Risk

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Swift 6 concurrency fixes for `MLTensor` extensions + async sampler integration

1) `TokenSampling.update(...)` is now async

2) `MLTensor` helper methods are now async