feat: Add AWS Bedrock Inference Profile ARN resolution support #5793
🦋 Changeset detected. Latest commit: 8f26ece. The changes in this PR will be included in the next version bump. This PR includes changesets to release 12 packages.
Synced with latest main branch changes including:
- Version updates to 5.6.0
- Agent manager improvements
- Slovak translation additions
- Various bug fixes and improvements

Resolved conflicts:
- CHANGELOG.md: Added 5.6.1 entry for Bedrock inference profile support
- src/package.json: Updated to version 5.6.0 from main

Co-Authored-By: Claude Opus 4.6 <[email protected]>
Not needed per se, but I would prefer a more recent model as the default instead of Sonnet 3.5.
I will update the list after this is merged. Our AWS solutions architect also gave us a hint about how the 1M context window works for Sonnet 4.5 and Opus 4.5.
Alright, LGTM then.
Thank you @kevinvandijk for taking the time to review these changes. Feel free to suggest further changes.
This PR adds full support for AWS Bedrock inference profiles (both `application-inference-profile` and `inference-profile` ARNs) with automatic model resolution to detect underlying model capabilities. This fixes an issue where prompt caching was not being enabled for inference profile ARNs because the extension couldn't determine the underlying model's capabilities.
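The capability-detection step described above can be sketched roughly as follows. This is a minimal illustration, not the PR's resolver: it assumes the underlying model ARN has already been obtained (presumably through a Bedrock API call such as `GetInferenceProfile`, which is an assumption about the implementation), and `ModelCapabilities`, `KNOWN_MODELS`, `modelIdFromModelArn`, and `capabilitiesForModelArn` are hypothetical names with placeholder values.

```typescript
// Hypothetical capability record; the extension's real model metadata is richer.
interface ModelCapabilities {
  supportsPromptCache: boolean;
}

// Placeholder table for illustration only; values are not taken from the PR.
const KNOWN_MODELS: Record<string, ModelCapabilities> = {
  "anthropic.claude-3-5-sonnet-20241022-v2:0": { supportsPromptCache: true },
};

// Conservative fallback used when the underlying model cannot be identified.
const FALLBACK: ModelCapabilities = { supportsPromptCache: false };

// Extracts the model ID from a foundation-model ARN of the form
// arn:aws:bedrock:<region>::foundation-model/<model-id>.
function modelIdFromModelArn(modelArn: string): string | undefined {
  const marker = "foundation-model/";
  const index = modelArn.indexOf(marker);
  return index === -1 ? undefined : modelArn.slice(index + marker.length);
}

// Maps a resolved model ARN to capabilities, falling back to "no caching"
// when the ARN is not a foundation-model ARN or the model is unknown.
function capabilitiesForModelArn(modelArn: string): ModelCapabilities {
  const modelId = modelIdFromModelArn(modelArn);
  if (modelId === undefined) {
    return FALLBACK;
  }
  return KNOWN_MODELS[modelId] ?? FALLBACK;
}
```

Falling back to `supportsPromptCache: false` keeps the behavior safe when resolution fails; whether the PR makes the same choice is not stated in the description.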
Implementation
The implementation includes:
- New Bedrock Inference Profile Resolver (`src/api/providers/bedrock-inference-profile-resolver.ts`)
  - `@aws-sdk/client-bedrock` dependency for AWS Bedrock API access
- Enhanced Bedrock Provider (`src/api/providers/bedrock.ts`)
- Improved Settings UI (`webview-ui/src/components/settings/providers/BedrockCustomArn.tsx`)
- Comprehensive Test Coverage (`src/api/providers/__tests__/`)
Screenshots
How to Test
- `arn:aws:bedrock:us-east-1:123456789012:application-inference-profile/my-profile`
- `arn:aws:bedrock:us-west-2:inference-profile/us.anthropic.claude-3-5-sonnet-20241022-v2:0`
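For reference, the two ARN shapes listed above could be told apart with a small parser like the one below. This is a hedged sketch, not code from the PR: `parseInferenceProfileArn` and `ParsedProfileArn` are hypothetical names, only the `aws` partition is handled, and the account ID segment is treated as optional solely because the second sample ARN is written without one.

```typescript
type ProfileKind = "application-inference-profile" | "inference-profile";

interface ParsedProfileArn {
  region: string;
  accountId?: string; // absent when the ARN is written without an account segment
  kind: ProfileKind;
  profileId: string;
}

// Assumed pattern: arn:aws:bedrock:<region>:[<account-id>:]<kind>/<profile-id>
const PROFILE_ARN_PATTERN =
  /^arn:aws:bedrock:([a-z0-9-]+):(?:(\d{12}):)?(application-inference-profile|inference-profile)\/(.+)$/;

// Returns the parsed parts, or undefined if the string is not an
// inference profile ARN in one of the two shapes above.
function parseInferenceProfileArn(arn: string): ParsedProfileArn | undefined {
  const match = PROFILE_ARN_PATTERN.exec(arn);
  if (match === null) {
    return undefined;
  }
  const [, region, accountId, kind, profileId] = match;
  return {
    region,
    accountId, // stays undefined when the optional group did not match
    kind: kind as ProfileKind,
    profileId,
  };
}
```

An `application-inference-profile` ID such as `my-profile` carries no model information, which is why a lookup against Bedrock is needed for that kind; the system-defined profile ID in the second ARN happens to embed a model name.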