
@zhixiangxue

Problem

Different embedding API providers have different batch size limits:

  • OpenAI: no strict per-request limit documented; this PR uses 25 items per batch as a safe default
  • Bailian/DashScope: max 10 items per batch (strict limit)
  • Other providers may have different limits

Currently, MemU processes all embeddings in a single batch, which causes errors when the batch size exceeds the provider's limit.

Solution

This PR adds a configurable batch_size parameter to handle provider-specific limits:

  1. Added batch_size to EmbeddingConfig

    • Default: 25 (suitable for OpenAI)
    • Users can configure it based on their provider (e.g., 10 for Bailian)
  2. Implemented batch processing in OpenAIEmbeddingSDKClient

    • Automatically splits large input lists into smaller batches (see the sketch after this list)
    • Optimized: skips batching when input size <= batch_size
  3. Updated service initialization

    • Passes batch_size from config to embedding client
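
A minimal sketch of the batching logic, assuming the OpenAI Python SDK; the class and method names follow the PR, but the constructor signature and the _embed_batch helper are illustrative, not the exact MemU code:

from openai import OpenAI

class OpenAIEmbeddingSDKClient:  # simplified sketch, not the full MemU class
    def __init__(self, api_key: str, base_url: str, embed_model: str, batch_size: int = 25):
        self._client = OpenAI(api_key=api_key, base_url=base_url)
        self.embed_model = embed_model
        self.batch_size = batch_size

    def embed(self, texts: list[str]) -> list[list[float]]:
        # Fast path: skip splitting when the input already fits in one batch
        if len(texts) <= self.batch_size:
            return self._embed_batch(texts)
        # Otherwise split into chunks of at most batch_size and concatenate the results
        embeddings: list[list[float]] = []
        for start in range(0, len(texts), self.batch_size):
            embeddings.extend(self._embed_batch(texts[start:start + self.batch_size]))
        return embeddings

    def _embed_batch(self, texts: list[str]) -> list[list[float]]:
        # One API call per batch; the response preserves input order
        response = self._client.embeddings.create(model=self.embed_model, input=texts)
        return [item.embedding for item in response.data]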

Changes

  • src/memu/app/settings.py: Add batch_size field to EmbeddingConfig (see the config sketch after this list)
  • src/memu/embedding/openai_sdk.py:
    • Add batch_size parameter to __init__
    • Implement batch processing in embed() method
  • src/memu/app/service.py: Pass batch_size to embedding client
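
The settings and wiring changes might look roughly like this; a sketch assuming EmbeddingConfig is a Pydantic model (the field names mirror the embedding_config keys in the usage example below, but the exact MemU definitions may differ):

from pydantic import BaseModel

class EmbeddingConfig(BaseModel):
    base_url: str
    api_key: str
    embed_model: str
    batch_size: int = 25  # default suits OpenAI; lower it for stricter providers such as Bailian

# In service initialization, the configured value is forwarded to the client, e.g.:
# client = OpenAIEmbeddingSDKClient(..., batch_size=config.batch_size)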

Example Usage

from memu.app import MemoryService

# For Bailian/DashScope (max 10 per batch)
embedding_config = {
    "base_url": "https://dashscope.aliyuncs.com/compatible-mode/v1",
    "api_key": "YOUR_KEY",
    "embed_model": "text-embedding-v3",
    "batch_size": 10  # Configure batch size
}

service = MemoryService(embedding_config=embedding_config)
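
Given the splitting described above, a call that embeds 25 texts with batch_size set to 10 would be sent as three requests (10 + 10 + 5) instead of one oversized call.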
