Add `embed` to Index `configure` calls #515

austin-denoble · 2025-06-16T20:21:26Z

Problem

embed was never exposed as an argument for calling configure on IndexResource.

Solution

Add a new simple ConfigureIndexEmbed(TypedDict) class for representing the argument dictionary shape. I went with this because it aligned with the existing CreateIndexForModelEmbedTypedDict, but I'm not sure if this is best practice in the repo at this point. Maybe a class would be better.
Update factory, sync, and async resources to pass through embed on configure calls.
Update legacy Pinecone.configure_index method to support embed.
Add integration tests to serverless resources to validate converting an existing serverless index to an integrated index using configure or configure_index.

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update
Infrastructure change (CI configs, etc)
Non-code change (docs, etc)
None of the above: (explain here)

Test Plan

New integration tests added. You can pull this branch down and use poetry to run a repl and quickly evaluate things yourself:

poetry run repl

>>> from pinecone import Pinecone
>>> pc = Pinecone(api_key="YOUR_API_KEY")

>>> pc.create_index(name="test-int-inf-convert", dimension=1024, metric="cosine", spec={"serverless": {"cloud": "aws", "region": "us-east-1"}})
{
    "name": "test-int-inf-convert",
    "metric": "cosine",
    "host": "test-int-inf-convert-bt8x3su.svc.preprod-aws-0.pinecone.io",
    "spec": {
        "serverless": {
            "cloud": "aws",
            "region": "us-east-1"
        }
    },
    "status": {
        "ready": true,
        "state": "Ready"
    },
    "vector_type": "dense",
    "dimension": 1024,
    "deletion_protection": "disabled",
    "tags": null
}

>>> pc.db.index.configure(name="test-int-inf-convert", embed={"model": "multilingual-e5-large", "field_map":{"text": "chunk_text"}})
>>> pc.db.index.describe(name="test-int-inf-convert")
{
    "name": "test-int-inf-convert",
    "metric": "cosine",
    "host": "test-int-inf-convert-bt8x3su.svc.preprod-aws-0.pinecone.io",
    "spec": {
        "serverless": {
            "cloud": "aws",
            "region": "us-east-1"
        }
    },
    "status": {
        "ready": true,
        "state": "Ready"
    },
    "vector_type": "dense",
    "dimension": 1024,
    "deletion_protection": "disabled",
    "tags": null,
    "embed": {
        "model": "multilingual-e5-large",
        "field_map": {
            "text": "chunk_text"
        },
        "dimension": 1024,
        "metric": "cosine",
        "write_parameters": {
            "input_type": "passage",
            "truncate": "END"
        },
        "read_parameters": {
            "input_type": "query",
            "truncate": "END"
        },
        "vector_type": "dense"
    }
}

### repeat with async resources / pc.configure_index()

To see the specific tasks where the Asana app for GitHub is being used, see below:
- https://app.asana.com/0/0/1210417294961252

…igure method, and building in the request factory

…how we're building the configure index request call to be a bit more flexible

…-index-configure

…s ConfigureIndexRequestEmbed as the class to the generated core

… interface and asyncio interface to support the embed parameter on configure

…-index-configure

…reaks

rohanshah18 · 2025-06-18T15:46:51Z

pinecone/__init__.py

@@ -98,6 +98,11 @@
    "RestoreJobList": ("pinecone.db_control.models", "RestoreJobList"),
    "BackupModel": ("pinecone.db_control.models", "BackupModel"),
    "BackupList": ("pinecone.db_control.models", "BackupList"),
+    "ConfigureIndexEmbed": ("pinecone.db_control.types", "ConfigureIndexEmbed"),


what's the reason for adding ConfigureIndexEmbed and CreateIndexForModelEmbedTypedDict here?

Adding the classes to _db_control_lazy_imports, which we seem to do for all of our custom models and types. It seemed like CreateIndexForModelEmbedTypedDict was also not included here, so I added it.

This is to allow lazy loaded imports to be exported from the top level of the package here:

pinecone-python-client/pinecone/__init__.py

Line 145 in dfd0125

*list(_LAZY_IMPORTS.keys()),

yeah I was wondering why we added types here but here's the reason why:
#507 (comment)

rohanshah18

LGTM! Nice work. Also it makes sense to follow CreateIndexForModelEmbed approach.

austin-denoble added 10 commits June 16, 2025 13:46

add ConfigureIndexEmbed model and allow passing in IndexResource conf…

17ef903

…igure method, and building in the request factory

move to using a simpler ConfigureIndexEmbed(TypedDict) class, update …

21c2b06

…how we're building the configure index request call to be a bit more flexible

Merge remote-tracking branch 'origin/main' into adenoble/add-embed-to…

5fec10b

…-index-configure

lint format

7f279ca

export db_control types from the top of the package, make sure to pas…

a5035df

…s ConfigureIndexRequestEmbed as the class to the generated core

add embed to top level Pinecone configure_index method, update legacy…

5b6476e

… interface and asyncio interface to support the embed parameter on configure

Merge remote-tracking branch 'origin/main' into adenoble/add-embed-to…

5a4487a

…-index-configure

Merge remote-tracking branch 'origin/main' into adenoble/add-embed-to…

f00c752

…-index-configure

Merge remote-tracking branch 'origin/main' into adenoble/add-embed-to…

448a9f6

…-index-configure

expose embed on asyncio configure method

7b1146b

austin-denoble marked this pull request as ready for review June 17, 2025 19:00

austin-denoble added 6 commits June 18, 2025 00:15

add integration tests for integrated inference upgrade path

77a2118

fix assertions in embed integration tests

107a1cd

add black defaults in pyproject.toml, undo a bunch of the <100 line b…

48db04e

…reaks

lint

4058f24

use proper fixture name in sync resources test

570c5ff

fix resource test

b4ab820

austin-denoble requested review from rohanshah18 and akelch11 June 18, 2025 08:25

rohanshah18 reviewed Jun 18, 2025

View reviewed changes

rohanshah18 approved these changes Jun 18, 2025

View reviewed changes

austin-denoble merged commit 28c142a into main Jun 18, 2025
68 checks passed

austin-denoble deleted the adenoble/add-embed-to-index-configure branch June 18, 2025 16:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add `embed` to Index `configure` calls #515

Add `embed` to Index `configure` calls #515

austin-denoble commented Jun 16, 2025 •

edited

Loading

Uh oh!

rohanshah18 Jun 18, 2025

Uh oh!

austin-denoble Jun 18, 2025 •

edited

Loading

Uh oh!

rohanshah18 Jun 18, 2025

Uh oh!

rohanshah18 left a comment

Uh oh!

Uh oh!

Uh oh!

Add embed to Index configure calls #515

Add embed to Index configure calls #515

Conversation

austin-denoble commented Jun 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Type of Change

Test Plan

Uh oh!

rohanshah18 Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

austin-denoble Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rohanshah18 Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

rohanshah18 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Add `embed` to Index `configure` calls #515

Add `embed` to Index `configure` calls #515

austin-denoble commented Jun 16, 2025 •

edited

Loading

austin-denoble Jun 18, 2025 •

edited

Loading