Integrate a JVector codec for KNN searches #14681

@RKSPD

JVector (https://github.com/datastax/jvector) is "a graph-based index that builds on the HNSW and DiskANN designs with composable extensions."

JVector includes a DiskANN implementation and supports multi-phase vector matching with disk-based reranking. The case for JVector as a vector search engine is neatly summarized in this OpenSearch issue.
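To make "multi-phase matching with rerank" concrete, here is a minimal toy sketch in plain Java. It is not JVector's API or implementation: the class `TwoPhaseKnn`, its methods, and the `overquery` parameter are hypothetical names, rounding to integers stands in for a real compression scheme, and the first phase is a brute-force scan rather than a graph traversal. The idea it illustrates is only the general pattern: select candidates cheaply using compressed vectors, then rescore that short list with full-precision vectors (which a disk-based index would read from disk).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

/** Toy two-phase nearest-neighbor search (hypothetical, for illustration only). */
final class TwoPhaseKnn {

    /** Exact squared Euclidean distance between two full-precision vectors. */
    public static float squaredDistance(float[] a, float[] b) {
        float sum = 0f;
        for (int i = 0; i < a.length; i++) {
            float d = a[i] - b[i];
            sum += d * d;
        }
        return sum;
    }

    /** Crude stand-in for compression: round each component to the nearest integer. */
    public static byte[] quantize(float[] v) {
        byte[] q = new byte[v.length];
        for (int i = 0; i < v.length; i++) {
            q[i] = (byte) Math.round(v[i]);
        }
        return q;
    }

    /** Approximate squared distance between a query and a quantized vector. */
    public static float approxDistance(float[] query, byte[] q) {
        float sum = 0f;
        for (int i = 0; i < query.length; i++) {
            float d = query[i] - q[i];
            sum += d * d;
        }
        return sum;
    }

    /**
     * Phase 1: rank all ordinals by approximate distance and keep `overquery` candidates.
     * Phase 2: rescore those candidates with the full-precision vectors and return the top k.
     */
    public static int[] search(float[] query, float[][] full, byte[][] compressed,
                               int k, int overquery) {
        List<Integer> ordinals = new ArrayList<>();
        for (int i = 0; i < compressed.length; i++) {
            ordinals.add(i);
        }
        // Phase 1: cheap, lossy scoring over the compressed vectors.
        ordinals.sort(Comparator.comparingDouble(
                (Integer i) -> approxDistance(query, compressed[i])));
        List<Integer> candidates =
                new ArrayList<>(ordinals.subList(0, Math.min(overquery, ordinals.size())));
        // Phase 2: exact rerank of the short candidate list.
        candidates.sort(Comparator.comparingDouble(
                (Integer i) -> squaredDistance(query, full[i])));
        int n = Math.min(k, candidates.size());
        int[] result = new int[n];
        for (int j = 0; j < n; j++) {
            result[j] = candidates.get(j);
        }
        return result;
    }

    public static void main(String[] args) {
        float[][] full = {{0.45f, 0f}, {0.6f, 0f}, {-0.4f, 0f}, {5f, 5f}};
        byte[][] compressed = new byte[full.length][];
        for (int i = 0; i < full.length; i++) {
            compressed[i] = quantize(full[i]);
        }
        float[] query = {0.4f, 0f};
        // Asking phase 1 for 3 candidates lets the rerank recover the true top 2.
        System.out.println(Arrays.toString(search(query, full, compressed, 2, 3)));
    }
}
```

In this toy data set, vectors 0 and 2 collapse to the same quantized value, so the approximate pass alone cannot tell them apart. Requesting more candidates than `k` in the first phase gives the exact rerank room to correct that, at the cost of reading more full-precision vectors.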

I propose adding a new codec to Lucene as a standalone KnnVectorsFormat, based on OpenSearch's implementation of JVector. This implementation would integrate with Lucene's existing vector APIs and codec SPI.

I am opening this issue to get feedback on the need, implementation, and long-term considerations for this codec.
