Skip to content

Conversation

@tanujnay112
Copy link
Contributor

@tanujnay112 tanujnay112 commented Oct 2, 2025

Description of changes

Summarize the changes made by this PR.

  • Improvements & Bug fixes
    • Added db models and SysDB routes to Create, Delete and Get Tasks.
  • New functionality
    • ^^^
      CreateTask here does not kick off the requisite backfill task runs in this change. That is left as followup work.

Test plan

Tests have been added in go/pkg/sysdb/metastore/db/dao/task_test.go.

  • Tests pass locally with pytest for python, yarn test for js, cargo test for rust

Migration plan

Are there any migrations, or any forwards/backwards compatibility changes needed in order to make sure this change deploys reliably?

Observability plan

What is the plan to instrument and monitor this change?

Documentation Changes

Are all docstrings for user-facing APIs updated if required? Do we need to make documentation changes in the _docs section?_

Copy link
Contributor Author

tanujnay112 commented Oct 2, 2025

@github-actions
Copy link

github-actions bot commented Oct 2, 2025

Reviewer Checklist

Please leverage this checklist to ensure your code review is thorough before approving

Testing, Bugs, Errors, Logs, Documentation

  • Can you think of any use case in which the code does not behave as intended? Have they been tested?
  • Can you think of any inputs or external events that could break the code? Is user input validated and safe? Have they been tested?
  • If appropriate, are there adequate property based tests?
  • If appropriate, are there adequate unit tests?
  • Should any logging, debugging, tracing information be added or removed?
  • Are error messages user-friendly?
  • Have all documentation changes needed been made?
  • Have all non-obvious changes been commented?

System Compatibility

  • Are there any potential impacts on other parts of the system or backward compatibility?
  • Does this change intersect with any items on our roadmap, and if so, is there a plan for fitting them together?

Quality

  • Is this code of a unexpectedly high quality (Readability, Modularity, Intuitiveness)

@tanujnay112 tanujnay112 changed the title Add models and migration for Create/Delete/Get Tasks [ENH]: Add models and migration for Create/Delete/Get Tasks Oct 2, 2025
@blacksmith-sh blacksmith-sh bot deleted a comment from tanujnay112 Oct 2, 2025
@tanujnay112 tanujnay112 requested a review from rescrv October 2, 2025 21:44
// Segment metadata errors
ErrUnknownSegmentMetadataType = errors.New("segment metadata value type not supported")

// Task errors
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

small nit that I'd be better off taking up with the creators of go: Static strings for errors are hard to debug, especially if they occur at multiple places. Can we at least embed some information about what the error means? Like context about it.

)

// CreateTask creates a new task in the database
func (s *Coordinator) CreateTask(ctx context.Context, req *coordinatorpb.CreateTaskRequest) (*coordinatorpb.CreateTaskResponse, error) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the transaction somehow carried on the context? Otherwise this is not transactional code.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It is not, I was going to leave making task creation + collection creation a follow up to this because collection creation also needs to be refactored to take a transaction.

@tanujnay112 tanujnay112 force-pushed the task_api_db_schema branch 5 times, most recently from e96d01b to 728aefb Compare October 3, 2025 08:58
@tanujnay112 tanujnay112 marked this pull request as ready for review October 3, 2025 09:15
@tanujnay112 tanujnay112 requested a review from rescrv October 3, 2025 09:16
@propel-code-bot
Copy link
Contributor

propel-code-bot bot commented Oct 3, 2025

Task/Operator System: Add Models, Migrations, API, and Codegen Synchronization for Create/Delete/Get Tasks

This PR introduces a new system for defining, managing, and retrieving 'task' objects and their associated 'operators' in the metadata database. The changes include new database tables (tasks, operators), GORM models and DAOs in Go, gRPC and Protobuf APIs, associated test coverage, Rust bindings/codegen to guarantee cross-language constant consistency, and supporting logic in both Go and Rust services. The implementation adds transactional API endpoints to create, delete, and retrieve tasks and exposes operator metadata via API, with robust validation and idempotency features. Additionally, there is cross-codebase automation and test enforcement to avoid migration/constant drift between Go, Rust, and SQL.

Key Changes

• Created new tasks and operators tables in the metadata database with appropriate unique constraints and example data
• Added GORM models (Task, Operator), interfaces, DAOs, and mock implementations for testability
• Implemented transactional SysDB API endpoints (gRPC/Protobuf) for CreateTask, GetTaskByName, DeleteTask, and GetOperators
• Migration, model validations, and soft-delete flow (renaming with _deleted_ prefix) for tasks; explicit error codes for duplicate, invalid name, and not-found scenarios
• Rust-side: added operator/task model definitions, and an automated build system in rust/types/operator_codegen.rs to parse Go constants and generate Rust constants, enforced via tests
• Test suites (unit/integration) to verify all major task flows and enforce drift checks between database, Go, and Rust operator constants
• Tiltfile, Dockerfile, and build pipeline updates to ensure new Go constant files are available for Rust build/codegen, keeping images reproducible and up-to-date

Affected Areas

go/pkg/sysdb/metastore/db/dao/ (new DAOs: task.go, operator.go)
go/pkg/sysdb/metastore/db/dbmodel/ (models and interface changes for task/operator)
go/pkg/sysdb/metastore/db/migrations/20251001073000.sql (database changes for new tables)
idl/chromadb/proto/coordinator.proto (API/proto changes)
go/pkg/sysdb/coordinator/task.go (coordinator logic)
go/pkg/sysdb/grpc/task_service.go (expose SysDB APIs)
rust/types/ (operator codegen, new Rust models, auto-generated/constants, operators.rs)
rust/types/build.rs, rust/types/operator_codegen.rs (build and cross-language codegen)
Tiltfile, rust/Dockerfile (build/deploy infra so Rust can pick up Go constants)

This summary was automatically generated by @propel-code-bot

return &coordinatorpb.GetTaskByNameResponse{}, nil
}

// Look up operator name from operators table
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[PerformanceOptimization]

N+1 query pattern: In GetTaskByName, after retrieving the task, there's a separate query to get the operator details. If this function is called multiple times (e.g., in a batch operation), each call will make 2 database queries instead of using a JOIN.

Suggested change
// Look up operator name from operators table
// Consider using a JOIN query to fetch task and operator in one query:
task, err := s.catalog.metaDomain.TaskDb(ctx).GetByNameWithOperator(req.InputCollectionId, req.TaskName)

Committable suggestion

Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

Context for Agents
[**PerformanceOptimization**]

N+1 query pattern: In `GetTaskByName`, after retrieving the task, there's a separate query to get the operator details. If this function is called multiple times (e.g., in a batch operation), each call will make 2 database queries instead of using a JOIN.

```suggestion
// Consider using a JOIN query to fetch task and operator in one query:
task, err := s.catalog.metaDomain.TaskDb(ctx).GetByNameWithOperator(req.InputCollectionId, req.TaskName)
```

⚡ **Committable suggestion**

Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

File: go/pkg/sysdb/coordinator/task.go
Line: 127

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think this is a priority if we put the operation in a transaction.

@blacksmith-sh blacksmith-sh bot deleted a comment from tanujnay112 Oct 3, 2025
}

// Debug logging
log.Info("Found task", zap.String("task_id", task.ID.String()), zap.String("name", task.Name), zap.String("input_collection_id", task.InputCollectionID), zap.String("output_collection_id", task.OutputCollectionID))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[BestPractice]

This log message for a successful GetTaskByName operation seems more appropriate for the Debug level. Logging successful reads at the Info level can create a lot of noise in production environments, making it harder to spot actual informational messages or warnings.

Context for Agents
[**BestPractice**]

This log message for a successful `GetTaskByName` operation seems more appropriate for the `Debug` level. Logging successful reads at the `Info` level can create a lot of noise in production environments, making it harder to spot actual informational messages or warnings.

File: go/pkg/sysdb/coordinator/task.go
Line: 151

@blacksmith-sh blacksmith-sh bot deleted a comment from tanujnay112 Oct 3, 2025
Copy link
Contributor

@rescrv rescrv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comments inline. Propel hit a few out of the park that we should look into.

@tanujnay112 tanujnay112 force-pushed the task_api_db_schema branch 3 times, most recently from c210ac9 to 067ee2a Compare October 6, 2025 21:24
@tanujnay112 tanujnay112 requested a review from rescrv October 6, 2025 21:29
@@ -1,3 +1,5 @@
mod operator_codegen;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[CriticalError]

The build is failing because the operator_codegen module cannot be found. It seems the file rust/types/operator_codegen.rs, which should contain this module, was not included in the pull request. This is causing CI failures across all Rust-related jobs.

Please add the missing file to resolve the build errors.

Context for Agents
[**CriticalError**]

The build is failing because the `operator_codegen` module cannot be found. It seems the file `rust/types/operator_codegen.rs`, which should contain this module, was not included in the pull request. This is causing CI failures across all Rust-related jobs.

Please add the missing file to resolve the build errors.

File: rust/types/build.rs
Line: 1

);

-- Create "task_templates" table
CREATE TABLE "public"."task_templates" (
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[BestPractice]

This migration creates a task_templates table, but I don't see any corresponding Go models (dbmodel) or logic in this PR that uses this table. Is this intended for future work? If so, it might be better to move the creation of task_templates to a future PR where it's actually implemented. This would keep this PR focused on the task and operator functionality and avoid introducing unused schema into the database.

Context for Agents
[**BestPractice**]

This migration creates a `task_templates` table, but I don't see any corresponding Go models (`dbmodel`) or logic in this PR that uses this table. Is this intended for future work? If so, it might be better to move the creation of `task_templates` to a future PR where it's actually implemented. This would keep this PR focused on the task and operator functionality and avoid introducing unused schema into the database.

File: go/pkg/sysdb/metastore/db/migrations/20251001073000.sql
Line: 46

Comment on lines +122 to +129
if err == common.ErrTaskAlreadyExists {
log.Error("CreateTask: task already exists")
return common.ErrTaskAlreadyExists
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[BestPractice]

The current implementation has inconsistent behavior for handling pre-existing tasks. If a task is found with GetByName on line 30, its ID is returned successfully. However, if a task is created concurrently between that check and the Insert call (a race condition), the function returns ErrTaskAlreadyExists.

This could be confusing for clients. For robust idempotency, it would be better to handle the race condition by re-fetching the concurrently created task and returning its ID, making the function's behavior consistent in all "already exists" scenarios.

Consider replacing the error return with logic to re-fetch the task to ensure the CreateTask operation is fully idempotent.

Context for Agents
[**BestPractice**]

The current implementation has inconsistent behavior for handling pre-existing tasks. If a task is found with `GetByName` on line 30, its ID is returned successfully. However, if a task is created concurrently between that check and the `Insert` call (a race condition), the function returns `ErrTaskAlreadyExists`.

This could be confusing for clients. For robust idempotency, it would be better to handle the race condition by re-fetching the concurrently created task and returning its ID, making the function's behavior consistent in all "already exists" scenarios.

Consider replacing the error return with logic to re-fetch the task to ensure the `CreateTask` operation is fully idempotent.

File: go/pkg/sysdb/coordinator/task.go
Line: 125

@tanujnay112 tanujnay112 force-pushed the task_api_db_schema branch 3 times, most recently from 51e7975 to a4823d6 Compare October 7, 2025 08:08
use std::fs;
use std::path::Path;

pub fn generate_operator_constants() -> Result<(), Box<dyn std::error::Error>> {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[BestPractice]

The current parsing logic for the Go constants file is a bit fragile as it relies on split and strip_prefix. This might be the cause of the CI failures (Invalid UUID length: type). To make it more robust, consider using regular expressions to precisely match and extract the constant names and their values. This would prevent accidentally parsing comments or other lines that might coincidentally match the simple string checks.

For example:

use regex::Regex;

// For UUIDs:
let re_uuid = Regex::new(r#"^\s*([A-Za-z0-9_]+)\s*=\s*uuid\.MustParse\("([a-fA-F0-9-]+)"\)"#).unwrap();

// For names:
let re_name = Regex::new(r#"^\s*([A-Za-z0-9_]+)\s*=\s*"([a-z_]+)""#).unwrap();

// Then iterate through lines and use captures from the regex match.
Context for Agents
[**BestPractice**]

The current parsing logic for the Go constants file is a bit fragile as it relies on `split` and `strip_prefix`. This might be the cause of the CI failures (`Invalid UUID length: type`). To make it more robust, consider using regular expressions to precisely match and extract the constant names and their values. This would prevent accidentally parsing comments or other lines that might coincidentally match the simple string checks.

For example:
```rust
use regex::Regex;

// For UUIDs:
let re_uuid = Regex::new(r#"^\s*([A-Za-z0-9_]+)\s*=\s*uuid\.MustParse\("([a-fA-F0-9-]+)"\)"#).unwrap();

// For names:
let re_name = Regex::new(r#"^\s*([A-Za-z0-9_]+)\s*=\s*"([a-z_]+)""#).unwrap();

// Then iterate through lines and use captures from the regex match.
```

File: rust/types/operator_codegen.rs
Line: 8

@blacksmith-sh blacksmith-sh bot deleted a comment from tanujnay112 Oct 7, 2025
@tanujnay112 tanujnay112 force-pushed the task_api_db_schema branch 2 times, most recently from ad3767e to 3baa37f Compare October 7, 2025 10:29
'rust-log-service',
'docker image tag rust-log-service:ci $EXPECTED_REF',
['./rust/', './idl/', './Cargo.toml', './Cargo.lock'],
['./rust/', './idl/', './Cargo.toml', './Cargo.lock', './go/pkg/sysdb/metastore/db/dbmodel/constants.go'],
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[BestPractice]

The list of dependencies ['./rust/', './idl/', './Cargo.toml', './Cargo.lock', './go/pkg/sysdb/metastore/db/dbmodel/constants.go'] is repeated for all Rust services (rust-log-service, rust-frontend-service, query-service, etc.).

To improve maintainability and keep the file DRY, consider defining this list as a variable at the top and reusing it for each service.

For example:

RUST_DEPS = ['./rust/', './idl/', './Cargo.toml', './Cargo.lock', './go/pkg/sysdb/metastore/db/dbmodel/constants.go']

# ...

if config.tilt_subcommand == "ci":
  custom_build(
    'rust-log-service',
    'docker image tag rust-log-service:ci $EXPECTED_REF',
    RUST_DEPS,
    disable_push=True
  )
else:
  docker_build(
    'rust-log-service',
    '.',
    only=RUST_DEPS,
    dockerfile='./rust/Dockerfile',
    target='log_service'
  )

# ... apply the same for other rust services
Context for Agents
[**BestPractice**]

The list of dependencies `['./rust/', './idl/', './Cargo.toml', './Cargo.lock', './go/pkg/sysdb/metastore/db/dbmodel/constants.go']` is repeated for all Rust services (`rust-log-service`, `rust-frontend-service`, `query-service`, etc.).

To improve maintainability and keep the file DRY, consider defining this list as a variable at the top and reusing it for each service.

For example:
```python
RUST_DEPS = ['./rust/', './idl/', './Cargo.toml', './Cargo.lock', './go/pkg/sysdb/metastore/db/dbmodel/constants.go']

# ...

if config.tilt_subcommand == "ci":
  custom_build(
    'rust-log-service',
    'docker image tag rust-log-service:ci $EXPECTED_REF',
    RUST_DEPS,
    disable_push=True
  )
else:
  docker_build(
    'rust-log-service',
    '.',
    only=RUST_DEPS,
    dockerfile='./rust/Dockerfile',
    target='log_service'
  )

# ... apply the same for other rust services
```

File: Tiltfile
Line: 23

Comment on lines +213 to +216
deleteCollection := &model.DeleteCollection{
ID: collectionUUID,
TenantID: task.TenantID,
DatabaseName: task.DatabaseID,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[CriticalError]

There appears to be a bug here. The model.DeleteCollection struct expects a DatabaseName, but you are passing task.DatabaseID. This will likely cause SoftDeleteCollection to fail because it will be searching for a database with a name that is actually a UUID string.

The task model doesn't store the database name, so you'll need to fetch it. This might require adding a GetDatabaseByID method to the database DAO and interface, as one doesn't seem to exist currently.

Example of what the fix might look like:

// First, fetch the database to get its name
// This assumes a new method GetDatabaseByID is added to the DB interface
database, err := s.catalog.metaDomain.DatabaseDb(ctx).GetDatabaseByID(task.DatabaseID)
if err != nil {
    log.Error("DeleteTask: failed to get database by ID", zap.Error(err), zap.String("database_id", task.DatabaseID))
    return nil, err
}
if database == nil {
    log.Error("DeleteTask: database not found for ID", zap.String("database_id", task.DatabaseID))
    return nil, common.ErrDatabaseNotFound
}

deleteCollection := &model.DeleteCollection{
    ID:           collectionUUID,
    TenantID:     task.TenantID,
    DatabaseName: database.Name, // Use the fetched database name
}
Context for Agents
[**CriticalError**]

There appears to be a bug here. The `model.DeleteCollection` struct expects a `DatabaseName`, but you are passing `task.DatabaseID`. This will likely cause `SoftDeleteCollection` to fail because it will be searching for a database with a name that is actually a UUID string.

The `task` model doesn't store the database name, so you'll need to fetch it. This might require adding a `GetDatabaseByID` method to the database DAO and interface, as one doesn't seem to exist currently.

Example of what the fix might look like:
```go
// First, fetch the database to get its name
// This assumes a new method GetDatabaseByID is added to the DB interface
database, err := s.catalog.metaDomain.DatabaseDb(ctx).GetDatabaseByID(task.DatabaseID)
if err != nil {
    log.Error("DeleteTask: failed to get database by ID", zap.Error(err), zap.String("database_id", task.DatabaseID))
    return nil, err
}
if database == nil {
    log.Error("DeleteTask: database not found for ID", zap.String("database_id", task.DatabaseID))
    return nil, common.ErrDatabaseNotFound
}

deleteCollection := &model.DeleteCollection{
    ID:           collectionUUID,
    TenantID:     task.TenantID,
    DatabaseName: database.Name, // Use the fetched database name
}
```

File: go/pkg/sysdb/coordinator/task.go
Line: 216

Comment on lines +200 to +203
if task == nil {
log.Error("DeleteTask: task not found")
return nil, status.Errorf(codes.NotFound, "task not found")
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[ArchitectureDecision]

In DeleteTask, the function returns a gRPC status.Errorf error directly. This is inconsistent with other functions in the coordinator layer (like GetTaskByName which returns common.ErrTaskNotFound) that return errors from the common package. The coordinator layer should ideally be agnostic of the transport layer. It's better to return common errors and let the gRPC service layer handle the translation to gRPC status codes.

Suggested change
if task == nil {
log.Error("DeleteTask: task not found")
return nil, status.Errorf(codes.NotFound, "task not found")
}
if task == nil {
log.Error("DeleteTask: task not found")
return nil, common.ErrTaskNotFound
}

Committable suggestion

Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

Context for Agents
[**ArchitectureDecision**]

In `DeleteTask`, the function returns a gRPC `status.Errorf` error directly. This is inconsistent with other functions in the coordinator layer (like `GetTaskByName` which returns `common.ErrTaskNotFound`) that return errors from the `common` package. The coordinator layer should ideally be agnostic of the transport layer. It's better to return `common` errors and let the gRPC service layer handle the translation to gRPC status codes.

```suggestion
	if task == nil {
		log.Error("DeleteTask: task not found")
		return nil, common.ErrTaskNotFound
	}
```

⚡ **Committable suggestion**

Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

File: go/pkg/sysdb/coordinator/task.go
Line: 203

Comment on lines +207 to +211
collectionUUID, err := types.ToUniqueID(task.OutputCollectionID)
if err != nil {
log.Error("DeleteTask: invalid output_collection_id", zap.Error(err))
return nil, status.Errorf(codes.InvalidArgument, "invalid output_collection_id: %v", err)
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[ArchitectureDecision]

Similar to the other error handling feedback, returning status.Errorf here couples the coordinator with gRPC. It's better to return a standard Go error and let the caller (the gRPC handler) wrap it in a gRPC status.

Suggested Change
Suggested change
collectionUUID, err := types.ToUniqueID(task.OutputCollectionID)
if err != nil {
log.Error("DeleteTask: invalid output_collection_id", zap.Error(err))
return nil, status.Errorf(codes.InvalidArgument, "invalid output_collection_id: %v", err)
}
collectionUUID, err := types.ToUniqueID(task.OutputCollectionID)
if err != nil {
log.Error("DeleteTask: invalid output_collection_id", zap.Error(err))
return nil, err
}

Committable suggestion

Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

Context for Agents
[**ArchitectureDecision**]

Similar to the other error handling feedback, returning `status.Errorf` here couples the coordinator with gRPC. It's better to return a standard Go error and let the caller (the gRPC handler) wrap it in a gRPC status.

<details>
<summary>Suggested Change</summary>

```suggestion
		collectionUUID, err := types.ToUniqueID(task.OutputCollectionID)
		if err != nil {
			log.Error("DeleteTask: invalid output_collection_id", zap.Error(err))
			return nil, err
		}
```

⚡ **Committable suggestion**

Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

</details>

File: go/pkg/sysdb/coordinator/task.go
Line: 211

@blacksmith-sh blacksmith-sh bot deleted a comment from tanujnay112 Oct 7, 2025
@tanujnay112 tanujnay112 merged commit 53edf57 into main Oct 7, 2025
59 checks passed
tanujnay112 added a commit that referenced this pull request Oct 7, 2025
tanujnay112 added a commit that referenced this pull request Oct 7, 2025
… Tasks (#5546)" (#5570)

## Description of changes

_Summarize the changes made by this PR._

- Improvements & Bug fixes
    - ...
- New functionality
    - ...

## Test plan

_How are these changes tested?_

- [ ] Tests pass locally with `pytest` for python, `yarn test` for js, `cargo test` for rust

## Migration plan

_Are there any migrations, or any forwards/backwards compatibility changes needed in order to make sure this change deploys reliably?_

## Observability plan

_What is the plan to instrument and monitor this change?_

## Documentation Changes

_Are all docstrings for user-facing APIs updated if required? Do we need to make documentation changes in the_ [_docs section](https://github.com/chroma-core/chroma/tree/main/docs/docs.trychroma.com)?_
tanujnay112 added a commit that referenced this pull request Oct 8, 2025
## Description of changes

_Summarize the changes made by this PR._

- Improvements & Bug fixes
    - Added db models and SysDB routes to Create, Delete and Get Tasks.
- New functionality
    - ^^^
`CreateTask` here does not kick off the requisite backfill task runs in this change. That is left as followup work.
## Test plan

Tests have been added in `go/pkg/sysdb/metastore/db/dao/task_test.go`.

- [ ] Tests pass locally with `pytest` for python, `yarn test` for js, `cargo test` for rust

## Migration plan

_Are there any migrations, or any forwards/backwards compatibility changes needed in order to make sure this change deploys reliably?_

## Observability plan

_What is the plan to instrument and monitor this change?_

## Documentation Changes

_Are all docstrings for user-facing APIs updated if required? Do we need to make documentation changes in the_ [_docs section](https://github.com/chroma-core/chroma/tree/main/docs/docs.trychroma.com)?_
tanujnay112 added a commit that referenced this pull request Oct 8, 2025
…5573)

## Description of changes

_Summarize the changes made by this PR._

This is a redo of [this change](#5546) which inadvertently merge conflicted with another change when it landed. That change generated task operator constants in rust by moving the go operator constants file into each service and manually generating a corresponding rust file during each docker containers build phase. This would've required every new service to make sure to copy in this Go file during build even if it didn't otherwise need Go code.

This diff changes that by having a contributor manually generate said constants in rust using a supplied script to avoid the above logistics. There is a rust unit test to make sure the generated constants are in sync with what is prepopulated in the SysDB operators table.

- Improvements & Bug fixes
    - ^^^
- New functionality
    - ...

## Test plan

_How are these changes tested?_

- [ ] Tests pass locally with `pytest` for python, `yarn test` for js, `cargo test` for rust

## Migration plan

_Are there any migrations, or any forwards/backwards compatibility changes needed in order to make sure this change deploys reliably?_

## Observability plan

_What is the plan to instrument and monitor this change?_

## Documentation Changes

_Are all docstrings for user-facing APIs updated if required? Do we need to make documentation changes in the_ [_docs section](https://github.com/chroma-core/chroma/tree/main/docs/docs.trychroma.com)?_
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants