[SCIM 4/4]: Draw the rest of the owl #9276

jmpesp · 2025-10-23T14:07:10Z

The majority of this PR fills out the CrdbScimProviderStore implementation that the scim2-rs crate's Provider object will use to implement SCIM where CRDB is the durable store for users and groups, and adds a boat load of integration tests.

The notion of a silo user being "active" or not has also been added to support users being deactivated by the SCIM client. Non-SCIM silo users should not be affected.

The majority of this PR fills out the CrdbScimProviderStore implementation that the scim2-rs crate's Provider object will use to implement SCIM where CRDB is the durable store for users and groups, and adds a boat load of integration tests. The notion of a silo user being "active" or not has also been added to support users being deactivated by the SCIM client. Non-SCIM silo users should not be affected.

nexus/db-queries/src/db/datastore/scim_provider_store.rs

david-crespo · 2025-10-30T22:20:04Z

nexus/db-queries/src/db/datastore/scim_provider_store.rs

+        for user in users {
+            let groups = self
+                .get_user_groups_for_user_in_txn(conn, user.identity.id.into())
+                .await?;
+
+            let SiloUser::Scim(user) = user.into() else {
+                // With the user provision type filter, this should never be
+                // another type.
+                unreachable!();
+            };
+
+            returned_users.push(convert_to_scim_user(user, groups));
+        }


This should be able to be done in one query with the list of user IDs

Follow on PR by @david-crespo in #9325

Since this is classified as a performance optimization I am inclined to say that we should punt on the addition of a follow on PR until after R17 is cut. The main reasoning behind this is all of the testing done so far by me, James, Angela, and Jay has been against the current implementation and the window for R17 is closing asap.

I would like to give the perf optimaztion more time to soak as we already know some of this code will have to change as we have to fix oxidecomputer/scim2-rs#7 relatively soon anyways.

I am concerned about running 1000+ queries in a single GET request — has that been exercised?

Using this gist we compared the time to list users before and after my changes and it was 75-80% faster after the optimization, at least on my machine. However both response times were tolerable — the slower one was 260-290ms. The fast one was 62-65ms. On @papertigers machine the slow version was more like 800-900ms.

https://gist.github.com/papertigers/5aa9f341797e0a82e66c3f8188c9701f

I modified one of the integration tests to do the following:

Create 1000 users

Create a group with those 1000 users as member

Captured the time it takes to hit the GET /Users endpoint

On my machine over multiple runs it seemed to take ~700-900ms.
Again, I think this should be improved upon and I suspect many optimizations can be made here. Regardless I think some of this code will need to be modified anyways to support pagination.

Thoughts on having this go in as is?

We probably want to get this landed as is and follow up with tuning work afterwards to unblock customers waiting to validate their integration workflow.

Yeah, if it's tolerable for now and considered a starting point anyway then that's fine.

nexus/db-queries/src/db/datastore/scim_provider_store.rs

david-crespo · 2025-10-30T22:24:41Z

nexus/db-queries/src/db/datastore/scim_provider_store.rs

+        let SiloUser::Scim(user) = user.into() else {
+            // With the user provision type filter, this should never be another
+            // type.
+            unreachable!();


Maybe better to 500 here? This will be a 500, I guess, but doing it explicitly lets you stick a message on it. This could be an argument against using diesel::result::Error as your error type for these functions. Most of the datastore functions return external::Error.

I started prototyping a new error type that can be returned however the transaction retry calls are expecting a diesel::result::Error . So I don't think we can return another type there, which is probably fine since we do have the filter on the query already.

david-crespo · 2025-10-31T19:20:32Z

nexus/db-queries/src/db/datastore/scim_provider_store.rs

+        let maybe_other_user = dsl::silo_user
+            .filter(dsl::silo_id.eq(self.authz_silo.id()))
+            .filter(dsl::user_provision_type.eq(model::UserProvisionType::Scim))
+            .filter(lower(dsl::user_name).eq(lower(user_request.name.clone())))


Because the DB itself isn't enforcing case-sensitive uniqueness, is it possible that two concurrent requests could write different cases of the same name at the same time? You might want a unique index like CREATE UNIQUE INDEX ON silo_user (silo_id, LOWER(user_name)) WHERE user_provision_type = 'scim' AND time_deleted IS NULL.

I don't think two concurrent requests could race in this way because of the other calls to lower in the filters but a belt and suspenders approach here is the right one.

done in 6285129

papertigers · 2025-10-31T22:06:19Z

nexus/db-model/src/schema_versions.rs

        // v
        // KnownVersion::new(next_int, "unique-dirname-with-the-sql-files"),
+        KnownVersion::new(203, "scim-users-and-groups-lower"),
+        KnownVersion::new(202, "scim-actor-audit-log"),


Oops I didn't realize this PR already touched this and can probably collapse it into one version bump. Will fix this soonish when back at the keyboard.

jmpesp requested a review from papertigers October 23, 2025 14:07

david-crespo reviewed Oct 23, 2025

View reviewed changes

nexus/db-queries/src/db/datastore/scim_provider_store.rs Show resolved Hide resolved

david-crespo reviewed Oct 23, 2025

View reviewed changes

nexus/db-queries/src/db/datastore/scim_provider_store.rs Show resolved Hide resolved

Add OpContext to CrdbScimProviderStore

f884d15

morlandi7 added this to the 17 milestone Oct 29, 2025

papertigers added 2 commits October 29, 2025 20:15

Use OpContext in scim endpoints and update polar

a2c88be

Users and groups deletion leave behind role assignment records

3d1a4d5

papertigers mentioned this pull request Oct 30, 2025

Users and groups deletion leave behind role assignment records #8605

Open

Remove sessions and auth on user delete

13470cb

papertigers mentioned this pull request Oct 30, 2025

Bump scim2-rs crate dep (has unauthorized status check) #9322

Open

david-crespo reviewed Oct 30, 2025

View reviewed changes

nexus/db-queries/src/db/datastore/scim_provider_store.rs Outdated Show resolved Hide resolved

david-crespo reviewed Oct 30, 2025

View reviewed changes

Fix missed pool_connection_authorized calls

f73efa2

david-crespo reviewed Oct 31, 2025

View reviewed changes

papertigers added 2 commits October 31, 2025 16:00

merge main

7546318

Add LOWER to scim users and groups

6285129

papertigers reviewed Oct 31, 2025

View reviewed changes

[SCIM 4/4]: Draw the rest of the owl #9276

Are you sure you want to change the base?

[SCIM 4/4]: Draw the rest of the owl #9276

Conversation

jmpesp commented Oct 23, 2025

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

david-crespo Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

papertigers Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

david-crespo Oct 31, 2025 •

edited

Loading

papertigers Oct 31, 2025 •

edited

Loading