Skip to content

Conversation

@KyleAMathews
Copy link
Collaborator

@KyleAMathews KyleAMathews commented Oct 15, 2025

Overview

This PR extends Query Collections to support predicate pushdown from live queries by enabling multiple concurrent queries with different predicates/filters. When live queries push down predicates via loadSubset, Query Collections now create separate TanStack Query instances for each unique set of options, pass those options to both the query key builder and query function, and manage the lifecycle of multiple queries with proper reference counting and garbage collection.

Problem

When live queries push down predicates (via loadSubset), Query Collections need to:

  1. Pass LoadSubsetOptions (predicates, limits, ordering) to the query key builder and query function
  2. Create separate TanStack Query instances for each unique set of options
  3. Track which rows belong to which queries (reference counting)
  4. Only remove rows when no active queries reference them
  5. Handle query garbage collection when queries are no longer needed
  6. Return promises that resolve when query data is first available

Without this, Query Collections couldn't properly support the on-demand sync mode introduced in #669.

Solution

This PR implements a comprehensive multi-query management system that flows predicates from live queries through to your TanStack Query implementation:

Predicate Flow

When a live query calls loadSubset with predicates, those options flow through the system:

  1. Live Query → calls collection._sync.loadSubset(options)
  2. Query Collection → calls createQueryFromOpts(options)
  3. Query Key Builder → receives options to create unique query key
  4. Query Function → receives options via context.meta.loadSubsetOptions

Example Usage

import { createCollection } from "@tanstack/react-db"
import { queryCollectionOptions } from "@tanstack/query-db-collection"

const itemsPerPage = 20

export const todoCollection = createCollection(
  queryCollectionOptions({
    syncMode: 'on-demand', // Enable predicate pushdown
    
    // Query key builder receives LoadSubsetOptions
    queryKey: ({ limit, orderBy, where }) => {
      const page = computePageNumber(limit ?? itemsPerPage)
      return ["todos", { page, where, orderBy }]
    },

    // Query function receives options via context.meta.loadSubsetOptions
    queryFn: async (ctx) => {
      const { limit, where, orderBy } = ctx.meta?.loadSubsetOptions ?? {}
      const page = computePageNumber(limit ?? itemsPerPage)
      
      const params = new URLSearchParams({
        page: page.toString(),
        ...(where && { filter: JSON.stringify(where) }),
        ...(orderBy && { sort: JSON.stringify(orderBy) }),
      })
      
      const res = await fetch(\`/api/todos?\${params}\`)
      if (!res.ok) throw new Error("Failed to fetch todos")
      return res.json()
    },

    getKey: (item) => item.id,
    schema: todoSchema,
  })
)

function computePageNumber(limit: number): number {
  return Math.max(1, Math.ceil(limit / itemsPerPage))
}

In this example:

  • The queryKey function builds different cache keys based on page/filters
  • The queryFn receives the same options via ctx.meta.loadSubsetOptions to fetch the right data
  • Each unique combination of predicates creates a separate TanStack Query
  • Rows are reference-counted across all queries

1. Dynamic Query Keys

Type: Added TQueryKeyBuilder<TQueryKey> type

type TQueryKeyBuilder<TQueryKey> = (opts: LoadSubsetOptions) => TQueryKey

interface QueryCollectionConfig {
  queryKey: TQueryKey | TQueryKeyBuilder<TQueryKey>
  // ... other properties
}

The queryKey config option now accepts either:

  • A static query key (for eager mode with no predicates)
  • A function that builds a query key from LoadSubsetOptions (for on-demand mode with predicate pushdown)

LoadSubsetOptions Structure:

interface LoadSubsetOptions {
  where?: any      // Filter predicates
  orderBy?: any    // Ordering directives
  limit?: number   // Result limit
  offset?: number  // Result offset
}

2. Meta Property Extension

When creating a query, the collection merges LoadSubsetOptions into the query's meta:

const extendedMeta = { ...meta, loadSubsetOptions: opts }

const observerOptions = {
  queryKey: key,
  queryFn: queryFn,
  meta: extendedMeta,  // Contains loadSubsetOptions
  // ... other options
}

Your queryFn can then access these options via context.meta.loadSubsetOptions to fetch the appropriate data.

3. Multi-Query Tracking System

Implemented comprehensive state tracking using Maps:

// hashedQueryKey → QueryKey
const hashToQueryKey = new Map<string, QueryKey>()

// hashedQueryKey → Set<RowKey> (which rows belong to which query)
const queryToRows = new Map<string, Set<string | number>>()

// RowKey → Set<hashedQueryKey> (which queries reference each row)
const rowToQueries = new Map<string | number, Set<string>>()

// hashedQueryKey → QueryObserver (active query observers)
const observers = new Map<string, QueryObserver>()

// hashedQueryKey → unsubscribe function
const unsubscribes = new Map<string, () => void>()

Reference Counting: Rows are only deleted from the collection when their reference count drops to zero (no queries reference them anymore).

4. createQueryFromOpts Function

New internal function that creates or reuses queries based on LoadSubsetOptions:

Return Type: true | Promise<void>

  • Returns true synchronously if query data is already available
  • Returns Promise<void> that resolves when query data loads
  • Returns Promise<void> that rejects if query encounters an error

Behavior:

  • Hashes the query key (built from options) to check for existing queries
  • Reuses existing QueryObserver instances when the same predicates are requested
  • Creates new observers for unique predicate combinations
  • Passes options to both query key builder and query function (via meta.loadSubsetOptions)
  • Automatically subscribes if sync has started or collection has subscribers

5. Query Garbage Collection

Listens to TanStack Query's cache events to handle query removal:

queryClient.getQueryCache().subscribe((event) => {
  if (event.type === 'removed') {
    cleanupQuery(hashedKey)
  }
})

Cleanup Process:

  1. Decrements reference counts for all rows in the removed query
  2. Deletes rows when reference count reaches zero
  3. Cleans up observer and unsubscribe function
  4. Removes query from all tracking Maps

6. Sync Mode Integration

Eager Mode (default):

  • Creates single initial query with empty options ({})
  • Bypasses loadSubset (returns undefined)
  • Static query key (no builder function needed)

On-Demand Mode:

  • No initial query created
  • Calls markReady() immediately since there's nothing to wait for
  • Creates queries dynamically as loadSubset(options) is called
  • Requires query key builder function to handle different predicates
  • Returns createQueryFromOpts directly as the loadSubset implementation

Sync Started Tracking:
Added syncStarted flag to determine when to subscribe to new queries:

  • Set to true when sync begins (via preload(), startSync, or first subscriber)
  • Used instead of checking config.startSync to handle all sync scenarios

Changes

Files Modified:

  • packages/query-db-collection/src/query.ts - Core implementation (+354 lines)
  • packages/query-db-collection/tests/query.test.ts - Comprehensive test suite (+567 lines)
  • .changeset/silent-trains-tell.md - Changeset entry

Test Coverage:

  • Added 4 new tests for Query Garbage Collection scenarios
  • Added test for preload() in on-demand mode
  • All 64 tests passing
  • Code coverage: 88.66% lines

Key Features

Predicate Pushdown - Pass LoadSubsetOptions from live queries to TanStack Query
Multiple Concurrent Queries - Manage multiple TanStack Query instances with different predicates
Reference Counting - Track which queries reference which rows
Automatic Garbage Collection - Clean up queries and rows when no longer needed
Promise-Based Loading - Return promises that resolve when data is available
Sync Mode Support - Works with both eager and on-demand sync modes
Immediate Ready State - On-demand collections transition to ready immediately

Breaking Changes

None - this is a backward-compatible extension. Existing Query Collections with static query keys continue to work as before.

Migration Guide

If you want to enable predicate pushdown:

  1. Set syncMode: 'on-demand'
  2. Change queryKey from a static value to a builder function
  3. Access predicates in queryFn via context.meta.loadSubsetOptions

Before:

queryCollectionOptions({
  queryKey: ['todos'],
  queryFn: async () => fetch('/api/todos').then(r => r.json()),
  // ...
})

After:

queryCollectionOptions({
  syncMode: 'on-demand',
  queryKey: ({ where, limit, orderBy }) => ['todos', { where, limit, orderBy }],
  queryFn: async (ctx) => {
    const { where, limit } = ctx.meta?.loadSubsetOptions ?? {}
    const params = new URLSearchParams()
    if (where) params.set('filter', JSON.stringify(where))
    if (limit) params.set('limit', limit.toString())
    return fetch(\`/api/todos?\${params}\`).then(r => r.json())
  },
  // ...
})

Related


Note: This PR replaces #646 with a rebased branch to remove duplicate commits from #669 that were already merged into main.

@changeset-bot
Copy link

changeset-bot bot commented Oct 15, 2025

🦋 Changeset detected

Latest commit: 21f9f10

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 2 packages
Name Type
@tanstack/query-db-collection Patch
@tanstack/db-example-react-todo Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

@pkg-pr-new
Copy link

pkg-pr-new bot commented Oct 15, 2025

More templates

@tanstack/angular-db

npm i https://pkg.pr.new/@tanstack/angular-db@681

@tanstack/db

npm i https://pkg.pr.new/@tanstack/db@681

@tanstack/db-ivm

npm i https://pkg.pr.new/@tanstack/db-ivm@681

@tanstack/electric-db-collection

npm i https://pkg.pr.new/@tanstack/electric-db-collection@681

@tanstack/query-db-collection

npm i https://pkg.pr.new/@tanstack/query-db-collection@681

@tanstack/react-db

npm i https://pkg.pr.new/@tanstack/react-db@681

@tanstack/rxdb-db-collection

npm i https://pkg.pr.new/@tanstack/rxdb-db-collection@681

@tanstack/solid-db

npm i https://pkg.pr.new/@tanstack/solid-db@681

@tanstack/svelte-db

npm i https://pkg.pr.new/@tanstack/svelte-db@681

@tanstack/trailbase-db-collection

npm i https://pkg.pr.new/@tanstack/trailbase-db-collection@681

@tanstack/vue-db

npm i https://pkg.pr.new/@tanstack/vue-db@681

commit: 21f9f10

@github-actions
Copy link
Contributor

github-actions bot commented Oct 15, 2025

Size Change: 0 B

Total Size: 89.4 kB

ℹ️ View Unchanged
Filename Size
./packages/db/dist/esm/collection/change-events.js 1.63 kB
./packages/db/dist/esm/collection/changes.js 1.01 kB
./packages/db/dist/esm/collection/events.js 413 B
./packages/db/dist/esm/collection/index.js 3.23 kB
./packages/db/dist/esm/collection/indexes.js 1.16 kB
./packages/db/dist/esm/collection/lifecycle.js 1.8 kB
./packages/db/dist/esm/collection/mutations.js 2.52 kB
./packages/db/dist/esm/collection/state.js 3.79 kB
./packages/db/dist/esm/collection/subscription.js 2.2 kB
./packages/db/dist/esm/collection/sync.js 2.2 kB
./packages/db/dist/esm/deferred.js 230 B
./packages/db/dist/esm/errors.js 3.48 kB
./packages/db/dist/esm/event-emitter.js 798 B
./packages/db/dist/esm/index.js 1.72 kB
./packages/db/dist/esm/indexes/auto-index.js 794 B
./packages/db/dist/esm/indexes/base-index.js 835 B
./packages/db/dist/esm/indexes/btree-index.js 2 kB
./packages/db/dist/esm/indexes/lazy-index.js 1.21 kB
./packages/db/dist/esm/indexes/reverse-index.js 577 B
./packages/db/dist/esm/local-only.js 967 B
./packages/db/dist/esm/local-storage.js 2.4 kB
./packages/db/dist/esm/optimistic-action.js 294 B
./packages/db/dist/esm/proxy.js 3.86 kB
./packages/db/dist/esm/query/builder/functions.js 615 B
./packages/db/dist/esm/query/builder/index.js 4.04 kB
./packages/db/dist/esm/query/builder/ref-proxy.js 938 B
./packages/db/dist/esm/query/compiler/evaluators.js 1.55 kB
./packages/db/dist/esm/query/compiler/expressions.js 760 B
./packages/db/dist/esm/query/compiler/group-by.js 2.04 kB
./packages/db/dist/esm/query/compiler/index.js 2.21 kB
./packages/db/dist/esm/query/compiler/joins.js 2.65 kB
./packages/db/dist/esm/query/compiler/order-by.js 1.43 kB
./packages/db/dist/esm/query/compiler/select.js 1.28 kB
./packages/db/dist/esm/query/ir.js 785 B
./packages/db/dist/esm/query/live-query-collection.js 404 B
./packages/db/dist/esm/query/live/collection-config-builder.js 5.54 kB
./packages/db/dist/esm/query/live/collection-registry.js 233 B
./packages/db/dist/esm/query/live/collection-subscriber.js 2.11 kB
./packages/db/dist/esm/query/optimizer.js 3.26 kB
./packages/db/dist/esm/query/predicate-utils.js 3.87 kB
./packages/db/dist/esm/query/subset-dedupe.js 1.1 kB
./packages/db/dist/esm/scheduler.js 1.29 kB
./packages/db/dist/esm/SortedMap.js 1.24 kB
./packages/db/dist/esm/transactions.js 3.05 kB
./packages/db/dist/esm/utils.js 1.01 kB
./packages/db/dist/esm/utils/browser-polyfills.js 365 B
./packages/db/dist/esm/utils/btree.js 6.01 kB
./packages/db/dist/esm/utils/comparison.js 754 B
./packages/db/dist/esm/utils/index-optimization.js 1.73 kB

compressed-size-action::db-package-size

@github-actions
Copy link
Contributor

github-actions bot commented Oct 15, 2025

Size Change: 0 B

Total Size: 2.89 kB

ℹ️ View Unchanged
Filename Size
./packages/react-db/dist/esm/index.js 168 B
./packages/react-db/dist/esm/useLiveInfiniteQuery.js 1.41 kB
./packages/react-db/dist/esm/useLiveQuery.js 1.31 kB

compressed-size-action::react-db-package-size

@kevin-dp kevin-dp force-pushed the kevin/pred-pushdown-query-coll-rebased branch from 1db4f71 to 285fff5 Compare October 20, 2025 14:27
@kevin-dp kevin-dp changed the base branch from main to kevin/dedup-callback October 21, 2025 07:16
@kevin-dp
Copy link
Contributor

I pushed a unit test for limited ordered queries. The test currently fails because the query collection uses currentStateAsChanges to execute the query locally when it is deduplicated but the currentStateAsChanges function currently ignores the orderBy and limit options and so it contains all the query results instead of just the ones that fulfil the limit. Something that i'm going to fix in a separate PR.

@kevin-dp kevin-dp force-pushed the kevin/dedup-callback branch from a844ab2 to 61beafa Compare October 21, 2025 10:10
@kevin-dp kevin-dp force-pushed the kevin/pred-pushdown-query-coll-rebased branch from 9630dbe to e865acd Compare October 21, 2025 10:11
@kevin-dp
Copy link
Contributor

I added support for orderBy and limit in currentStateAsChanges in #701 and rebased the chain of PRs. All tests are green now.

@kevin-dp kevin-dp force-pushed the kevin/pred-pushdown-query-coll-rebased branch from e865acd to 21f9f10 Compare October 22, 2025 09:18
@kevin-dp kevin-dp force-pushed the kevin/dedup-callback branch from 61beafa to 313299a Compare October 22, 2025 09:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants