
Conversation


@tw4l tw4l commented Nov 18, 2025

Fixes #2957

Full backend and frontend implementation, with a new email notification to org admins when a crawl is paused because an org quota has been reached.

Backend changes

  • Modify the operator to auto-pause crawls when quotas are reached or archiving is disabled, rather than stopping them (see the sketch after this list)
  • Add new crawl states: paused_storage_quota_reached, paused_time_quota_reached, paused_org_readonly
  • Add uploaded WACZs to org storage totals immediately after upload so that auto-paused crawls will actually put the org's bytesStored above the storage quota
  • Send an email from a new template to all org admins when a crawl is auto-paused, with information about what to do
  • Fix datetime deprecation in tests
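
A minimal sketch of the auto-pause decision, using the new state names from this PR; the function shape, the parameter names, and the convention that a quota of 0 means "no quota set" are illustrative assumptions, not the actual operator code:

from typing import Optional

def auto_pause_state(
    bytes_stored: int,
    pending_size: int,
    storage_quota: int,
    exec_seconds: int,
    exec_seconds_quota: int,
    org_read_only: bool,
) -> Optional[str]:
    """Return the paused state a running crawl should transition to, or None.

    Pending (not-yet-uploaded) WACZ data counts toward the storage check,
    and a quota of 0 means no quota is set.
    """
    if org_read_only:
        return "paused_org_readonly"
    if storage_quota and bytes_stored + pending_size >= storage_quota:
        return "paused_storage_quota_reached"
    if exec_seconds_quota and exec_seconds >= exec_seconds_quota:
        return "paused_time_quota_reached"
    return None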

Updated nightly tests all pass: https://github.com/webrecorder/browsertrix/actions/runs/19684324914

Frontend changes

  • Add new paused crawl states
  • Update checks throughout the frontend for whether a crawl is paused to compare against all paused states

Dependencies

Relies on crawler changes introduced in webrecorder/browsertrix-crawler#919

Out of scope

Crawl workflow counts are a bit off: all crawls that complete are counted as successful regardless of state, and workflow storage counts are sometimes incremented incorrectly. I started trying to address that in this branch, but it's a bit involved and may require a migration, so it's best handled separately, I think. Issue: #3011

@tw4l tw4l force-pushed the issue-2957-pause-crawl-on-quota-reached branch 6 times, most recently from 4e5d015 to 6730c7f Compare November 25, 2025 17:03
@tw4l tw4l marked this pull request as ready for review November 25, 2025 20:14
Comment on lines +1410 to +1413

# sizes = await redis.hkeys(f"{crawl.id}:size")
# for size in sizes:
# await redis.hmset(f"{crawl.id}:size", {size: 0 for size in sizes})
Member Author

Suggested change
# sizes = await redis.hkeys(f"{crawl.id}:size")
# for size in sizes:
# await redis.hmset(f"{crawl.id}:size", {size: 0 for size in sizes})

Remove before merging

Comment on lines +1543 to +1551
print(f"pending size: {pending_size}", flush=True)
print(f"status.filesAdded: {status.filesAdded}", flush=True)
print(f"status.filesAddedSize: {status.filesAddedSize}", flush=True)
print(f"total: {total_size}", flush=True)
print(
f"org quota: {crawl.org.bytesStored + stats.size} <= {crawl.org.quotas.storageQuota}",
flush=True,
)

Member Author

Suggested change
print(f"pending size: {pending_size}", flush=True)
print(f"status.filesAdded: {status.filesAdded}", flush=True)
print(f"status.filesAddedSize: {status.filesAddedSize}", flush=True)
print(f"total: {total_size}", flush=True)
print(
f"org quota: {crawl.org.bytesStored + stats.size} <= {crawl.org.quotas.storageQuota}",
flush=True,
)

Remove before merging; useful for testing

@tw4l tw4l requested review from SuaYoo, emma-sg and ikreymer November 25, 2025 20:15
@tw4l
Member Author

tw4l commented Nov 25, 2025

Tagging @emma-sg @SuaYoo for review in addition to @ikreymer, with particular interest in getting your eyes on the frontend, email, and email copy parts of this. Thanks!

Member

@SuaYoo SuaYoo left a comment

Nice! Still doing manual testing; my initial impression is that it's probably worth adding an isPaused helper to utils/crawler.

export function isPaused({ state }: { state: string | null }) {
  return state && (PAUSED_STATES as readonly string[]).includes(state);
}

@ikreymer
Member

We want to send the e-mails multiple times if a crawl reaches quota, is resumed, then reaches quota again, right?
If so, we should also clear autoPausedEmailsSent when the crawl is running again
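
Something along those lines could look like this; a sketch only, treating autoPausedEmailsSent as a single boolean, with the send hook and function names made up for illustration:

AUTO_PAUSED_STATES = {
    "paused_storage_quota_reached",
    "paused_time_quota_reached",
    "paused_org_readonly",
}

def update_email_flag(state, emails_sent, send_admin_email):
    """Email org admins at most once per pause; re-arm on resume."""
    if state in AUTO_PAUSED_STATES and not emails_sent:
        send_admin_email()  # explain which quota paused the crawl
        return True
    if state == "running":
        return False  # cleared: a later auto-pause emails admins again
    return emails_sent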

@tw4l
Member Author

tw4l commented Nov 26, 2025

Nice! Still doing manual testing; my initial impression is that it's probably worth adding an isPaused helper to utils/crawler.

export function isPaused({ state }: { state: string | null }) {
  return state && (PAUSED_STATES as readonly string[]).includes(state);
}

I added a helper, but made it accept a string or null rather than an object with a state property, as none of the uses of this take an object with that key. Take a look and let me know what you think.

@tw4l
Member Author

tw4l commented Nov 26, 2025

We want to send the e-mails multiple times if a crawl reaches quota, is resumed, then reaches quota again, right? If so, we should also clear autoPausedEmailsSent when the crawl is running again

Done, and now storing this state in the db to be more reliable.
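
For reference, persisting that flag on the crawl document might look roughly like this; a sketch assuming a Mongo-style async collection (e.g. motor), with autoPausedEmailsSent stored as a boolean and the helper names made up for illustration:

async def mark_auto_paused_emails_sent(crawls, crawl_id):
    """Atomically set the flag; returns True only for the first caller,
    so the notification is sent at most once per pause."""
    res = await crawls.find_one_and_update(
        {"_id": crawl_id, "autoPausedEmailsSent": {"$ne": True}},
        {"$set": {"autoPausedEmailsSent": True}},
    )
    return res is not None

async def clear_auto_paused_emails_sent(crawls, crawl_id):
    """Re-arm notifications once the crawl is running again."""
    await crawls.update_one(
        {"_id": crawl_id}, {"$set": {"autoPausedEmailsSent": False}}
    )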

@SuaYoo SuaYoo self-requested a review November 26, 2025 19:19
Member

@SuaYoo SuaYoo left a comment

Frontend portion looks good!

@tw4l tw4l force-pushed the issue-2957-pause-crawl-on-quota-reached branch from 7726a59 to 0ad1644 Compare November 26, 2025 20:34
Member

@emma-sg emma-sg left a comment

Email language looks good! Left a few suggestions: one splits a sentence into two, and the rest just use curly quotes or remove unused code. Nice work!

I'll take another look at the frontend & backend changes; just wanted to get you some feedback on the email template now.

tw4l and others added 19 commits November 26, 2025 16:03
Needs to be tested, just pushing as-is so that I can pick it up
next week. There's an issue in local testing where crawls sometimes
appear to be twice as big as they really are, which is making
Browsertrix think the storage quota is reached prematurely. I
haven't yet pinned down the cause of this and it seems intermittent.
(#3013)

… pending, un-uploaded size

- use pending size to determine if quota reached
- also request pause to be set before assuming paused state
- also ensure data is actually committed before shutting down pods (in
case of any edge cases)
- clear paused flag in redis after crawler pods shutdown
- add OpCrawlStats to avoid adding unnecessary profile_update to public
API

this assumes crawler changes to support: clearing size after WACZ
upload, ensuring upload happens if a pod starts while the crawl is paused

---------

Co-authored-by: Tessa Walsh <[email protected]>
This is much more reliable, prevents duplicate emails as was
sometimes happening before, and makes it easier to clear the state
when a crawl is unpaused.
@SuaYoo SuaYoo force-pushed the issue-2957-pause-crawl-on-quota-reached branch from d2cba1b to 97dd148 Compare November 27, 2025 00:03
Development

Successfully merging this pull request may close these issues.

[Feature]: When a quota is reached, the crawl should be paused instead of stopped.