Survive minor db drops #427

adamruzicka · 2023-04-12T15:27:12Z

Includes #462

Essentially it means wrapping all db interactions in the persistence adapter with retries. We don't really need the retries, but we need to wrap the db-specific errors with general dynflow persistence errors which the rest of the codebase expects.

ofedoren · 2025-11-26T12:14:47Z

Thanks, @adamruzicka !

IIUC, this will help if a DB won't be available for cca 1 minute, and if more it will just kill the process now? If so, apologies for the newbie question, but looking at the related tickets, how will that help users to recover from the DB drop for e.g. 15 minutes? Is it a clear state that dynflow process is dead and needs to be started, is it that something will automatically try to restart the process since it's dead now?

adamruzicka · 2025-11-26T12:24:23Z

IIUC, this will help if a DB won't be available for cca 1 minute, and if more it will just kill the process now?

Yup, that's about right. The timing isn't exact, but the breaking point between carrying on and crashing hard is somewhere around that 1 minute mark.

is it that something will automatically try to restart the process since it's dead now?

In Foreman deployment, systemd should attempt to restart it, once we move to containers I'd expect again systemd or the container runtime to monitor it and restart if needed. In worst case without anything restarting it, this would at least clearly mark it as being down, rather than leaving it "up" in a broken state.

ofedoren

Given the explanation in #427 (comment) and that I'm not much familiar with dynflow internals, I don't have any concerns looking at the code changes themselves.

In other words, LGTM, but how would you like to proceed with the commits before merge?

adamruzicka · 2025-11-26T12:45:12Z

In other words, LGTM, but how would you like to proceed with the commits before merge?

I'd prefer to get your blessing on #462 , squash and merge that and then rebase this one on top

Essentially it means wrapping all db interactions in the persistence adapter with retries. We don't really need the retries, but we need to wrap the db-specific errors with general dynflow persistence errors which the rest of the codebase expects.

ofedoren

as soon as it's green

adamruzicka force-pushed the db-conn-drop branch from 38ab4de to f0bc6ed Compare November 12, 2025 10:36

adamruzicka marked this pull request as ready for review November 12, 2025 10:36

adamruzicka force-pushed the db-conn-drop branch 3 times, most recently from 2d3dd9d to e1a9865 Compare November 25, 2025 18:53

ofedoren approved these changes Nov 26, 2025

View reviewed changes

adamruzicka force-pushed the db-conn-drop branch 2 times, most recently from 7211e39 to 8f1596d Compare November 26, 2025 14:08

adamruzicka added 3 commits November 26, 2025 16:08

Survive minor db drops

0fa2f99

Essentially it means wrapping all db interactions in the persistence adapter with retries. We don't really need the retries, but we need to wrap the db-specific errors with general dynflow persistence errors which the rest of the codebase expects.

Allow "clean" termination even without a database connection

d468e11

Add bats tests covering database connection loss handling

248ad99

adamruzicka force-pushed the db-conn-drop branch from 8f1596d to 248ad99 Compare November 26, 2025 15:09

ofedoren approved these changes Nov 26, 2025

View reviewed changes

adamruzicka merged commit 25f2af6 into Dynflow:master Nov 26, 2025
11 checks passed

adamruzicka deleted the db-conn-drop branch November 26, 2025 15:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Survive minor db drops #427

Survive minor db drops #427

Uh oh!

adamruzicka commented Apr 12, 2023 •

edited

Loading

Uh oh!

ofedoren commented Nov 26, 2025

Uh oh!

adamruzicka commented Nov 26, 2025

Uh oh!

ofedoren left a comment

Uh oh!

adamruzicka commented Nov 26, 2025

Uh oh!

ofedoren left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Survive minor db drops #427

Survive minor db drops #427

Uh oh!

Conversation

adamruzicka commented Apr 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ofedoren commented Nov 26, 2025

Uh oh!

adamruzicka commented Nov 26, 2025

Uh oh!

ofedoren left a comment

Choose a reason for hiding this comment

Uh oh!

adamruzicka commented Nov 26, 2025

Uh oh!

ofedoren left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

adamruzicka commented Apr 12, 2023 •

edited

Loading

ofedoren left a comment •

edited

Loading