Include R errors in the log files. #18

plietar · 2025-02-04T17:39:41Z

Currently, when an R error is thrown by a report, it is caught by the rrq worker and is stored in Redis, but it is not exposed over the runner API anywhere.

Rather than introduce yet another field in the API, we print the error from the worker process, which makes it visible at the end of the task's log file.
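The idea can be illustrated with a minimal, language-neutral sketch. It is written in Python rather than R, and it is not the code from this PR: `run_task`, `failing_report` and the in-memory `log` are hypothetical names used only for illustration. The wrapper prints the error to the same stream that captures the task's output, then re-raises so that the caller still records the failure.

```python
import io
import sys
import traceback
from typing import Callable, TextIO


def run_task(task: Callable[[], object], log: TextIO = sys.stderr) -> object:
    """Run `task`; if it fails, print the error to `log` before re-raising."""
    try:
        return task()
    except Exception as err:
        # Write the error to the stream that already captures the task's
        # output, so it shows up as the last entry in the log.
        print(f"Error: {err}", file=log)
        traceback.print_exc(file=log)
        log.flush()
        # Re-raise so the caller can still store the failure as before.
        raise


def failing_report() -> None:
    # Stand-in for a report that throws an error part-way through.
    raise ValueError("object 'x' not found")


# Usage: capture the log in memory and check that the error was written.
log = io.StringIO()
try:
    run_task(failing_report, log)
except ValueError:
    pass
captured = log.getvalue()
```

Printing from inside the worker means no new API field is needed: anything that already reads the log file sees the error.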

The API tests used to set up individual endpoints to run the tests. That is a bit verbose and duplicates a bunch of code from the `api.R` file. This would get much worse once we introduce `url` parameters to all the endpoints, since every test would need to make an API call to fetch the repository first, leading to more boilerplate. Using the api object solves this issue.

In the previous design, outpack_server, the runner API and the workers all shared a single outpack directory and a Git repository (mostly: the workers used a clone of the shared repository for the actual execution). This creates a very tight and brittle coupling between all the components. It makes it impossible to deploy the different components on separate machines. It requires careful reasoning about data races and conflicts between the different bits. It prevents us from sharing worker processes across multiple Packit instances, and it prevents us from using multiple Git repositories within a single instance.

The new design completely splits up the storage.

- The API server and each worker have their own local Git clones of the repositories, which are pulled directly from the upstream (e.g. GitHub).
- The API servers and workers store bare Git clones of the repositories, without any worktree. When running a report, workers create a new worktree in a temporary directory, run the report and delete the worktree. This ensures a completely clean slate every time.
- The workers use their own outpack store, which is not shared with any other process.
- The workers can pull and push packets using any protocol supported by orderly2. In practice, we will be using HTTP to interact with the outpack_server used by Packit.

Currently, the workers create a new outpack store for each run, meaning they do not cache any of the packet dependencies and need to download them from the outpack_server from scratch every time. Given that, at least for now, workers and outpack_server will be operating on the same or nearby machines, this seems like a reasonable overhead.

Ideally we would keep a per-worker cache, but we need to be careful not to mix packets between different instances. One possible approach may be to re-use the file store, but start from an empty metadata store every time. This way large unnecessary file downloads are avoided, while preserving some degree of isolation between runs and instances.
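The per-run worktree lifecycle described above (bare clone, temporary worktree, run, delete) can be sketched roughly as follows. This is an illustrative Python sketch, not the runner's actual R implementation: `run_in_fresh_worktree`, its parameters and the injectable `git` callable are hypothetical names. It relies only on the standard `git worktree add --detach` and `git worktree remove --force` subcommands.

```python
import os
import shutil
import subprocess
import tempfile
from typing import Callable, List, TypeVar

T = TypeVar("T")


def _git(args: List[str]) -> None:
    # Default runner: invoke the real git CLI and fail loudly on errors.
    subprocess.run(["git", *args], check=True)


def run_in_fresh_worktree(
    bare_repo: str,
    ref: str,
    task: Callable[[str], T],
    git: Callable[[List[str]], None] = _git,
) -> T:
    """Check out `ref` from a bare clone into a temporary worktree,
    run `task` on it, and always delete the worktree afterwards."""
    parent = tempfile.mkdtemp(prefix="run-")
    worktree = os.path.join(parent, "worktree")
    created = False
    try:
        # The bare clone has no worktree of its own; create one per run.
        git(["-C", bare_repo, "worktree", "add", "--detach", worktree, ref])
        created = True
        return task(worktree)
    finally:
        if created:
            # Unregister the worktree from the bare clone's metadata.
            git(["-C", bare_repo, "worktree", "remove", "--force", worktree])
        # Remove the temporary directory even if git cleanup left files behind.
        shutil.rmtree(parent, ignore_errors=True)
```

Because every run starts from a newly created worktree and the cleanup sits in a `finally` block, files left behind by a failed run cannot leak into the next one. The `git` parameter exists only so the sketch can be exercised without a real repository.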

…-6126

Co-authored-by: Paul Liétar <[email protected]>


plietar and others added 12 commits January 14, 2025 18:22

Merge branch 'mrc-6154-fix-modification-time'

b637f04

Merge branch 'refactor-api-test'

7abc528

Merge branch 'mrc-6123' of github.com:mrc-ide/orderly.runner into mrc…

941ffc5

…-6126

wip, need to complete tests

0ccfd52

Merge branch 'mrc-6123' of github.com:mrc-ide/orderly.runner into mrc…

837eae6

…-6126

pauls review comments

0c503eb

empty for ci

3e83658

add content type

4364bca

update artefact action

caffd7e

change testing to use string of ssh key

353eb2c

plietar force-pushed the mrc-6152 branch from 98e3287 to 45350e6 Compare February 4, 2025 17:41

plietar mentioned this pull request Feb 4, 2025

Update to new orderly.runner interface. mrc-ide/packit#159

Merged

M-Kusumgar and others added 8 commits February 5, 2025 14:48

add unit test for git_sync

11ff880

made git sync unit test more reliable

d19c8d9

Merge branch 'main' of github.com:mrc-ide/orderly.runner into mrc-6126

2c4c63b

use fs file info instead of stinky command

59fb8cd

update doc and add comment

9929ad4

Update R/git.R

7c17118

Co-authored-by: Paul Liétar <[email protected]>

Update R/queue.R

00cdec0

Co-authored-by: Paul Liétar <[email protected]>

update docs

675f0df

plietar force-pushed the mrc-6152 branch from 45350e6 to 8c2d05a Compare February 10, 2025 16:35

plietar changed the title ~~Mrc 6152~~ Include R errors in the log files. Feb 10, 2025

plietar requested a review from M-Kusumgar February 10, 2025 16:35

plietar marked this pull request as ready for review February 10, 2025 16:58

plietar force-pushed the mrc-6152 branch from 8c2d05a to 2dcfcdf Compare February 10, 2025 16:59

plietar force-pushed the mrc-6152 branch from 2dcfcdf to 1ced512 Compare February 10, 2025 18:10
