A global config and `setup` method #64

AlanCoding · 2025-02-11T05:06:47Z

Fixes #44

The argument for this is difficult and something I've struggled with. But it's a strong argument. We have 3 different actors:

The service itself, it runs tasks
The publisher, it submits tasks
Any dispatcherctl type command, exists in AWX, does control-and-reply to get debugging information

Because these are all invoked separately, we should requires the setup() method to be called, and if not, throw an error. Otherwise, it's very difficult to assure that the service is using the same config as the publisher, for example. On higher levels, this is easy to enforce using the Django .ready() method. But this will set the tone for how tests are written on the psycopg layer.

Alex-Izquierdo

Since this PR is a draft, I'm not sure if some of my questions are premature. I'n sorry in that case.

dispatcher/brokers/base.py

dispatcher/brokers/pg_notify.py

dispatcher/config.py

dispatcher/registry.py

dispatcher/config.py

First pass at global config setup Finish running and just starting on tests Add a config test Make half-way progress through demo script Cut some more stuff out of the config Fix failing unit test, handle queue can not be found Review comment to consolidate factory handling Factories refactor Adopt new patterns up to some tests passing Unfinished start on settings serialization

dispatcher/pool.py

tests/integration/publish/test_registry.py

AlanCoding · 2025-02-15T16:01:17Z

Additional incoming change:

I've changed my tune on how __init__ arguments should be passed. We should be consistent, and this would result in needing to pass related objects as arguments when one object needs another. So the worker pool needs to be passed the process manager. Then the main object needs to be passed the worker pool. Then the factories will work out how to extract the kwargs from the config file. This suggests a new YAML structure like this:

service:
  pool_cls: dispatcher.pool.WorkerPool
  pool_kwargs:
    min_workers: 2
    max_workers: 12
  process_manager_cls: dispatcher.process.ForkServerManager
  process_manager_kwargs:
    preload_modules: my_app.hazmat

I was trying to make something work where we passed the process-manager kwargs into the main object... and this pattern doesn't scale well. And this all needs to be as "dumb", and straight-to-the-code as possible. Rephrasing, making the config file look messy is fine if it makes things simpler overall.

Alex-Izquierdo

Should have the config a schema from where it can be easily validated?

dispatcher/brokers/pg_notify.py

dispatcher.yml

AlanCoding · 2025-02-17T18:14:17Z

Should have the config a schema from where it can be easily validated?

No and yes.

yes we can validate the schema. This is not jsonschema, but validating the content, which is a python dictionary by the point we would care to do this. Demo:

from dispatcher.config import setup, settings
setup(from_file='dispatcher.yml')
pkw = settings.service['pool_kwargs']

import inspect
from dispatcher.pool import WorkerPool
signature = inspect.signature(WorkerPool.__init__)
parameters = signature.parameters
valid_kwargs = [k for k, v in parameters.items() if v.kind != inspect.Parameter.VAR_POSITIONAL and v.kind != inspect.Parameter.VAR_KEYWORD]

>>> valid_kwargs
['self', 'worker_id', 'process']
>>> set(pkw.keys()) - set(valid_kwargs)
set()

This assures that all keys are valid. This doesn't do type validation, that would be a general problem that I would rather take to be solved, as opposed to solve myself.

I hope this answers the first part that no we don't want to write out a schema somewhere, because this general method defines the schema we want.

What we're lacking is an articulation of the value-add from doing fancy inspect stuff like this, as opposed to just letting the code go error. Possible answers:

We could use this inspection to construct a schema dynamically. Put it where then?
Validate the config schema in the publisher, not just in the service. But in that case, we could just initialize the objects and let the error happen that way, which would be simpler and more reliable.

Alex-Izquierdo · 2025-02-18T12:35:46Z

Should have the config a schema from where it can be easily validated?

No and yes.

yes we can validate the schema. This is not jsonschema, but validating the content, which is a python dictionary by the point we would care to do this. Demo:
from dispatcher.config import setup, settings
setup(from_file='dispatcher.yml')
pkw = settings.service['pool_kwargs']

import inspect
from dispatcher.pool import WorkerPool
signature = inspect.signature(WorkerPool.__init__)
parameters = signature.parameters
valid_kwargs = [k for k, v in parameters.items() if v.kind != inspect.Parameter.VAR_POSITIONAL and v.kind != inspect.Parameter.VAR_KEYWORD]

>>> valid_kwargs
['self', 'worker_id', 'process']
>>> set(pkw.keys()) - set(valid_kwargs)
set()
This assures that all keys are valid. This doesn't do type validation, that would be a general problem that I would rather take to be solved, as opposed to solve myself.

I hope this answers the first part that no we don't want to write out a schema somewhere, because this general method defines the schema we want.

What we're lacking is an articulation of the value-add from doing fancy inspect stuff like this, as opposed to just letting the code go error. Possible answers:

We could use this inspection to construct a schema dynamically. Put it where then?

Validate the config schema in the publisher, not just in the service. But in that case, we could just initialize the objects and let the error happen that way, which would be simpler and more reliable.

I agree that using JSON Schema might be overkill. However, my main concern is not how the validation is done but rather ensuring that validation happens effectively.

The primary argument for validation is the "fail faster, fail earlier" principle, which improves both error handling and user experience. If an error in the configuration can be detected early, we can provide a clear and specific error message explaining what is wrong and why it is not accepted. This prevents unnecessary execution and debugging effort.

I also disagree with the idea of simply initializing the objects and letting the error occur naturally. This approach may lead to errors that are harder to debug or, worse, result in silent failures or unexpected behaviors.

Your approach is certainly creative, but I see some potential issues. It would not account for optional parameters, and it relies on a strict 1:1 relationship between the configuration and function/object signatures. This limits flexibility and, in my opinion, introduces a partial coupling of concerns.

For these reasons, I advocate for an independent configuration that can be validated as early as possible.

Alex-Izquierdo · 2025-02-18T12:40:36Z

For these reasons, I advocate for an independent configuration that can be validated as early as possible.

I would like to clarify that I'm not going to block the PR on this matter. In one way or another, it is something that can be addressed perfectly in later iterations. I'm just sharing my PoV on how it could be. :)

AlanCoding · 2025-02-18T13:19:15Z

Auto-gen schema could help for documentation and versioning. We could put this in the repo and check for diff in checks. If there's a diff, then we can bump a version.

This is to help with a real problem, which is versioning. My intent was to cut a version (PyPI, or just a tag) before merging this because it requires changes to the eda-server branch. Auto-generating a spec file (that is enforced) would document when and how something changed to expected schema.

But failing earlier doesn't make sense to me. Initializing the objects from the factories (from settings) is "free", and can be done in any context.

AlanCoding · 2025-02-18T14:45:17Z

This is moving a bit fast, but I went ahead and pushed a commit that matches the description of my last comment.

I appreciate that this would be better as jsonschema. But that requires a general tool to convert type hints to jsonschema, there's no point in us maintaining that logic. Let's file an issue for it. Current format is kind of human readable, but jsonschema would be better.

dispatcher/brokers/base.py

dispatcher/registry.py

tests/integration/test_main.py

tests/unit/conftest.py

tests/unit/test_config.py

Alex-Izquierdo reviewed Feb 11, 2025

View reviewed changes

dispatcher/config.py Outdated Show resolved Hide resolved

dispatcher/config.py Outdated Show resolved Hide resolved

AlanCoding force-pushed the class_config branch 2 times, most recently from c8020c0 to 7f5a44d Compare February 12, 2025 18:40

AlanCoding added 3 commits February 14, 2025 13:23

Complete worker settings initialization

477439f

linter fixups after worker settings fixing

46b808d

AlanCoding force-pushed the class_config branch from a77cbb2 to 46b808d Compare February 14, 2025 18:30

AlanCoding added 3 commits February 14, 2025 14:28

Work factories into control module

622cd4a

Add docs on the config

bbf9a36

Fix type hinting issue

34f7098

AlanCoding changed the title ~~[WIP] A global config and setup method~~ A global config and setup method Feb 14, 2025

Fix events data structure pattern goof

6b4653f

AlanCoding commented Feb 14, 2025

View reviewed changes

dispatcher/pool.py Show resolved Hide resolved

AlanCoding commented Feb 14, 2025

View reviewed changes

tests/integration/publish/test_registry.py Show resolved Hide resolved

AlanCoding marked this pull request as ready for review February 14, 2025 21:07

AlanCoding requested a review from pb82 February 14, 2025 21:07

AlanCoding mentioned this pull request Feb 15, 2025

Allow using forkserver #78

Merged

Alex-Izquierdo reviewed Feb 17, 2025

View reviewed changes

dispatcher/brokers/pg_notify.py Show resolved Hide resolved

dispatcher.yml Show resolved Hide resolved

AlanCoding added 2 commits February 17, 2025 09:46

Implement change described in comment, cls and kwargs patterns

ff1d6e4

Fix unit tests

f97962a

AlanCoding added 2 commits February 17, 2025 16:52

Convert broker base to protocol

9706c14

Refactor into single broker class

a489c1c

AlanCoding added 2 commits February 18, 2025 09:36

Produce a reference schema

7cc2b01

Fix linters

739861b

Fix link

a351706

Alex-Izquierdo reviewed Feb 19, 2025

View reviewed changes

dispatcher/brokers/base.py Outdated Show resolved Hide resolved

dispatcher/registry.py Show resolved Hide resolved

tests/integration/test_main.py Show resolved Hide resolved

tests/unit/conftest.py Outdated Show resolved Hide resolved

tests/unit/test_config.py Outdated Show resolved Hide resolved

AlanCoding added 4 commits February 19, 2025 09:04

Add type hints to async generator

5a5125b

Add type hint to make clear default enforcing

ea9c940

Test fix from review

883a8fb

Schema re-gen docs

39c7259

Alex-Izquierdo approved these changes Feb 19, 2025

View reviewed changes

Add yield to keep protcol method async

1e20eae

Alex-Izquierdo approved these changes Feb 19, 2025

View reviewed changes

AlanCoding merged commit 33d4256 into ansible:main Feb 20, 2025
7 checks passed

This was referenced Feb 27, 2025

Add classes for broker types #45

Closed

Do control-and-reply with existing (synchronous) Django connection #37

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A global config and `setup` method #64

A global config and `setup` method #64

AlanCoding commented Feb 11, 2025

Alex-Izquierdo left a comment

AlanCoding commented Feb 15, 2025

Alex-Izquierdo left a comment

AlanCoding commented Feb 17, 2025

Alex-Izquierdo commented Feb 18, 2025

Alex-Izquierdo commented Feb 18, 2025 •

edited

Loading

AlanCoding commented Feb 18, 2025

AlanCoding commented Feb 18, 2025

A global config and setup method #64

A global config and setup method #64

Conversation

AlanCoding commented Feb 11, 2025

Alex-Izquierdo left a comment

Choose a reason for hiding this comment

AlanCoding commented Feb 15, 2025

Alex-Izquierdo left a comment

Choose a reason for hiding this comment

AlanCoding commented Feb 17, 2025

Alex-Izquierdo commented Feb 18, 2025

Alex-Izquierdo commented Feb 18, 2025 • edited Loading

AlanCoding commented Feb 18, 2025

AlanCoding commented Feb 18, 2025

A global config and `setup` method #64

A global config and `setup` method #64

Alex-Izquierdo commented Feb 18, 2025 •

edited

Loading