Skip to content

Commit 9b56168

Browse files
committed
(torchx/components) Fix entrypoint loading to deal with deferred loading of modules to enable component registration to work properly
1 parent 3b790a9 commit 9b56168

File tree

11 files changed

+496
-176
lines changed

11 files changed

+496
-176
lines changed

docs/source/advanced.rst

+98-15
Original file line numberDiff line numberDiff line change
@@ -156,7 +156,7 @@ resource can then be used in the following manner:
156156

157157
Registering Custom Components
158158
-------------------------------
159-
It is possible to author and register a custom set of components with the
159+
You can author and register a custom set of components with the
160160
``torchx`` CLI as builtins to the CLI. This makes it possible to customize
161161
a set of components most relevant to your team or organization and support
162162
it as a CLI ``builtin``. This way users will see your custom components
@@ -166,8 +166,25 @@ when they run
166166
167167
$ torchx builtins
168168
169-
Custom components can be registered via the following modification of the ``entry_points``:
169+
Custom components can be registered via ``[torchx.components]`` entrypoints.
170+
If ``my_project.bar`` had the following directory structure:
171+
172+
::
173+
174+
$PROJECT_ROOT/my_project/bar/
175+
|- baz.py
176+
177+
And ``baz.py`` had a single component (function) called ``trainer``:
178+
179+
::
180+
181+
# baz.py
182+
import torchx.specs as specs
183+
184+
def trainer(...) -> specs.AppDef: ...
185+
170186

187+
And the entrypoints were added as:
171188

172189
.. testcode::
173190

@@ -179,29 +196,95 @@ Custom components can be registered via the following modification of the ``entr
179196
],
180197
}
181198

182-
The line above registers a group ``foo`` that is associated with the module ``my_project.bar``.
183-
TorchX will recursively traverse lowest level dir associated with the ``my_project.bar`` and will find
184-
all defined components.
185-
186-
.. note:: If there are two registry entries, e.g. ``foo = my_project.bar`` and ``test = my_project``
187-
there will be two sets of overlapping components with different aliases.
199+
TorchX will search the module ``my_project.bar`` for all defined components and group the found
200+
components under the ``foo.*`` prefix. In this case, the component ``my_project.bar.baz.trainer``
201+
would be registered with the name ``foo.baz.trainer``.
188202

203+
.. note::
204+
Only python packages (those directories with an ``__init__.py`` file)
205+
are searched for and TorchX makes no attempt to recurse into namespace packages
206+
(directories without a ``__init__.py`` file).
207+
However you may register a top level namespace package.
189208

190-
After registration, torchx cli will display registered components via:
209+
``torchx`` CLI will display registered components via:
191210

192211
.. code-block:: shell-session
193212
194213
$ torchx builtins
214+
Found 1 builtin components:
215+
1. foo.baz.trainer
195216
196-
If ``my_project.bar`` had the following directory structure:
217+
The custom component can then be used as:
197218

198-
::
219+
.. code-block:: shell-session
199220
200-
$PROJECT_ROOT/my_project/bar/
201-
|- baz.py
221+
$ torchx run foo.baz.trainer -- --name "test app"
222+
223+
224+
When you register your own components, TorchX will not include its own builtins. To add TorchX's
225+
builtin components you must specify another entry as:
226+
227+
228+
.. testcode::
229+
230+
# setup.py
231+
...
232+
entry_points={
233+
"torchx.components": [
234+
"foo = my_project.bar",
235+
"torchx = torchx.components",
236+
],
237+
}
238+
239+
This will add back the TorchX builtins but with a ``torchx.*`` component name prefix (e.g. ``torchx.dist.ddp``
240+
versus the default ``dist.ddp``).
241+
242+
If there are two registry entries pointing to the same component, for instance
243+
244+
.. testcode::
245+
246+
# setup.py
247+
...
248+
entry_points={
249+
"torchx.components": [
250+
"foo = my_project.bar",
251+
"test = my_project,
252+
],
253+
}
202254

203-
And ``baz.py`` defines a component (function) called ``trainer``. Then the component can be run as a job in the following manner:
255+
There will be two sets of overlapping components for those components in ``my_project.bar`` with different
256+
prefix aliases: ``foo.*`` and ``test.bar.*``. Concretely,
204257

205258
.. code-block:: shell-session
206259
207-
$ torchx run foo.baz.trainer -- --name "test app"
260+
$ torchx builtins
261+
Found 2 builtin components:
262+
1. foo.baz.trainer
263+
2. test.bar.baz.trainer
264+
265+
To omit groupings and make the component names shorter, use underscore (e.g ``_`` or ``_0``, ``_1``, etc).
266+
For example:
267+
268+
.. testcode::
269+
270+
# setup.py
271+
...
272+
entry_points={
273+
"torchx.components": [
274+
"_0 = my_project.bar",
275+
"_1 = torchx.components",
276+
],
277+
}
278+
279+
This has the effect of exposing the trainer component as ``baz.trainer`` (as opposed to ``foo.baz.trainer``)
280+
and adds back the builtin components as in the vanilla installation of torchx, without the ``torchx.*`` prefix.
281+
282+
.. shell-session::
283+
284+
$ torchx builtins
285+
Found 11 builtin components:
286+
1. baz.trainer
287+
2. dist.ddp
288+
3. utils.python
289+
4. ... <more builtins from torchx.components.* ...>
290+

0 commit comments

Comments
 (0)