Skip to content

Commit 326e589

Browse files
authored
Fix setting the name of PythonNode. (#443)
1 parent b191559 commit 326e589

13 files changed

+146
-118
lines changed

.readthedocs.yaml

+9-4
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,17 @@
11
version: 2
22

33
build:
4-
os: "ubuntu-20.04"
4+
os: ubuntu-22.04
55
tools:
6-
python: "mambaforge-4.10"
6+
python: "3.10"
77

88
sphinx:
9+
configuration: docs/source/conf.py
910
fail_on_warning: true
1011

11-
conda:
12-
environment: docs/rtd_environment.yml
12+
python:
13+
install:
14+
- method: pip
15+
path: .
16+
extra_requirements:
17+
- docs

docs/rtd_environment.yml

-36
This file was deleted.

docs/source/changes.md

+5
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,11 @@ chronological order. Releases follow [semantic versioning](https://semver.org/)
55
releases are available on [PyPI](https://pypi.org/project/pytask) and
66
[Anaconda.org](https://anaconda.org/conda-forge/pytask).
77

8+
## 0.4.1 - 2023-10-08
9+
10+
- {pull}`443` ensures that `PythonNode.name` is always unique by only handling it
11+
internally.
12+
813
## 0.4.0 - 2023-10-07
914

1015
- {pull}`323` remove Python 3.7 support and use a new Github action to provide mamba.

docs/source/conf.py

+1
Original file line numberDiff line numberDiff line change
@@ -41,6 +41,7 @@
4141
"sphinx.ext.viewcode",
4242
"sphinx_copybutton",
4343
"sphinx_click",
44+
"sphinx_toolbox.more_autodoc.autoprotocol",
4445
"nbsphinx",
4546
"myst_parser",
4647
"sphinx_design",

docs/source/how_to_guides/writing_custom_nodes.md

+12-11
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
# Writing custom nodes
22

33
In the previous tutorials and how-to guides, you learned that dependencies and products
4-
can be represented as plain Python objects with {class}`pytask.PythonNode` or as paths
5-
where every {class}`pathlib.Path` is converted to a {class}`pytask.PathNode`.
4+
can be represented as plain Python objects with {class}`~pytask.PythonNode` or as paths
5+
where every {class}`pathlib.Path` is converted to a {class}`~pytask.PathNode`.
66

77
In this how-to guide, you will learn about the general concept of nodes and how to write
88
your own to improve your workflows.
@@ -54,13 +54,13 @@ A custom node needs to follow an interface so that pytask can perform several ac
5454
- Load and save values when tasks are executed.
5555

5656
This interface is defined by protocols [^structural-subtyping]. A custom node must
57-
follow at least the protocol {class}`pytask.PNode` or, even better,
58-
{class}`pytask.PPathNode` if it is based on a path. The common node for paths,
59-
{class}`pytask.PathNode`, follows the protocol {class}`pytask.PPathNode`.
57+
follow at least the protocol {class}`~pytask.PNode` or, even better,
58+
{class}`~pytask.PPathNode` if it is based on a path. The common node for paths,
59+
{class}`~pytask.PathNode`, follows the protocol {class}`~pytask.PPathNode`.
6060

6161
## `PickleNode`
6262

63-
Since our {class}`PickleNode` will only vary slightly from {class}`pytask.PathNode`, we
63+
Since our {class}`PickleNode` will only vary slightly from {class}`~pytask.PathNode`, we
6464
use it as a template, and with some minor modifications, we arrive at the following
6565
class.
6666

@@ -85,8 +85,8 @@ class.
8585

8686
Here are some explanations.
8787

88-
- The node does not need to inherit from the protocol {class}`pytask.PPathNode`, but you
89-
can do it to be more explicit.
88+
- The node does not need to inherit from the protocol {class}`~pytask.PPathNode`, but
89+
you can do it to be more explicit.
9090
- The node has two attributes
9191
- `name` identifies the node in the DAG, so the name must be unique.
9292
- `path` holds the path to the file and identifies the node as a path node that is
@@ -107,9 +107,10 @@ Nodes are an important in concept pytask. They allow to pytask to build a DAG an
107107
generate a workflow, and they also allow users to extract IO operations from the task
108108
function into the nodes.
109109

110-
pytask only implements two node types, {class}`PathNode` and {class}`PythonNode`, but
111-
many more are possible. In the future, there should probably be a plugin that implements
112-
nodes for many other data sources like AWS S3 or databases. [^kedro]
110+
pytask only implements two node types, {class}`~pytask.PathNode` and
111+
{class}`~pytask.PythonNode`, but many more are possible. In the future, there should
112+
probably be a plugin that implements nodes for many other data sources like AWS S3 or
113+
databases. [^kedro]
113114

114115
## References
115116

docs/source/reference_guides/api.md

+15-12
Original file line numberDiff line numberDiff line change
@@ -239,25 +239,30 @@ The remaining exceptions convey specific errors.
239239
240240
```
241241

242-
## Nodes
242+
## Protocols
243243

244-
Nodes are the interface for different kinds of dependencies or products. They inherit
245-
from {class}`pytask.MetaNode`.
244+
Protocols define how tasks and nodes for dependencies and products have to be set up.
246245

247246
```{eval-rst}
248-
.. autoclass:: pytask.MetaNode
247+
.. autoprotocol:: pytask.MetaNode
248+
:show-inheritance:
249+
.. autoprotocol:: pytask.PNode
250+
:show-inheritance:
251+
.. autoprotocol:: pytask.PPathNode
252+
:show-inheritance:
253+
.. autoprotocol:: pytask.PTask
254+
:show-inheritance:
255+
.. autoprotocol:: pytask.PTaskWithPath
256+
:show-inheritance:
249257
```
250258

251-
Then, different kinds of nodes can be implemented.
259+
## Nodes
252260

253-
```{eval-rst}
254-
.. autoclass:: pytask.PathNode
255-
:members:
256-
```
261+
Nodes are the interface for different kinds of dependencies or products.
257262

258263
```{eval-rst}
264+
.. autoclass:: pytask.PathNode
259265
.. autoclass:: pytask.PythonNode
260-
:members:
261266
```
262267

263268
To parse dependencies and products from nodes, use the following functions.
@@ -357,8 +362,6 @@ There are some classes to handle different kinds of reports.
357362
358363
An indicator to mark arguments of tasks as products.
359364
360-
Examples
361-
--------
362365
>>> def task_example(path: Annotated[Path, Product]) -> None:
363366
... path.write_text("Hello, World!")
364367

docs/source/tutorials/defining_dependencies_products.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -320,7 +320,7 @@ Secondly, dictionaries use keys instead of positions that are more verbose and
320320
descriptive and do not assume a fixed ordering. Both attributes are especially desirable
321321
in complex projects.
322322

323-
## Multiple decorators
323+
**Multiple decorators**
324324

325325
pytask merges multiple decorators of one kind into a single dictionary. This might help
326326
you to group dependencies and apply them to multiple tasks.
@@ -344,7 +344,7 @@ Inside the task, `depends_on` will be
344344
{"first_text": ... / "text_1.txt", "second_text": "text_2.txt", 0: "text_3.txt"}
345345
```
346346

347-
## Nested dependencies and products
347+
**Nested dependencies and products**
348348

349349
Dependencies and products can be nested containers consisting of tuples, lists, and
350350
dictionaries. It is beneficial if you want more structure and nesting.

environment.yml

+1
Original file line numberDiff line numberDiff line change
@@ -47,4 +47,5 @@ dependencies:
4747
- sphinxext-opengraph
4848

4949
- pip:
50+
- sphinx-toolbox
5051
- -e .

setup.cfg

+13
Original file line numberDiff line numberDiff line change
@@ -54,6 +54,19 @@ where = src
5454
console_scripts =
5555
pytask=_pytask.cli:cli
5656

57+
[options.extras_require]
58+
docs =
59+
furo
60+
ipython
61+
myst-parser
62+
nbsphinx
63+
sphinx
64+
sphinx-click
65+
sphinx-copybutton
66+
sphinx-design>=0.3.0
67+
sphinx-toolbox
68+
sphinxext-opengraph
69+
5770
[check-manifest]
5871
ignore =
5972
src/_pytask/_version.py

src/_pytask/collect.py

+17-23
Original file line numberDiff line numberDiff line change
@@ -325,20 +325,8 @@ def pytask_collect_node(session: Session, path: Path, node_info: NodeInfo) -> PN
325325
node = node_info.value
326326

327327
if isinstance(node, PythonNode):
328-
prefix = (
329-
node_info.task_path.as_posix() + "::" + node_info.task_name
330-
if node_info.task_path
331-
else node_info.task_name
332-
)
333-
if node.name:
334-
node.name = prefix + "::" + node.name
335-
else:
336-
node.name = prefix + "::" + node_info.arg_name
337-
338-
suffix = "-".join(map(str, node_info.path)) if node_info.path else ""
339-
if suffix:
340-
node.name += "::" + suffix
341-
328+
node_name = _create_name_of_python_node(node_info)
329+
node.name = node_name
342330
return node
343331

344332
if isinstance(node, PPathNode) and not node.path.is_absolute():
@@ -366,15 +354,7 @@ def pytask_collect_node(session: Session, path: Path, node_info: NodeInfo) -> PN
366354
)
367355
return PathNode.from_path(node)
368356

369-
prefix = (
370-
node_info.task_path.as_posix() + "::" + node_info.task_name
371-
if node_info.task_path
372-
else node_info.task_name
373-
)
374-
node_name = prefix + "::" + node_info.arg_name
375-
suffix = "-".join(map(str, node_info.path)) if node_info.path else ""
376-
if suffix:
377-
node_name += "::" + suffix
357+
node_name = _create_name_of_python_node(node_info)
378358
return PythonNode(value=node, name=node_name)
379359

380360

@@ -514,3 +494,17 @@ def pytask_collect_log(
514494
)
515495

516496
raise CollectionError
497+
498+
499+
def _create_name_of_python_node(node_info: NodeInfo) -> str:
500+
"""Create name of PythonNode."""
501+
prefix = (
502+
node_info.task_path.as_posix() + "::" + node_info.task_name
503+
if node_info.task_path
504+
else node_info.task_name
505+
)
506+
node_name = prefix + "::" + node_info.arg_name
507+
if node_info.path:
508+
suffix = "-".join(map(str, node_info.path))
509+
node_name += "::" + suffix
510+
return node_name

src/_pytask/node_protocols.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ class MetaNode(Protocol):
2222
"""Protocol for an intersection between nodes and tasks."""
2323

2424
name: str
25-
"""The name of node that must be unique."""
25+
"""Name of the node that must be unique."""
2626

2727
@abstractmethod
2828
def state(self) -> str | None:

0 commit comments

Comments
 (0)