Fix warnings in dask-cudf test suite #19993

TomAugspurger · 2025-09-17T13:57:15Z

Description

This PR fixes some warnings in the dask-cudf test suite and elevates any unhandled warnings to errors.

dask/backends.py:140: UserWarning: Warning gzip compression does not support breaking apart files

ParserWarning: Falling back to the 'python' engine because the 'c' engine does not support skipfooter; you can avoid this warning by specifying engine='python'.

UserWarning: You did not provide metadata, so Dask is running ...

UserWarning: Using CPU via PyArrow to read ORC dataset.

- cudf/core/dataframe.py:7708: RuntimeWarning: Degrees of freedom <= 0 for slice - cupy/_statistics/correlation.py:210: RuntimeWarning: divide by zero encountered in scalar divide

RuntimeWarning: invalid value encountered in cast

copy-pr-bot · 2025-09-17T13:57:19Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

TomAugspurger · 2025-09-17T13:57:27Z

/ok to test da7cb07

TomAugspurger · 2025-09-17T15:01:42Z

python/dask_cudf/dask_cudf/tests/test_sort.py

        {"a": list(range(15)) + [None] * 5, "b": list(reversed(range(20)))},
    ],
 )
+# This warning comes from dask-expr, and is probably a consequence of


This one is probably a bug in dask-cudf / dask.dataframe. It occurs as part of normal dask-cudf operations:

>>> import cudf, dask.dataframe as dd, cupy as cp >>> data = { ... "a": [None] * 100 + list(range(100, 150)), ... "b": list(range(50)) + [None] * 50 + list(range(50, 100)), ... } >>> df = cudf.DataFrame(data) >>> ddf = dd.from_pandas(df, npartitions=5) >>> ddf.sort_values(by="a", na_position="first") /raid/toaugspurger/envs/gh/rapidsai/cudf/lib/python3.13/site-packages/pandas/core/arrays/numpy_.py:130: RuntimeWarning: invalid value encountered in cast result = np.asarray(scalars, dtype=dtype) # type: ignore[arg-type]

The equivalent operation on a pandas.DataFrame doesn't emit the warning. I think this is from cudf's not handling NA / NaN the same for a column that would otherwise be integer dtype.

…df-warnings

TomAugspurger · 2025-09-18T18:50:38Z

/ok to test 3f0633c

TomAugspurger added 7 commits September 17, 2025 06:20

Fix dask-cudf test warning

ee60d50

dask/backends.py:140: UserWarning: Warning gzip compression does not support breaking apart files

Fix warning in dask-cudf tests

bb85d8d

ParserWarning: Falling back to the 'python' engine because the 'c' engine does not support skipfooter; you can avoid this warning by specifying engine='python'.

Fix warnings in dask-cudf tests

8bccf00

UserWarning: You did not provide metadata, so Dask is running ...

Fix warnings in dask_cudf tests

33318b4

UserWarning: Using CPU via PyArrow to read ORC dataset.

Fix warnings in dask-cudf tests

e9e338b

- cudf/core/dataframe.py:7708: RuntimeWarning: Degrees of freedom <= 0 for slice - cupy/_statistics/correlation.py:210: RuntimeWarning: divide by zero encountered in scalar divide

Fix warnings in dask-cudf tests

435b980

RuntimeWarning: invalid value encountered in cast

Treat warnings as errors in dask-cudf tests

da7cb07

github-actions bot assigned TomAugspurger Sep 17, 2025

github-actions bot added the Python Affects Python cuDF API. label Sep 17, 2025

github-project-automation bot added this to cuDF Python Sep 17, 2025

GPUtester moved this to In Progress in cuDF Python Sep 17, 2025

TomAugspurger commented Sep 17, 2025

View reviewed changes

TomAugspurger added 2 commits September 18, 2025 11:28

Merge remote-tracking branch 'upstream/branch-25.10' into tom/dask-cu…

a74896a

…df-warnings

filter more warnings

3f0633c

TomAugspurger added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Sep 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix warnings in dask-cudf test suite #19993

Fix warnings in dask-cudf test suite #19993

Uh oh!

TomAugspurger commented Sep 17, 2025

Uh oh!

copy-pr-bot bot commented Sep 17, 2025

Uh oh!

TomAugspurger commented Sep 17, 2025

Uh oh!

TomAugspurger Sep 17, 2025

Uh oh!

TomAugspurger commented Sep 18, 2025

Uh oh!

Uh oh!

Fix warnings in dask-cudf test suite #19993

Are you sure you want to change the base?

Fix warnings in dask-cudf test suite #19993

Uh oh!

Conversation

TomAugspurger commented Sep 17, 2025

Description

Uh oh!

copy-pr-bot bot commented Sep 17, 2025

Uh oh!

TomAugspurger commented Sep 17, 2025

Uh oh!

TomAugspurger Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

TomAugspurger commented Sep 18, 2025

Uh oh!

Uh oh!