Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mypy doesn't recurse through directories, but -h says it does #8548

Closed
rvandegrift opened this issue Mar 16, 2020 · 10 comments · Fixed by #9614
Closed

mypy doesn't recurse through directories, but -h says it does #8548

rvandegrift opened this issue Mar 16, 2020 · 10 comments · Fixed by #9614
Assignees

Comments

@rvandegrift
Copy link

According to mypy -h:

Mypy is a program that will type check your Python code.

Pass in any files or folders you want to type check. Mypy will
recursively traverse any provided folders to find .py files:

    $ mypy my_program.py my_src_folder

But it doesn't recurse, it only checks what's explicitly provided on the command line:

$ python3 -m venv venv
$ . venv/bin/activate
(venv) $ pip install mypy
...
(venv) $ mkdir -p test/test/
(venv) $ touch test/test/a.py
(venv) $ mypy test
There are no .py[i] files in directory 'test'
(venv) $ touch test/b.py
(venv) $ mypy test
Success: no issues found in 1 source file
(venv) $ mypy test test/test
Success: no issues found in 2 source files

The fix might be to change the documentation. This is easy to workaround with find.

@Michael0x2a
Copy link
Collaborator

I believe this is because you're using namespace packages instead of regular packages. If you try running:

mypy --namespace-packages -p test

...mypy should detect that you have two source files. The command line docs should have more details about both flags.

Alternatively, you could do:

touch test/__init__.py
touch test/test/__init__.py

...to convert your package back into a regular one. Then running mypy test should detect four files, as expected.

@rvandegrift
Copy link
Author

That might fix the above example. But my actual use case is to check any python files regardless of where in the directory hierarchy they appear. So I actually want the recursive behavior described in the help text.

@JukkaL
Copy link
Collaborator

JukkaL commented Mar 20, 2020

@rvandegrift Can you give a more concrete example of a directory hierarchy where -p FOO doesn't work for you? You can give multiple -p FOO arguments.

@rvandegrift
Copy link
Author

Well, there's no concrete directory hierarchy that lead me here.

I wanted to enable mypy on all of our repos via a single script in a shared CI pipeline. The script won't know which repo it's running for ahead of time. And since we're human and messy, we have different layouts in different repos. So I can't supply any -p FOO arguments, since I don't know what FOO is.

One of my team members put together a quick hack to find package roots and run once for each: https://github.com/jceresini/mypy_r

I'm not sure that this is a valuable use case. So maybe the only mypy issue is that the help text doesn't describe the actual operation.

@rjmorris
Copy link

The following directory hierarchy led me to this issue:

- setup.py
+ src
  + mypkg
    - __init__.py
    - module1.py
    - module2.py
+ tests
  - test_1.py
  - test_2.py

So this is a src layout, not a namespace package.

As @rvandegrift quoted from the help text:

Mypy will recursively traverse any provided folders to find .py files

Based on this statement, I expect that invoking mypy via mypy . will cause mypy to check all the .py files in my hierarchy. However, setup.py is the only one it checks. To check everything, I need to invoke it as mypy setup.py src/ tests/. And that's fine, I have no problem invoking it that way, but the behavior doesn't match what I'd expect given the documentation.

The description from the online docs is similar, although it doesn't specifically say it finds .py files:

In addition, mypy will recursively type check the entire contents of any provided directories.

I think the words "recursively" and "entire" are strong signals that it will descend down into every directory looking for every .py file, which isn't the case. A link to further details is helpfully provided right after this, where we learn how it actually works:

Directories that don’t represent Python packages (i.e. not directly containing an __init__.py[i] file) are checked as follows:

  • All *.py[i] files contained directly therein are checked as toplevel Python modules;
  • All packages contained directly therein (i.e. immediate subdirectories with an __init__.py[i] file) are checked as toplevel Python packages.

In the case of mypy ., the current directory doesn't contain an __init__.py, so the above logic applies. setup.py is contained directly therein, so that one is checked. But neither of the immediate subdirectories src or test contain an __init__.py, so they aren't checked. Stopping after one subdirectory level is hardly what I'd call recursive.

I understand that I can address the issue by invoking mypy differently. But like @rvandegrift suggested, I think the solution to this particular issue is to update the help text and docs so that they aren't misleading. I suggest removing the references to "recursive", keeping things concise by leaving the directory rules a bit vague, and pointing to where the procedure is described in greater detail.

For example, the help text could be changed from:

Pass in any files or folders you want to type check. Mypy will recursively traverse any provided folders to find .py files:

to:

Pass in any files or folders you want to type check:

Links to further documentation are already provided right after the sample command, so no need to add those.

And the online docs could be changed from:

The above command tells mypy it should type check all of the provided files together. In addition, mypy will recursively type check the entire contents of any provided directories.

For more details about how exactly this is done, see Mapping file paths to modules.

to:

The above command tells mypy it should type check all of the provided files together. In addition, mypy will type check the contents of any provided directories. For more details about how exactly this is done, see Mapping file paths to modules.

If you wanted to provide more details about the directory checking up front, maybe something like this:

For each provided directory, mypy will type check every package it finds in that directory and its immediate subdirectories, and it will type check all .py files it finds in that directory.

@ulope
Copy link

ulope commented Sep 24, 2020

I find this behaviour highly surprising and unintuitive.

We just discovered that this caused a part of our source tree not to be checked for over a year...

@gvanrossum
Copy link
Member

I am convinced. We just need to fix this. Someone please send a PR that makes this properly recursive. There is quite a difference between mypy <dir> and mypy -p <dir> because the latter takes a package name (a valid Python identifier).

If it breaks people they can fix it by writing something like mypy <dir>/*.py. (Hm, does the files config flag allow globs?)

@Eisbrenner
Copy link

my layout:

- pyproject.toml
- setup.cfg
+ src/
  + my_package_name/
    - __init__.py
    + subm/
      - __init__.py
+ tests/
   + test_group1/
      - test_*.py

with this layout, the following works (I have not yet found a way to combine the two into one command, for example using the configuration file)

$ MYPYPATH=src mypy -p my_package_name
$ mypy tests/**/*.py

However, this really is a nuisance compared to mypy . - which I expect to work from:

"Pass in any files or folders you want to type check. Mypy will recursively traverse any provided folders to find .py files"

$ mypy .                  
There are no .py[i] files in directory '.'

I agree with rjmorris above, that at least the help and docs need to be updated. I would also appreciate a less cumbersome method to use mypy for a src-layout package tho.

@gvanrossum
Copy link
Member

There is no need to post more evidence to this issue. What we need is someone to volunteer a PR. The code is pretty straightforward (assuming you know how to iterate over directories recursively), it should all be contained in mypy/main.py.

@hauntsaninja
Copy link
Collaborator

hauntsaninja commented Oct 17, 2020

I agree that this causes bad surprises and needs a solution.

I'll also mention that fixing this might be a good time to figure out / have a plan for #5759 and #8584 (assuming we want to be able to have some sort of support for checking namespace packages by passing directory arguments).

@hauntsaninja hauntsaninja self-assigned this Oct 19, 2020
hauntsaninja pushed a commit to hauntsaninja/mypy that referenced this issue Oct 19, 2020
Fixes python#8548

This is important to fix because it's a really common source of
surprising false negatives.

I also lay some groundwork for supporting passing in namespace packages
on the command line. I wanted to make things work in this PR itself, but
it looks like we have some __init__.py assumptions somewhere in build /
cache that I'll need to leave for another day (I was surprised to
discover fscache._fake_init and secret side effects that the innocuous
looking get_init_file seems to have).

The approach towards namespace packages that this anticipates is
that if you pass in files to mypy and you want mypy to understand your
namespace packages, you'll need to specify explicit package roots.

We also make many fewer calls to crawl functions, since we just pass
around what we need.
hauntsaninja pushed a commit to hauntsaninja/mypy that referenced this issue Oct 19, 2020
Fixes python#8548

This is important to fix because it's a really common source of
surprising false negatives.

I also lay some groundwork for supporting passing in namespace packages
on the command line. The approach towards namespace packages that this
anticipates is that if you pass in files to mypy and you want mypy to
understand your namespace packages, you'll need to specify explicit
package roots.

We also make many fewer calls to crawl functions, since we just pass
around what we need.
hauntsaninja added a commit that referenced this issue Oct 23, 2020
Fixes #8548

This is important to fix because it's a really common source of
surprising false negatives.

I also lay some groundwork for supporting passing in namespace packages
on the command line. The approach towards namespace packages that this
anticipates is that if you pass in files to mypy and you want mypy to
understand your namespace packages, you'll need to specify explicit
package roots.

We also make many fewer calls to crawl functions, since we just pass
around what we need.

Co-authored-by: hauntsaninja <>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

8 participants