Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Research how to find the POSIX-compatible shell #1761

Open
5 of 12 tasks
EliahKagan opened this issue Jan 12, 2025 · 4 comments
Open
5 of 12 tasks

Research how to find the POSIX-compatible shell #1761

EliahKagan opened this issue Jan 12, 2025 · 4 comments
Labels
enhancement New feature or request

Comments

@EliahKagan
Copy link
Member

EliahKagan commented Jan 12, 2025

Summary 💡

It is a truism that the POSIX-compatible shell is sh and that it is located at /bin/sh. For systems where that is not so, or where it is in some important way not the whole truth, I'd like to figure out what should be done for running commands that should otherwise be run in /bin/sh.

  • Common situation on Unix-like systems: it's just /bin/sh!
    • Should we use relative sh instead? POSIX does not require /bin/sh.
  • Solaris and illumos: Is the POSIX sh elsewhere than /bin/sh? Which do we use?
  • Git for Windows, full installation: (git root)\usr\bin\sh.exe (#1758)
    • (git root)\bin\sh.exe? (No, not SDK-compatible, see below.)
    • "Portable" Git (same as Scoop package except for Scoop's own shim)
  • Git for Windows, traditional MinGit: Does it need anything special?
  • Git for Windows, BusyBox-based MinGit: Does it need anything special?
  • Git for Windows SDK: (git root)\usr\bin\sh.exe (report, repro)
  • Windows with other identifiable MSYS2: Do we use its sh as a fallback?
  • Windows with an .sh file association: Do we fall back to it if we can prove it's a shell?
  • Allow specifying at runtime which POSIX shell to use git-for-windows/git#3301

Motivation 🔦

Rationale

gitoxide runs some commands in shells. It must do so, for some commands, because the semantics of some environment variables and configuration variables, as defined by Git, which gitoxide wishes to be compatible with, stipulates it in some cases.

(It is also often important not to run commands in shells where applicable semantics make that the wrong thing to do. But that is independent of this issue.)

Whatever shell is provided as POSIX sh, and named as such, should generally be used to run shell commands in situations where no #! is applicable because we are running a command rather than a script. For example, core.sshCommand is a shell command. It must, at least in the general case, be run in some shell. (See also #1752 and #1758.)

Relevance

This is not a blocker for #1758. These are ideas that don't belong in the scope of #1758--except for the parts that, as noted above, are already covered there--and that I may not get to immediately. I am opening this so it is not forgotten.

The relationship to running hooks is indirect: hooks with a #!/bin/sh or similar hashbang, on systems like Windows where hashbangs cannot be used directly, could use the sh.exe shell, which #1758 helps find correctly and which some of the items here would help find in even more situations. However, if considering what is discovered here for running hooks that don't specify a hashbang, I recommend making sure to confine it only to cases where that is needed for compatibility with existing Git behavior. (In general, whether scripts with no #! are run with a shell like /bin/sh or simply fail to execute varies, and is implemented, per application. exec-family functions would generally fail to run them.)

This does not directly help, and probably will not help at all, with the problem of getting an environment that picks up things like ssh-agent in setups where that is wanted but does not automatically work.

@EliahKagan EliahKagan added the enhancement New feature or request label Jan 12, 2025
@Byron
Copy link
Member

Byron commented Jan 12, 2025

Thanks a lot for surfacing this problem. Thus far I just tried to solve it quickly, without having a full understanding of it which seems to be required for correctness.

Whatever shell is provided as POSIX sh, and named as such, should generally be used to run shell commands in situations where no #! is applicable because we are running a command rather than a script. For example, core.sshCommand is a shell command. It must, at least in the general case, be run in some shell. (See also #1752 and #1758.)

Actually, Git for Windows applies an optimization that runs the programs directly after performing argument splitting manually, when possible.

I also wonder if this shell selection should actually use the shell() function that was recently added to gix_path::env:: to assure when sh is used, it's the right one. Maybe this fits into the research as well.
My intuition here is to do it as the using the right shell (as opposed to some shell picked up from PATH) is important.

@EliahKagan
Copy link
Member Author

Actually, Git for Windows applies an optimization that runs the programs directly after performing argument splitting manually, when possible.

Thanks--I was unaware that Git for Windows did that. I did know about gitoxide has such functionality, though, which is what motivated me to say "at least in the general case."

@Byron
Copy link
Member

Byron commented Jan 12, 2025

I think I copied it from the Git for Windows implementation, but also wanted to say that as part of this research this may be scrutinised. Maybe it's not always the same - probably that pretty much depends on environment variables. My guess is that GUI applications would prefer to use a shell always so there is a chance additional environment is picked up, and as a matter of fact GitButler is already doing it. It does so not just to increase chances of working correctly, not because it could reproduce an issue that occurred when not using a shell. So all of this is still quite vague at least in terms of practical experience.

@rvolgers
Copy link

rvolgers commented Jan 12, 2025

It is probably a good idea to do what existing software on Windows does, since it increases the chances of being compatible with existing configurations. I would also err on the side of using fewer intermediate binaries.

Windows argument passing is an unexpectedly deep rabbit hole. As an example, executing binaries which happen to be compiled to use the ANSI instead of Unicode APIs can cause wildly inconsistent command line parsing, leading to problems: https://devco.re/blog/2025/01/09/worstfit-unveiling-hidden-transformers-in-windows-ansi/

TLDR: Windows will map unicode characters to characters which vaguely resemble them, including ones with special meaning like backslash and double quote. This combined with Windows passing the command line to each program as a single string (each program is responsible for argument splitting) causes problems.

EliahKagan added a commit to EliahKagan/gitoxide that referenced this issue Mar 18, 2025
Now that Git for Windows 2.49.0 has a stable release, this changes
the upgrade step that was added to `test-fixtures-windows` in
4237e5a (GitoxideLabs#1870), so that it downloads an installer from the release
marked as "latest", rather than the release that has the newest
tag. The release marked "latest" is usually a stable release in
projects that have any stable releases, and in particular it is a
stable release in Git for Windows.

This is *not* needed to switch from the release candidate to the
stable release for 2.49.0. The download logic already in place
currently gets the stable release automatically, because it is the
newest tag.

Nonetheless, there are three reasons to prefer the "latest" tag to
get the stable release, now that the stable release is available.
In descending order of significance, they are:

- We upgrade to work around GitoxideLabs#1849, for which 2.49.0 is preferable
  to 2.48.1 (which the Windows runner images currently have).
  Continuing to take the newest tag will eventually take a
  pre-release for the next version. That would probably work, but
  it is not currently a goal.

  There is sometimes a delay between when a stable release of Git
  for Windows comes out and when the stable runner images are
  released with it. (Pre-release runner images exist, but they are
  not run on GitHub-hosted runners.) So even assuming this upgrade
  step is to be removed once it is no longer needed, it could
  easily end up remaining long enough for a new Git for Windows
  pre-release to come out.

- An update may potentially be released for an earlier minor
  version (y in x.y.z), in which case the tag for it would be
  newer and we would downgrade instead. Now that the release
  marked "latest" is usable here, we can use it and avoid that.

- If we decide to eventually deliberately test pre-releases, the
  step added in GitoxideLabs#1849 would probably not be usable in that form,
  because it could take either the next pre-release or a patch to
  an ealier release per the above points, and also for the separate
  reason that this CI job is not necessarily where we would want to
  test that. (As one example, there is currently no CI testing of
  the Git for Windows SDK, even though supporting it is an explicit
  goal discussed in GitoxideLabs#1758, GitoxideLabs#1761, GitoxideLabs#1862, and GitoxideLabs#1864. If that is
  added, it may be a more opportune way to test prereleases.)
EliahKagan added a commit to EliahKagan/gitoxide that referenced this issue Mar 18, 2025
Now that Git for Windows 2.49.0 has a stable release, this changes
the upgrade step that was added to `test-fixtures-windows` in
4237e5a (GitoxideLabs#1870), so that it downloads an installer from the release
marked as "latest", rather than the release that has the newest
tag. The release marked "latest" is usually a stable release in
projects that have any stable releases, and in particular it is a
stable release in Git for Windows.

This is *not* needed to switch from the release candidate to the
stable release for 2.49.0. The download logic already in place
currently gets the stable release automatically, because it is the
newest tag.

Nonetheless, there are three reasons to prefer the "latest" tag to
get the stable release, now that the stable release is available.
In descending order of significance, they are:

- We upgrade to work around GitoxideLabs#1849, for which 2.49.0 is preferable
  to 2.48.1 (which the Windows runner images currently have).
  Continuing to take the newest tag will eventually take a
  pre-release for the next version. That would probably work, but
  it is not currently a goal.

  There is sometimes a delay between when a stable release of Git
  for Windows comes out and when the stable runner images are
  released with it. (Pre-release runner images exist, but they are
  not run on GitHub-hosted runners.) So even assuming this upgrade
  step is to be removed once it is no longer needed, it could
  easily end up remaining long enough for a new Git for Windows
  pre-release to come out.

- An update may potentially be released for an earlier minor
  version (y in x.y.z), in which case the tag for it would be
  newer and we would downgrade instead. Now that the release
  marked "latest" is usable here, we can use it and avoid that.

- If we decide to eventually deliberately test pre-releases, the
  step added in GitoxideLabs#1849 would probably not be usable in that form,
  because it could take either the next pre-release or a patch to
  an ealier release per the above points, and also for the separate
  reason that this CI job is not necessarily where we would want to
  test that.

  (As one example, there is currently no CI testing of the Git for
  Windows SDK, even though supporting it, in general and for
  running the test suite, is an explicit goal discussed in GitoxideLabs#1758,
  GitoxideLabs#1761, GitoxideLabs#1862, and GitoxideLabs#1864. If that is added, it may be a more
  opportune way to test prereleases.)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants