Improve APE shell script #85

tomberek · 2021-02-28T19:38:49Z

I'm helping package this up for NixOS where our executables in the store are read-only. This causes an issue with the small bootstrapping shell script to convert itself into a native ELF. We can self-bootstrap during build-time, but this makes the result non-portable.

When run as sh hello.com it just fails with Permission denied. When run as ./hello.com it enters an infinite loop (perhaps the mini-script should check the result of the redirect for failure before exec'ing itself?)

Not sure the best way to proceed. Random list of thoughts:

include a conditional to check if self-modification was successful (prevent infinite loop)
Allow for the modified binary to be placed into TMP, (runtime cost for each invocation)
known hash/location for modification (used by arx for a similar reason: https://github.com/solidsnack/arx)
dd/mmap into a memory location and exec into there?
allow for a mechanism to override the location of the new binary in some way? out=${COSMO_LOC_HASH-$(command -v "$0")} or similar?
something else?

Referencing:

if [ -d /Applications ]; then
dd if="$o" of="$o" bs=8 skip="     351" count="      87" conv=notrunc 2>/dev/null
elif exec 7<> "$o"; then
printf '\177ELF\2\1\1\011\0\0\0\0\0\0\0\0\2\0\076\0\1\0\0\0\076\023\100\000\000\000\000\000\220\010\000\000\000\000\000\000\000\000\000\000\000\000\000\000\0\0\0\0\100\0\070\0\004\000\0\0\000\000\000\000' >&7
exec 7<&-
fi
exec "$0" "$@"
R=$?

btw: thanks for the great work on this project

The text was updated successfully, but these errors were encountered:

tomberek · 2021-02-28T20:08:13Z

Bringing the exec "$0" "$@" line into each branch of the previous conditional would allow the R=$? to do it's work. That would stop the infinite loop. It would proceed to the exit $R line further down.

tomberek · 2021-03-01T00:29:23Z

Or something like this, where

executes ./hello.com
if $COSMO_TEMP/hello does not exist...
....copy self to $COSMO_TEMP/hello, update to ELF
exec into $COSMO_TEMP/hello

where "hello" can also be a unique identifier or hash to prevent collisions.

feeley · 2021-03-01T00:52:30Z

Hashing or using a "unique" identifier is a dangerous practice because if there's a collision it could overwrite an existing executable (with possible security consequences). It is less problematic to derive the name in $COSMO_TEMP from the path of the executable. So if the path is /home/bob/hello.com the file created would be $COSMO_TEMP/home/bob/hello.com .

alisonatwork · 2021-03-01T01:15:53Z

I guess on a lot of UNIXes the trick will be finding a temp place that is both 1) writeable by users (i.e. not /usr/bin or /bin) and 2) not mounted noexec (i.e. not /var/tmp or /tmp). I think if these binaries were to be distributed via a traditional package manager in a locked-down UNIX environment, the right way would be to have a preinstall step where you execute it once as root (to ELF-ify it) before dropping it into the standard binary location.

To me this project seems more interesting for more "unzip and go" style software distribution, where you download something as a normal user and then just run it from your downloads directory, or put it into bin under your home directory.

jart · 2021-03-01T01:48:07Z

I'm willing to merge small improvements to the APE shell script. So long as the change isn't copying the whole executable to /tmp and executing that instead. I like the fact that the current design only requires changing 64-bytes, that it re-execs from the same location, and that subsequent executions happen in a purely native way.

The ideal thing to do would be patching the Linux kernel so that it recognizes the APE format and is able to to load it directly into memory without handing off execution to /bin/sh. In that case, the shell script would serve the purpose of enabling us to continue to support older kernels.

Cosmopolitan provides alternative ways to meet your requirements too. As discussed in another issue you can do two things:

You can ask the APE bootloader to generate ELF binaries by saying make CPPFLAGS=-DSUPPORT_VECTOR=113 which disables Windows + Metal + XNU support. See Compiling Lua #61 (comment)
You can write a fast wrapper program that performs an atomic /tmp copy only for the times when you need it. See Compiling Lua #61 (comment) Having a native program do this is better than a shell script because it means we can use vfork() and copy_file_range() which are less hairy than what a shell script is able to do. Native programs are also able to supply the original path as argv[0] which helps make that operation less disruptive although it unfortunately can't override getauxval(AT_EXECFN).

tomberek · 2021-03-01T14:30:01Z

Would memfd_create help here? I got this running:

@myenv = ();
foreach my $key (keys %ENV) {
    push(@myenv,"$key=$ENV{$key}");
    push(@myenv,0);
}
my $n="ape";
my $p=getppid();
my $pid=$$;

my $filename = "hello.com";
my $size = -s $filename;

$fd = syscall(319,$n,0); die "memfd_create $!" if-1==$n;
$rs = syscall(77,$fd,$size); die "ftruncate $!" if -1==$rs;
$ex = syscall(2,$filename,0); die "open: $!" if -1==$ex;
$rs = syscall(40,$fd,$ex,0,$size); die "sendfile $!" if -1==$rs;
# $rs = syscall(326,$ex,0,$fd,0,$size,0); die "copy_file_range: $!" if -1==$rs;
$rs = syscall(3,$ex,0); die "close: $!" if -1==$rs;
$rs = syscall(8,$fd,0,0); die "lseek $!" if -1==$rs;

my $hdr ="\177ELF\2\1\1\011\0\0\0\0\0\0\0\0\2\0\076\0\1\0\0\0\076\023\100\000\000\000\000\000\220\010\000\000\000\000\000\000\000\000\000\000\000\000\000\000\0\0\0\0\100\0\070\0\004\000\0\0\000\000\000\000";
$rs = syscall(1,$fd,$hdr,64); die ("write $!") if -1==$rs;
$rs = syscall(59,"/proc/$pid/fd/$fd",$ARGV,pack("p*",@myenv)); die ("execve: $!");

This leaves no trace on filesystem. No need for tmpfs or TMP; only requires procfs.

elimisteve · 2021-03-02T03:11:50Z

@tomberek Is the idea that you're making the 64-byte modification to the version of the binary that's just been loaded into RAM, not making that same edit to it on-disk? (Then executing that modified, in-RAM version.)

tomberek · 2021-03-02T03:47:46Z

@elimisteve This is a mechanism by which you can load a file into memory (it SHOULD be able to do this via copy_file_range, splice, sendfile or some other quick mechanism, i don't know which is best), modify the 64 bytes, then exec into it. It's Perl, so, not ideal.

Without patching the kernel, we'd need to either make the wrapper available somehow, (self extract a binary wrapper for any APE in order to re-use it? extract it each time or check PATH, it can be made super small), or find some other mechanism to make the correct syscalls from the shell context. I'm very likely re-exploring well-trodden ground.

My thought is that something like this can be a fallback if the script detects that the original is not writable, or can't find anywhere else to write. (trying to write it to TMP should be the first fallback? ew....)

alisonatwork · 2021-03-02T04:38:14Z

If the only solution is to end up writing it into the filesystem, my initial thought on a good location would be to do something similar to Go's UserCacheDir:

https://github.com/golang/go/blob/4c1a7ab49c4c68907bc7f7f7f776edd9116584a5/src/os/file.go#L393-L401

This uses XDG Base Directory Specification variables by default, which feels about as good as you can get on UNIX, especially from the shell. I think it's important not to write it to a shared directory, because you can't guarantee that the user running it can write to that destination, or execute files on the partition. Presumably no sysadmins are cruel enough to mount /home noexec.

This reminds me a bit of Python's pyc files. PEP 3147 has a bit of discussion on alternatives.

See PR #96 and issue #85

jart · 2021-03-08T07:08:19Z

The goal with APE is to overcome arbitrary platform boundaries that create toil for developers. Using memfd_create to worm around an intentional choice to mount ${TMPDIR:-/tmp} as noexec is not the kind of thing we do here. XDG appears to be Systemd thing and it's a great example of the depths I was aiming to avoid in the boot sector. Now that we have a better failure condition, I'm happy to trust the administrator to understand what's happening and then work around it in their preferred manner.

jart added the contributions welcome We'll commit to review and maintenance if the people who need it write the changes. label Mar 1, 2021

jart changed the title ~~Non self-modifying executable~~ Improve APE shell script Mar 1, 2021

jart pushed a commit that referenced this issue Mar 2, 2021

Fix APE error if unable to modify self

0eaad9d

See PR #96 and issue #85

jart closed this as completed Mar 8, 2021

lemaitre mentioned this issue Sep 2, 2021

Add APE interpreter #263

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Improve APE shell script #85

Improve APE shell script #85

tomberek commented Feb 28, 2021

tomberek commented Feb 28, 2021

Uh oh!

tomberek commented Mar 1, 2021

Uh oh!

feeley commented Mar 1, 2021

Uh oh!

alisonatwork commented Mar 1, 2021

Uh oh!

jart commented Mar 1, 2021

Uh oh!

tomberek commented Mar 1, 2021

Uh oh!

elimisteve commented Mar 2, 2021 •

edited

Loading

Uh oh!

tomberek commented Mar 2, 2021

Uh oh!

alisonatwork commented Mar 2, 2021

Uh oh!

jart commented Mar 8, 2021

Uh oh!

Uh oh!

Improve APE shell script #85

Improve APE shell script #85

Comments

tomberek commented Feb 28, 2021

tomberek commented Feb 28, 2021

Uh oh!

tomberek commented Mar 1, 2021

Uh oh!

feeley commented Mar 1, 2021

Uh oh!

alisonatwork commented Mar 1, 2021

Uh oh!

jart commented Mar 1, 2021

Uh oh!

tomberek commented Mar 1, 2021

Uh oh!

elimisteve commented Mar 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tomberek commented Mar 2, 2021

Uh oh!

alisonatwork commented Mar 2, 2021

Uh oh!

jart commented Mar 8, 2021

Uh oh!

elimisteve commented Mar 2, 2021 •

edited

Loading