std::os::argparse module #1897

alexveden · 2025-01-27T13:46:16Z

std::os::argparse module

hwchen · 2025-01-27T15:49:18Z

I don't know if the API for std argparse has already been discussed (it's not obvious from a quick search of the issues or from looking at the test runner PR). If not, I've got opinions 😄 and code I'd be willing to donate. But if this has already been decided, I don't want to derail.

alexveden · 2025-01-27T16:09:59Z

There are a bunch of argparse tests in the test runner PR, so you can get a sense of its API from there. Anyway, I'm open to ideas.

lerno · 2025-01-29T20:46:09Z

@hwchen did you have some feedback?

hwchen · 2025-01-31T04:39:01Z

Just want to be clear that I'm not really commenting on the current implementation. I'm more interested in whether there's a certain type of API we're looking for in an argparse module.

I come from Rust, and not C, so I'll explain in terms of those libraries.

Clap is very full-featured: help text generation, deriving a parser from struct attributes (tags), explicit subcommands, validation, and a built-in API for parsing common types.
lexopt is very minimal, it only provides a stream of values/options.

The ripgrep crate moved away from clap to lexopt, in part to reduce dependencies, and also because lexopt would end up providing more control over arg parsing (at the cost of having to implement more boilerplate).

I feel that the current PR API sits between the two (more towards simplicity). I think for stdlib, I'd prefer either extreme; if it's simpler, more complex parsers can be built on it, and if it has more features it can be used easily as-is for more scenarios. Odin ended up with something more comprehensive (you can define opts using a struct with tags).
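To make the "minimal" end of that spectrum concrete, here is a rough C sketch of a lexopt-style stream: the library only classifies each token as a short option, long option, or value, and leaves everything else to the caller. All names (`ArgStream`, `arg_stream_next`, and so on) are made up for illustration; this is neither the PR's API nor lexopt's.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical types for illustration only. */
typedef enum { ARG_END, ARG_SHORT, ARG_LONG, ARG_VALUE } ArgKind;

typedef struct {
    ArgKind kind;
    const char *text; /* option name without dashes, or the value itself */
    size_t len;       /* length of the name/value (1 for short options) */
} Arg;

typedef struct {
    int argc;
    char **argv;
    int index;           /* next argv slot to read */
    const char *cluster; /* unread rest of a "-abc" cluster, or NULL */
    int only_values;     /* set once a bare "--" has been seen */
} ArgStream;

ArgStream arg_stream_init(int argc, char **argv)
{
    ArgStream s = { argc, argv, 1, NULL, 0 };
    return s;
}

/* Returns the next token; no allocation, no option table, no validation. */
Arg arg_stream_next(ArgStream *s)
{
    Arg a = { ARG_END, NULL, 0 };
    if (s->cluster && *s->cluster) {       /* continue inside "-abc" */
        a.kind = ARG_SHORT; a.text = s->cluster++; a.len = 1;
        return a;
    }
    s->cluster = NULL;
    if (s->index >= s->argc) return a;     /* nothing left: ARG_END */
    const char *cur = s->argv[s->index++];
    if (!s->only_values && cur[0] == '-' && cur[1] != '\0') {
        if (cur[1] == '-') {
            if (cur[2] == '\0') {          /* bare "--": the rest are values */
                s->only_values = 1;
                return arg_stream_next(s);
            }
            a.kind = ARG_LONG; a.text = cur + 2; a.len = strlen(cur + 2);
            return a;
        }
        a.kind = ARG_SHORT; a.text = cur + 1; a.len = 1;
        s->cluster = cur + 2;              /* remaining clustered flags */
        return a;
    }
    a.kind = ARG_VALUE; a.text = cur; a.len = strlen(cur);
    return a;
}
```

The caller drives it with a loop and a switch on `kind`, which is where the extra boilerplate (and the extra control) comes from.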

Also, I believe that wherever we want to sit on the spectrum, it's good to be explicit about it.

As for my own biases, I've written an arg parsing library for C3 which follows the general structure of lexopt's API. I might prefer something like it in the std library, but I can also see the appeal of other approaches. And seeing as everybody ends up writing their own argparse, there are probably a lot of other opinions out there too :)

tomaskallup · 2025-02-01T21:43:36Z

I have a bit of feedback on this.

I feel like the API is fine; it's exactly what it says it is: an argument parser. If something more like a full-blown CLI app API were needed (to have zero-hassle subcommands and whatnot), it could be in another module, which would utilize argparse under the hood.

What I currently don't see is a way to provide an array option, since from the implementation it would seem that providing a single option multiple times would result in an error of "duplicated option". The value of the option could be handled by the callback function from the looks of it.

The only other thing that came to mind was a bit more "hackability". For example, if I wanted to somehow implement validation of a parameter, I would have to do it myself after the parsing, and I would also have to write the extra help info (if it was, for example, an enum). But again, this could be solved by the wrapping module, which would hold the user's hand a bit more. Although my view is similar to hwchen's above, I feel like the current implementation here is good enough, and if one wants to opt out of some of the features, they still can (for example, the help option is opt-in).

Edit: I see now that the callback function can return optional, which makes the hackability possible for validation or exclusivity of options.

alexveden · 2025-02-02T13:39:57Z

> What I currently don't see is a way to provide an array option, since from the implementation it would seem that providing a single option multiple times would result in an error of "duplicated option". The value of the option could be handled by the callback function from the looks of it.

This is the kind of thing I was thinking about. I think it's common for CLIs to have accumulated values, e.g. -vvvv for verbosity levels. I didn't implement arrays because I wanted argparse to be non-allocating. But I think it may be a good idea to add multiple values, or at least make it possible to do it with callbacks.

So by design, the callback mechanism is the way to extend the argparse to whatever is needed. I can refine callbacks and arrays of arguments after PR approval.
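As a rough illustration of accumulating a value through a callback without allocating, the C sketch below counts every occurrence of a flag such as -v into an int owned by the caller. The callback signature and the `for_each_flag` driver are hypothetical and are not taken from the PR.

```c
#include <stddef.h>

/* Hypothetical callback type: invoked once per occurrence of a flag.
   Returns 0 on success, non-zero to abort parsing. */
typedef int (*FlagCallback)(char flag, void *user);

/* Accumulates occurrences into an int owned by the caller (no allocation). */
int count_occurrence(char flag, void *user)
{
    (void)flag;
    int *counter = user;
    (*counter)++;
    return 0;
}

/* Reports every occurrence of `wanted` inside short-option clusters,
   so "-v -v" and "-vv" are treated the same way. Long options ("--x"),
   a lone "-", and positional arguments are skipped. */
int for_each_flag(int argc, char **argv, char wanted,
                  FlagCallback cb, void *user)
{
    for (int i = 1; i < argc; i++) {
        const char *arg = argv[i];
        if (arg[0] != '-' || arg[1] == '-' || arg[1] == '\0') continue;
        for (const char *c = arg + 1; *c; c++) {
            if (*c != wanted) continue;
            int rc = cb(*c, user);
            if (rc != 0) return rc; /* the callback may reject the option */
        }
    }
    return 0;
}
```

Because the state lives behind the `user` pointer, the parser itself never needs to know whether the option is a counter, a list, or something else.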

> The only other thing that came to mind was a bit more "hackability", for example if I wanted to somehow implement validation of a parameter, I would have to do it myself after the parsing and I would also have to write the extra help info (if it was for example an enum).

All hackability is implemented via callbacks, or via explicit param validation after parsing in main (or another function). The argparse module still does simple validation: if you expect an int in the option value and are given a string, it will raise a validation error. More complex cases should be handled by the program, either in an argparse callback or in regular code after parsing completes.
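The split described here (a built-in type check, then program-specific rules in a callback) could look roughly like the C sketch below. `OptStatus`, `parse_int_option`, and `validate_port` are hypothetical names chosen for the example; the PR's real error values and callback signature may differ.

```c
#include <errno.h>
#include <limits.h>
#include <stdlib.h>

typedef enum { OPT_OK = 0, OPT_ERR_NOT_INT, OPT_ERR_REJECTED } OptStatus;

/* Hypothetical validator callback supplied by the program. */
typedef OptStatus (*IntValidator)(int value, void *user);

/* Step 1 (library side): the text must be a whole base-10 int.
   Step 2 (program side): the optional callback applies extra rules.
   `*out` is only written when both steps succeed. */
OptStatus parse_int_option(const char *text, int *out,
                           IntValidator validate, void *user)
{
    char *end = NULL;
    errno = 0;
    long v = strtol(text, &end, 10);
    if (end == text || *end != '\0' || errno == ERANGE ||
        v < INT_MIN || v > INT_MAX)
        return OPT_ERR_NOT_INT;
    if (validate) {
        OptStatus st = validate((int)v, user);
        if (st != OPT_OK) return st;
    }
    *out = (int)v;
    return OPT_OK;
}

/* Example of a program-specific rule: a TCP port must be 1..65535. */
OptStatus validate_port(int value, void *user)
{
    (void)user;
    return (value >= 1 && value <= 65535) ? OPT_OK : OPT_ERR_REJECTED;
}
```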

tomaskallup · 2025-02-02T14:08:56Z

So for the arrays, just a simple "multiple" flag would be needed on the arg? It would also require you to use the callback.

Since now it would call the callback once and then error. I'm fine with arrays not being available by default and requiring custom implementation.

alexveden · 2025-02-02T15:48:22Z

FYI, I've found array args impractical in most cases; I can barely remember anything I've used with array args, except maybe gcc :). For simple use, it's possible to use --flag + an array of arguments.

tomaskallup · 2025-02-02T19:26:35Z

That's what most tools do, single flag with values separated by some character. But sometimes you might want those values to be arbitrary strings and there might not be a feasible separator, like when specifying ENV variables for docker etc.
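For the repeated-option case (docker-style `-e KEY=VALUE` given several times), a non-allocating version is possible if the caller supplies the storage. The C sketch below is one way to do that; `StringList`, `collect_value`, and `collect_repeated` are hypothetical names, and the return codes are arbitrary choices for the example.

```c
#include <stddef.h>
#include <string.h>

/* Fixed-capacity list backed by caller-provided storage. */
typedef struct {
    const char **items;
    size_t capacity;
    size_t count;
} StringList;

/* Hypothetical per-value callback: stores the pointer, never copies.
   Returns 0 on success, -1 when the caller's storage is full. */
int collect_value(const char *value, void *user)
{
    StringList *list = user;
    if (list->count >= list->capacity) return -1;
    list->items[list->count++] = value;
    return 0;
}

/* Calls `cb` with the argument following each occurrence of `flag`.
   Returns 0 on success, -2 if the flag is last and has no value,
   or the callback's non-zero result. */
int collect_repeated(int argc, char **argv, const char *flag,
                     int (*cb)(const char *, void *), void *user)
{
    for (int i = 1; i < argc; i++) {
        if (strcmp(argv[i], flag) != 0) continue;
        if (i + 1 >= argc) return -2;
        int rc = cb(argv[++i], user); /* the value is consumed as-is */
        if (rc != 0) return rc;
    }
    return 0;
}
```

Since each value arrives as its own argv entry, it can contain commas, colons, or spaces without any separator convention.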

lerno · 2025-02-04T23:33:19Z

I am sorry this one isn't looked at yet. It's half past midnight and I don't have the time this lib deserves to check it. I'll need to push it to the weekend.

lerno · 2025-02-05T23:39:11Z

Maybe I'm not the kind of audience that uses something like this, but for me a simpler design is more natural, as you might have guessed from the way build_options.c works.

It is quite simple: a switch looks at each arg. If the arg starts with -, it goes through a second switch over the - options, and if it finds another -, that's a long opt, which is checked against the long opts.

This way checking is trivially stateful, which can be useful.
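A stripped-down C sketch of that shape is shown below. It is not the code from build_options.c; the `Options` struct and the specific flags (-v, -o, -O0..-O3, --verbose) are invented for the example.

```c
#include <stddef.h>
#include <string.h>

typedef struct {
    int verbose;
    int optimize;       /* 0..3, from -O0 .. -O3 */
    const char *output; /* argument following -o */
    const char *input;  /* first positional argument */
    int error;          /* set on any unknown or malformed option */
} Options;

Options parse_options(int argc, char **argv)
{
    Options o = { 0, 0, NULL, NULL, 0 };
    for (int i = 1; i < argc; i++) {
        const char *arg = argv[i];
        if (arg[0] != '-') {            /* positional argument */
            if (!o.input) o.input = arg;
            continue;
        }
        switch (arg[1]) {               /* short options */
        case 'v':
            o.verbose = 1;
            break;
        case 'o':                       /* stateful: consumes the next arg */
            if (i + 1 >= argc) { o.error = 1; return o; }
            o.output = argv[++i];
            break;
        case 'O':
            if (arg[2] < '0' || arg[2] > '3' || arg[3] != '\0') {
                o.error = 1;
                return o;
            }
            o.optimize = arg[2] - '0';
            break;
        case '-':                       /* second '-': long option */
            if (strcmp(arg + 2, "verbose") == 0) { o.verbose = 1; break; }
            o.error = 1;
            return o;
        default:
            o.error = 1;
            return o;
        }
    }
    return o;
}
```

The loop index is ordinary local state, so an option like -o can consume the following argument just by incrementing it.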

So the useful functionality is not parsing the arguments but rather:

Skip an argument
Check if a string (argument) is a valid file or directory
Check if a string (argument) is an int
Check if a string (argument) is one in a list of values, and return that index.
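Two of these helpers are sketched in C below: a lookup in a list of allowed values that returns the index, and a file-or-directory check. The names are hypothetical, and the path check assumes a POSIX system (`stat` from `<sys/stat.h>`); a portable std module would need a different backend.

```c
#include <stddef.h>
#include <string.h>
#include <sys/stat.h> /* POSIX assumption */

/* Returns the index of `arg` in `choices`, or -1 if it is not listed. */
int arg_index_in(const char *arg, const char *const *choices, size_t count)
{
    for (size_t i = 0; i < count; i++) {
        if (strcmp(arg, choices[i]) == 0) return (int)i;
    }
    return -1;
}

typedef enum { PATH_MISSING, PATH_FILE, PATH_DIR, PATH_OTHER } PathKind;

/* Classifies `arg` as a regular file, a directory, something else,
   or a path that cannot be stat'ed (treated as missing). */
PathKind arg_path_kind(const char *arg)
{
    struct stat st;
    if (stat(arg, &st) != 0) return PATH_MISSING;
    if (S_ISREG(st.st_mode)) return PATH_FILE;
    if (S_ISDIR(st.st_mode)) return PATH_DIR;
    return PATH_OTHER;
}
```

Returning the index rather than a boolean lets the caller map the argument straight onto an enum value.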

What are your thoughts?

std::os::argparse module

473e519

Merge branch 'c3lang:master' into alexveden/args_parse

e207013
