-
-
Notifications
You must be signed in to change notification settings - Fork 32k
argparse: mismatch between choices parsing and usage/error message #61181
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
When passing a string for the choices argument, argparse's usage and error messages differ from its behavior: >>> p = argparse.ArgumentParser()
>>> p.add_argument('a', choices='abc')
>>> p.parse_args(['d'])
usage: [-h] {a,b,c}
: error: argument a: invalid choice: 'd' (choose from 'a', 'b', 'c')
>>> p.parse_args(['bc'])
Namespace(a='bc') This is because argparse uses the "in" operator instead of sequence iteration to check whether an argument value is valid. Any resolution should also consider the behavior for string subclasses as well as perhaps bytes-like objects. |
I forgot to mention that argparse uses such cases as examples in its documentation (one of which was replaced in bddbaaf332d7). |
Isn't this really just an inappropriate use of a string instead of a list? If indeed this is in the documentation, it should be changed. I still don't like:
>>> p.add_argument('a', choices=list('abc'))
but at least it would work. This call to list() could be done internally, but I think passing in a string is a bad practice and argparse should not contain internal workarounds to cater to this usage. If you're proposing that argparse should use sequence iteration instead of the "in" operator, I disagree with that solution. |
This wasn't just in the documentation. It was the *first* example of how to use the choices argument out of the two examples in that section (from the time Python first adopted argparse and before): ---- >>> parser = argparse.ArgumentParser(prog='PROG')
>>> parser.add_argument('foo', choices='abc')
>>> parser.parse_args('c'.split())
Namespace(foo='c')
>>> parser.parse_args('X'.split())
usage: PROG [-h] {a,b,c}
PROG: error: argument foo: invalid choice: 'X' (choose from 'a', 'b', 'c') (from http://hg.python.org/cpython/file/c0ddae67f4df/Doc/library/argparse.rst#l1021 ) So I think it's a bit late to say it's an inappropriate usage or bad practice. To preserve backwards compatibility, I think we should use sequence iteration for strings, or equivalently apply "in" to iter(choices), set(choices), etc. when choices is a string. I don't think, however, that we should alter the choices attribute because that could break things for people: >>> p = argparse.ArgumentParser()
>>> a = p.add_argument("letter", choices='abcd')
>>> a.choices
'abcd' |
There are also test cases with a string being passed for choices. |
Attached a patch. Rather than altering choices or making a special check for string instances, I just changed the if statement to if action.choices is not None and value not in list(action.choices): from if action.choices is not None and value not in action.choices: It has the added benefit of handling all sequence types correctly (rather than just strings). I tried to think of a case where this wouldn't work as expected, but wasn't able to. |
See bpo-16468 for why that won't work in general. |
The only time this would be an issue is for infinite sequences via range or a generator, which doesn't work anyway. >>> p = argparse.ArgumentParser()
>>> a = p.add_argument('a', choices=itertools.count(0), type=int)
>>> p.parse_args(['10000'])
... hangs Are there any other cases where coercing to a list wouldn't work? |
When choices isn't iterable but supports the "in" operator, e.g. class MyChoices(object):
def __contains__(self, item):
return True |
test_argparse.py has some "choices='abc'" cases. In those should "parser.parse_args(['--foo','bc'])" be considered a success or failure? The underlying issue here is that while string iteration behaves like list iteration, string __contains__ looks for substrings, not just one character that matches. (String __contains__ also returns a TypeError if its argument is not a string.) But other than removing poor examples in documentation and tests, I'm not sure this issue requires a change. |
Changing _check_value from: def _check_value(self, action, value):
# converted value must be one of the choices (if specified)
if action.choices is not None and value not in action.choices:
... to def _check_value(self, action, value):
# converted value must be one of the choices (if specified)
if action.choices is not None:
choices = action.choices
if isinstance(choices, str):
choices = list(choices)
if value not in action.choices:
... would correct the string search without affecting other types of choices. |
I just posted a patch to http://bugs.python.org/issue16468 that deals with this 'bc' in 'abc' issue. |
We could make the error message less wrong: >>> p.parse_args(['d'])
usage: [-h] {a,b,c}
: error: argument a: invalid choice: 'd' (choose a value in 'abc') % git diff
diff --git a/Lib/argparse.py b/Lib/argparse.py
index b44fa4f0f6..f03cc1f110 100644
--- a/Lib/argparse.py
+++ b/Lib/argparse.py
@@ -2499,8 +2499,8 @@ def _check_value(self, action, value):
# converted value must be one of the choices (if specified)
if action.choices is not None and value not in action.choices:
args = {'value': value,
- 'choices': ', '.join(map(repr, action.choices))}
- msg = _('invalid choice: %(value)r (choose from %(choices)s)')
+ 'choices': repr(action.choices)}
+ msg = _('invalid choice: %(value)r (choose a value in %(choices)s)')
raise ArgumentError(action, msg % args)
#
```======================= |
Substrings of the specified string no longer considered valid values.
This can even be classified as a minor security issue. The current behavior is most likely not what most of users mean. It allows invalid data to pass the primary validation. Examples with string #124578 converts the string There is a similar issue with bytes and bytearray, but it has much less chance to occur (you need a special type converter that can return int and bytes), so I leave it as is. |
…4578) Substrings of the specified string no longer considered valid values.
…ythonGH-124578) Substrings of the specified string no longer considered valid values. (cherry picked from commit f1a2417) Co-authored-by: Serhiy Storchaka <[email protected]>
…ythonGH-124578) Substrings of the specified string no longer considered valid values. (cherry picked from commit f1a2417) Co-authored-by: Serhiy Storchaka <[email protected]>
…GH-124578) (GH-124756) Substrings of the specified string no longer considered valid values. (cherry picked from commit f1a2417) Co-authored-by: Serhiy Storchaka <[email protected]>
…GH-124578) (GH-124755) Substrings of the specified string no longer considered valid values. (cherry picked from commit f1a2417) Co-authored-by: Serhiy Storchaka <[email protected]>
Uh oh!
There was an error while loading. Please reload this page.
Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.
Show more details
GitHub fields:
bugs.python.org fields:
Linked PRs
The text was updated successfully, but these errors were encountered: