refactor(commands)!: change arg handling strategy #12441

RoloEdits · 2025-01-07T03:59:11Z

Closes: #7453
Closes: #6171
Closes: #10549
Closes: #10993

Blocking: #12320
Blocking: #12288
Blocking: #11164

helix-term/src/commands/typed.rs

helix-core/src/shellwords.rs

RoloEdits · 2025-01-08T14:02:05Z

The changes here seem to work well with the flags PR. This should be ready for review, I think I just need to touch up the wording in the errors when the params count it off. I will open this for review and then get to that as well as update the writeup with actual content.

helix-core/src/shellwords.rs

helix-term/src/commands/typed.rs

the-mikedavis · 2025-01-08T18:04:05Z

helix-core/src/shellwords.rs

+    /// but a normal `&str`.
+    #[inline]
+    #[must_use]
+    pub fn peek(&self) -> Option<Cow<'_, str>> {


This can be removed: a Peekable should be used instead now that the item might be an allocation

I think I can just make this private instead. When its parsing flags it wont ever allocate. And when its collected it has them all enabled for Parameters, which could allocate, but it never used peek, just a collect. I need internal access to the idx. The string that passed to unescape has to have the correct lifetime, its its done through a function (like rest) im pretty sure it will error as not living long enough: unescape(&input[args.idx..], false)

It's small so if we need to use it later we can reintroduce it. Let's remove it for now.

I can remove it here and then in flags add it if I need it. As it is really only meant to be used there.

Also just to clarify on why this is most likely needed, if Literal parsing can have flags, like with sh for example. I need to exhaust the flags but still be able to save the rest of the remaining string as one thing. Peekable does not give access to the internal state and the way it works it to store the next value in an Option<Option<I::Item>>. This advances the iterator one place too many when taking the idx into account. Thats why I clone in this one to get a new instance as no state can be advanced in the looping iterator. Not only does Peekable not have access to the rest function, it wouldnt work correctly even if it did as already stated, when passing to unescape it creates a lifetiem that doesnt live long enough. So I need to implement rest again here with unescape(&input[args.idx..]) which gives a correct lifetime tied to input. Regardless though, this is really only needed for the flags. I will just put it there.

For things that use literal parsing we can enforce that flags are only at the beginning of the input. Kakoune has a similar concept, see ParameterDesc::Flags in parameters_parser.hh

Thats how it is now, at least in being assumed as such. The main issue is that I need the remaining part of the input which means it needs to be aware of progress that was made in idx during the loop.

:sh --no-popup wezterm spawn floating-pane lazygit yielded^ ^idx.. unescape(&'a input[idx..], _)

helix-core/src/shellwords.rs

RoloEdits · 2025-01-10T17:56:47Z

Hmm, im wondering if we need special case "" a bit. If we pass it to Args with Args::from(""), with its default ParseMode::Raw, it will always create a Vec, even if the only entry is vec![""]. This means that Args::from("").is_empty() is false, as there is currently one in the Vec (current implimentation is self.positionals.is_empty). We might need to check if the first argument is also empty or not with an is_some_and. Or maybe just filter out empty positions before collecting? filter(|arg| !arg.is_empty). Or return None whenever self.input.is_empty.

first and last also suffer similar problems if the only entry is "". len as well.

RoloEdits · 2025-01-10T18:56:48Z

Or return None whenever self.input.is_empty.

I did this for now as it seems more intuitive to have it like this.

RoloEdits · 2025-01-10T19:53:53Z

The completion paths for windows are handled in such a way that makes them wrong and hard to parse around in trying to adhere to the normal windows conventions. For example :o dir\"path to file.txt" is an invalid path in windows, yet this is what the completion is:

Because the backslashes are path segments, unless we make every windows path completion double backslash, you cant really escape spaces.

When typing / on windows it maintains the list of relative path completions, but with \ it provides drive level completions.

What we could do is just abstract the conventions and offer completions with / path segments. This way there wouldnt need to be compile time parser mode differences and spaces can be escaped in the same way and actually provide a correct representation to then convert into a proper path for the OS. Rust itself does this when handling strings to paths. This would simplify some things in code.

RoloEdits · 2025-01-10T20:54:47Z

Another thing that could work is instead of replacing just the segment to surround with quotes, it replaces the whole thing once and puts an open quote at the start of the path. The way the parsing is done with the completions now this would still allow to complete segments and then you could just manually close the quote if you need to open more files. Otherwise just accepting with no closing quote would open the file correctly.

…inds

the-mikedavis · 2025-01-13T22:05:26Z

I took a very long look into this, completions, flags and expansions. Usually I would prefer to see distinct features like these land independently but in this case I now think it's cleaner to land all at once since they can't really be independent code-wise. Each feature informs the design of the parsers, their output types and the completion code. I have changes locally now that add all of these features together based on Kakoune's approach. Kakoune's command line syntax and expansion strategy should be surprisingly effective for us. (Surprising because Kakoune doesn't support Windows.)

On the syntax level we should basically avoid supporting escapes like \n or \" - this simplifies compatibility with Windows. Unix can still do some escaping but it should be easy to avoid using backslashes. Unicode can instead be covered by expansions like %u{25CF} rather than \u{25CF}. The line ending should be covered by a variable expansion corresponding to the document's like ending %{line_ending} and maybe we can add special cases like %u{tab}/%u{lf}/u{crlf}. Otherwise the rest should be handled by quoting rules.

On the implementation level this looks like two parsers: one covering basically what shellwords does today (corresponding to ArgsParser), parsing the command line into tokens naively (unaware of the command's signature), and the other handling the distinction between flags and positionals (corresponding to Args). Both are used for the sake of completion. The expansion code feeds on the tokens from the first parser before passing them to the second.

I'll post my changes later this evening and I'd appreciate it if you could give them a try if you get a chance, especially on Windows. Also thanks for your work on this and its predecessor! These PRs have pushed the design forward quite a bit.

RoloEdits · 2025-01-14T01:21:29Z

Usually I would prefer to see distinct features like these land independently but in this case I now think it's cleaner to land all at once since they can't really be independent code-wise.

This is something I heavily felt as well. Many times I had almost asked if I could just combine them into one PR.

The expansion code feeds on the tokens from the first parser before passing them to the second.

If im understanding this right, I think this is what we had landed on for that PR too.

I'll post my changes later this evening and I'd appreciate it if you could give them a try if you get a chance, especially on Windows.

Will do. I have come across a lot of pitfalls and edge cases in doing this, so I will be sure to give it a good peek.

Also thanks for your work on this and its predecessor! These PRs have pushed the design forward quite a bit.

Am I understanding that this PR is dead, and you will be combining them in a new one?

Regardless, its been a good learning experience, I just wish I could have had more understanding around this, as it feels my ignorance really hindered seeing it through to the end. But I'm glad the groundwork helped.

the-mikedavis · 2025-01-14T01:47:30Z

Right yeah, I intend on continuing this with my own changes. I've posted #12527 and will hopefully merge it down within a few weeks. I would encourage you not to see this as a bad reflection on your work here - this and the predecessor are good changes and certainly far better than the questionable stuff we have in shellwords today. For me review is a balance of needing to familiarize myself with a problem and solution enough to know if a fix/enhancement will age well vs. spending time investigating it and working on it myself. In particularly complex areas like this that have lots of edge-cases and need design or necessary breaking changes it becomes more economical to make the change directly rather than guide an interested contributor. Aside from that we're generally more comfortable with accepting large (multi-K-line changes) from maintainers.

RoloEdits · 2025-01-14T02:43:41Z

I would encourage you not to see this as a bad reflection on your work here

No worries, this just got out of my paygrade, with so many things that this effected. It was the correct choice, to subsume it all into one integrated implimentation. The project is better off for it.

Thanks once again, cant wait to see it all land!

RoloEdits added 2 commits January 6, 2025 20:01

refactor(shellwords): change arg handling strategy

ff980a1

feat: add a params range hint to TypableCommand

4cb2f80

RoloEdits force-pushed the shellwords branch from 0814a3a to 2edddf1 Compare January 7, 2025 04:02

the-mikedavis reviewed Jan 7, 2025

View reviewed changes

helix-term/src/commands/typed.rs Outdated Show resolved Hide resolved

helix-core/src/shellwords.rs Outdated Show resolved Hide resolved

helix-core/src/shellwords.rs Outdated Show resolved Hide resolved

helix-core/src/shellwords.rs Outdated Show resolved Hide resolved

RoloEdits added 9 commits January 7, 2025 08:14

refactor: transition to a Vec wrapper for Args

c224387

refactor: add a signature field to TypableCommands

e92b17e

refactor: propgate ParseMode to command signature

a72436a

refactor: tweak Shellwords -> Args flow

0aa7fdd

refactor: make unescaped private

e45a46f

test: add base variant

c537874

fix: only validate argument count when enacting command

4d594b5

doc: fix examples

bde4c6c

refactor: tidy up the file

17202ec

RoloEdits force-pushed the shellwords branch from 2edddf1 to 17202ec Compare January 7, 2025 21:40

refactor: add typestate to ArgsParser around unescaping rules

7f53db5

RoloEdits force-pushed the shellwords branch from e47b55d to 7f53db5 Compare January 8, 2025 12:12

This was referenced Jan 8, 2025

feat(commands)!: add support for flags #12288

Closed

feat(commands): add support for custom typable commands #12320

Draft

Command expansion v2 #11164

Open

RoloEdits marked this pull request as ready for review January 8, 2025 14:02

refactor: tweak invalid param count error messages

f09a495

RoloEdits changed the title ~~refactor(shellwords): change arg handling strategy~~ refactor(commands): change arg handling strategy Jan 8, 2025

RoloEdits changed the title ~~refactor(commands): change arg handling strategy~~ refactor(commands)!: change arg handling strategy Jan 8, 2025

the-mikedavis reviewed Jan 8, 2025

View reviewed changes

RoloEdits added 3 commits January 8, 2025 09:28

refactor: remove uneeded From conversions

c54a26b

docs: add suggested changes

bf34167

refactor: remove elided lifetime

94a7556

the-mikedavis reviewed Jan 8, 2025

View reviewed changes

helix-core/src/shellwords.rs Show resolved Hide resolved

RoloEdits force-pushed the shellwords branch 3 times, most recently from 5e9274f to de7858d Compare January 10, 2025 03:43

fix: completer showing previous argument after space

910dfa6

RoloEdits force-pushed the shellwords branch from de7858d to 910dfa6 Compare January 10, 2025 03:44

RoloEdits added 3 commits January 9, 2025 20:29

fix: add special case if last char is \

81070da

feat: add final remaining two ParserMode variants

08b4dd6

reafactor: address more nits

fb3b6b7

RoloEdits force-pushed the shellwords branch from 52a2b4d to fb3b6b7 Compare January 10, 2025 05:32

refactor: change default ParseMode to Raw

b1b1e52

RoloEdits force-pushed the shellwords branch from 0387c22 to b1b1e52 Compare January 10, 2025 19:30

RoloEdits added 2 commits January 10, 2025 12:33

fix: check for end quote in ends_with_whitespace

1dd8001

fix: separate filepath parsing mode between unix and windows

b1b8b03

RoloEdits added 6 commits January 11, 2025 08:10

fix: pass Option<&str> to yank_join_impl instead of &str

9af8a7b

feat: add manual default function to ParseMode

31194a0

refactor: name arguments in debub_remote better

22d6d66

refactor: change toggle and set ParseMode to default

4660350

refactor: verify ensure_signature positionals for all ParseMode k…

fc05f8b

…inds

test: change name to better describe test

c7c1132

the-mikedavis mentioned this pull request Jan 14, 2025

Rewrite command line parsing, add flags and expansions #12527

Open

RoloEdits closed this Jan 14, 2025

RoloEdits deleted the shellwords branch January 14, 2025 02:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(commands)!: change arg handling strategy #12441

refactor(commands)!: change arg handling strategy #12441

RoloEdits commented Jan 7, 2025 •

edited

Loading

RoloEdits commented Jan 8, 2025

the-mikedavis Jan 8, 2025

RoloEdits Jan 8, 2025 •

edited

Loading

the-mikedavis Jan 8, 2025

RoloEdits Jan 8, 2025 •

edited

Loading

RoloEdits Jan 8, 2025 •

edited

Loading

the-mikedavis Jan 8, 2025

RoloEdits Jan 8, 2025 •

edited

Loading

RoloEdits Jan 8, 2025 •

edited

Loading

RoloEdits commented Jan 10, 2025 •

edited

Loading

RoloEdits commented Jan 10, 2025

RoloEdits commented Jan 10, 2025 •

edited

Loading

RoloEdits commented Jan 10, 2025

the-mikedavis commented Jan 13, 2025

RoloEdits commented Jan 14, 2025

the-mikedavis commented Jan 14, 2025

RoloEdits commented Jan 14, 2025

refactor(commands)!: change arg handling strategy #12441

refactor(commands)!: change arg handling strategy #12441

Conversation

RoloEdits commented Jan 7, 2025 • edited Loading

RoloEdits commented Jan 8, 2025

the-mikedavis Jan 8, 2025

Choose a reason for hiding this comment

RoloEdits Jan 8, 2025 • edited Loading

Choose a reason for hiding this comment

the-mikedavis Jan 8, 2025

Choose a reason for hiding this comment

RoloEdits Jan 8, 2025 • edited Loading

Choose a reason for hiding this comment

RoloEdits Jan 8, 2025 • edited Loading

Choose a reason for hiding this comment

the-mikedavis Jan 8, 2025

Choose a reason for hiding this comment

RoloEdits Jan 8, 2025 • edited Loading

Choose a reason for hiding this comment

RoloEdits Jan 8, 2025 • edited Loading

Choose a reason for hiding this comment

RoloEdits commented Jan 10, 2025 • edited Loading

RoloEdits commented Jan 10, 2025

RoloEdits commented Jan 10, 2025 • edited Loading

RoloEdits commented Jan 10, 2025

the-mikedavis commented Jan 13, 2025

RoloEdits commented Jan 14, 2025

the-mikedavis commented Jan 14, 2025

RoloEdits commented Jan 14, 2025

RoloEdits commented Jan 7, 2025 •

edited

Loading

RoloEdits Jan 8, 2025 •

edited

Loading

RoloEdits Jan 8, 2025 •

edited

Loading

RoloEdits Jan 8, 2025 •

edited

Loading

RoloEdits Jan 8, 2025 •

edited

Loading

RoloEdits Jan 8, 2025 •

edited

Loading

RoloEdits commented Jan 10, 2025 •

edited

Loading

RoloEdits commented Jan 10, 2025 •

edited

Loading