Designing command-line interfaces

mcantor|14 years ago

Here's another one:

Provide explicit flags for default behavior.

For example, if your lines-of-code counting utility excludes preprocessor directives by default, and includes them when you pass "-i", provide an "-x" switch that signals to use the default behavior. This way, when someone wants to write a bash script that uses your utility, they can do this:

    case $include_directives in
    y)
        LOC_FLAGS='-i';;
    n)
        LOC_FLAGS='-x';;
    esac

    myloc $LOC_FLAGS

This has three benefits:

1.) It's explicit, and thus more obvious and clearer.

2.) It's more consistent, so there's no risk of empty/unset variables, whitespace, or other edge conditions screwing up a delicate munging operation.

3.) It's change-tolerant, so if future versions of your utility change the default behavior, scripts will continue functioning as expected.

a3_nm|14 years ago

Other benefit: if you aliased myownloc="myloc -i --complicated_option", you can run myownloc -x to selectively disable the -i switch.

reinhardt|14 years ago

What about the disadvantages?

- It doubles the number of flags of the program.

- It allows invalid or ambiguous combinations: is `myloc -i -x` valid? If yes which flag takes precedence?

It seems these must outweigh the benefits or it wouldn't be so uncommon.

mcantor|14 years ago

Oh, and another:

Use your language's command-line option processing libraries.

OptionParser in ruby and argparse in python. There is no reason to eschew these libraries: they're part of the standard library, they require zero coupling to your app logic, and they handle all of the edge cases for free.

"But I can just shift the arguments", you say! Yeah? Great! What if the user pipes input through STDOUT? What if the user passes a flag, a required argument, and then another flag? Will your script read the last flag correctly, or has the naive logic already entered "required argument processing mode"? Just use the library. It's a solved problem!

reinhardt|14 years ago

Just use the library. It's a solved problem!

Indeed! In fact it is so solved that the Python standard library has solved it differently 3-4 (I've lost count) times already, let alone the dozen or two extra packages at PyPI ;)

antoarts|14 years ago

I choose not to discuss implementation in that article, but you should never reinvent the wheel (poorly).

scott_s|14 years ago

And if you use C++ and are willing to use Boost, program_options: http://www.boost.org/doc/libs/1_47_0/doc/html/program_option...

I recognize that many people are not willing to use C++, and among those that are, many are still unwilling to use Boost, but I find the program_options library to be great. An example use that I think is reasonable and a big win over doing it myself: https://github.com/scotts/cellgen/blob/master/src/main.cpp#L...

brunoc|14 years ago

I'm working on a CLI right now that's rather "complex" in Ruby and i'm seeing the other extreme: CLI frameworks and DSLs. I don't know about other languages but in Ruby your library options range from the minimal (DIY with OptsParser, Trollop) to the gigantic (Thor, Rubicon) and everything in between...

billmcneale|14 years ago

For Java: JCommander http://jcommander.org/

mahmud|14 years ago

This might not be shared by others, but here it goes.

If your CLI program is a file-format conversion utility, please include a way to dump meta-data, header formats, etc. Don't just silently convert from A to B, allow me to get at the info you have gathered from the input.

For example, a binary disassembler SHOULD dump the executable header format. An spreadsheet converter utility SHOULD display how many sheets there are, if there are macros, how many rows, etc.

One of my pet peeves is pdftotext, a very nifty utility that I use to convert PDF reports to ASCII for subsequent awking. pdftotext has an option to specify start and end pages to extract, but it doesn't have an -i or --info option that tells me how many pages a PDF file has. So, my scripts have a very high upper-limit, like 1000, and it converts the file page by page, until the output text page has a size of zero.

Which reminds me, I should probably fork the fucker this weekend, now that I have some free time.

jsrn|14 years ago

"[...] but it doesn't have an -i or --info option that tells me how many pages a PDF file has"

You could use pdfinfo - in Debian / Ubuntu, it is in the same package as pdftotext (poppler-utils).

To extract (only) the number of pages of a PDF:

    $ pdfinfo FILE.pdf | grep '^Pages' | tr -s ' ' | cut -d ' ' -f 2

Pewpewarrows|14 years ago

One thing that always infuriates me:

If I go to --help or the man page for your command, and don't see a real example of how to use it immediately, you've failed me as a user.

Seeing your syntax tree and a list of every option and its description doesn't help me when I'm first trying to use your program. I just want to see one or two quick examples of real commands with a short sentence explaining each. After that I'll dive into the mess that is the dozens of flags and inputs to decipher exactly what I want.

pmr_|14 years ago

It always struck me as odd that man pages tend to have the EXAMPLES section as the last section of the document. I really would like it the other way around but this kinda fits with a bottom-up approach to learning. Look at the small details then get the full picture and how it's supposed to be used.

code_duck|14 years ago

I've seen a few discussions of this around.

I agree with you, but there are some who believe that examples do not belong in manages (not sure what their rationale is).

Nick_C|14 years ago

man docs have a specific format, a followed by b followed by c, etc. Examples are towards the end, near bugs and author.

It may not be how you want it, but at least it's consistent. shift-G will take you straight to the end, with hopefully some examples.

there|14 years ago

     Do you really want to do this (y/n)?

i would rather see this as

     Do you really want to do this (y/N)?

capitalize the option that will be used by default if you hit enter with no other input.

fnl|14 years ago

I absolutely agree - and not only does the article barley scratch what is Unix standard anyways, and the more tricky questions such as piping, exit and error states, or the environment are left out. Kind of disappointing, in my opinion.

killerswan|14 years ago

I'm also used to seeing Do you want to do this? [n]:

mcantor|14 years ago

One more:

Keep your "usage" blurb succinct and clear.

Don't clobber your users' terminals with two pages of output when they're not expecting it. If "yourapp", "yourapp -h" or "yourapp --invalid-flag" results in two screenfuls of information containing your app's license, installation instructions, contribution notes, an exhaustive listing of every single one of the 100 available subcommands, and a verbose representation of a configuration setting that 75% of your users won't care about, you're doing it wrong. (I'm looking at you, rvm).

This is similar to the idea of "You don't really understand something unless you can explain it so your grandmother gets it." Your app doesn't really have a sensible interface unless the top layer of its abstraction, or the most common commands, can be summarized in less than scrillions of lines of text. If you simply can't trim it down far enough, it's because you're breaking from the Unix philosophy and your utility is doing too much.

zwp|14 years ago

Also tedious:

    $ ls -Q
    ls: illegal option -- Q
    usage: ls -aAbBcCdeEfFghHiklLmnopqrRsStuUwxvV1@/[c | v]%[atime | crtime | ctime | mtime | all] [files]

It's succinct but that second line is almost completely useless.

ilikepi|14 years ago

Speaking of "usage" blurbs, for $DIETY's sake, please assume 80-character wide terminals.

janus|14 years ago

Also I'm looking at you, rails.

Xurinos|14 years ago

The name should be short. A long name will be tedious to type, so don’t call you version control program my-awesome-version-control-program. Call it something short, such as avc (Awesome Version Control).

    $ my-aw<TAB>  # yay!

Personally, I am not a big fan of everyone using two or three-letter linux names for everything. We have a lot of options now for autocompletion and seeing what is available, so long names are no longer a talking point.

scott_s|14 years ago

  $ may-aw<TAB>
  my-awe-inspiring-tool my-awwwww-how-cute-program  my-awesome-version-control-prog
  my-aww-yeah-tool      my-awchoo-i-sneezed-program my-aw-long-your-names-are-grandma

Also, please don't make me hit the - character when I call your program.

icebraining|14 years ago

"GUIs also have the advantage when it comes to presenting and editing information that is by nature graphical. This includes photo manipulation and watching movies (I have always wanted a program that shows a movie in my terminal by converting it to ASCII art in real-time, that would be sweet)."

This is an interesting discussion - what is the CLI, exactly? My personal PDF viewer is Zathura[1], which is controlled like VIM. On the other hand, programs like Kismet run on a terminal, but have mouse controlled menus.

Personally, I feel that Zathura is a CLI application, even though it depends on X, because the interaction is done in a keyboard driven way with no graphical widgets.

[1]: http://pwmt.org/projects/zathura/

nitrogen|14 years ago

I have always wanted a program that shows a movie in my terminal by converting it to ASCII art in real-time, that would be sweet

For the benefit of the original author, mplayer and xine both have ASCII art output modes (mplayer -quiet -vo aa, aaxine).

a3_nm|14 years ago

True. I often use text-based applications because I can usually expect them to have a sensible and efficient interface which does not require the mouse. There are graphical applications which follow the same spirit, but they are pretty well hidden.

huhtenberg|14 years ago

(edit) There is a bit of terminology substitution going on in linked article. Command line interface is a shell. That's where one types the commands. Calling command options and arguments an interface may be technically correct, but it is not what is conventionally understood under a term of CLI.

---

Speaking from an experience writing CLIs for configuration-heavy embedded devices, the key design element of a functional CLI is a context-aware TAB expansion. This is what makes a CLI truly convenient for routine use.

For example, if a mysql shell was smarter, it would've been allowing this:

  > use p<tab><enter>                   expands to "use production"
  > show c<tab> s<tab><tab><enter>      expands to "show columns from secondary"
  > select * f<tab> s<tab><tab> ...     expands to "select * from secondary ..."

It would also be nice to make OS shell more aware of individual commands' options, and to allow for example:

  # ip addr <tab>        shows "ip addr add"
  # <tab>                shows "ip addr del"
  # <tab>                shows "ip addr"

This is possible through hardcoding these expansions into the shell, but that's not very elegant, is it? On the other hand allowing to integrate arbitrary commands with the shell in a generic way would require putting together some sort of interface/manifest contraption and it would most likely go against the very spirit of Unix simplicity. So catch 22 it is.

mcantor|14 years ago

I think the use of "interface" is sensible, if somewhat ambiguous on the surface. In this context, options & arguments are the interface to the utility, not the "CLI", which is the interface to the command line itself.

Context-aware tab completion is very doable in the unix world, actually; it's just a hassle on the part of the utility-writer. Have you ever noticed that if you add a new file "foo/bar/baz.py" to a mercurial repository and type "hg add fo<TAB>", it will autocomplete all the way to "foo/bar/baz.py", without stopping at each directory? This is because hg has overridden the default tab completion provided by bash and defined its own set of possibilities.

I wish it were easier to set up; I think if that were the case, we'd see a lot more utilities with spot-on tab completion.

lysol|14 years ago

If you switch to Postgres, psql (the commandline client) supports the features you're asking for. :)

technomancy|14 years ago

Command-line usability is at least 50% about tab completion.

asolove|14 years ago

zsh has an extensible command/expansion system much like what you are asking for, allowing users to implement plugins documenting how to autocomplete commands, options, and data types passed to options (files, yes/no, etc.). It is fantastical.

prostoalex|14 years ago

ESR's "The Art of Unix Programming" has a good section on consistent default of command line options

http://www.catb.org/esr/writings/taoup/html/ch10s05.html

jcr|14 years ago

You might want to fix this typo.

> Maybe it¿s just me, but I prefer to remotely control computers via SSH over VNC

If you're really running "SSH over VNC" then you're doing it wrong. ;)

> I have always wanted a program that shows a movie in my terminal by converting it to ASCII art in real-time, that would be sweet

man mplayer and look for the -vo flag which controls the video output mode/driver. Two common options for video output (-vo) are the 'aa' (ASCII Art) and 'caca' (Color Coded ASCII Art). There is a third, 'bl' ("blinkenlights") but it's hardware dependent.

alexis-d|14 years ago

I agree with most of the article, except the yes/no part. In fact I think it's better to do :

"Do you want to do this (Y/n)?"

Where the most common option is uppercase so I can just hit enter.

dmpatierno|14 years ago

The important point here is to at least require that <Enter> key, rather than just using getchar() or whatever else and continuing on unexpectedly.

supersillyus|14 years ago

I'm glad he mentioned the "Silence trumps noise" point. I like to call the idiom "no news is good news". Tell me only when I need to know something (like a failure); "-v" is always there if I need it. It feels uncomfortable to get no output at first, but once you get used to it there's much less to read.

It's rather like the Plan 9 convention of programs returning strings instead of ints when then terminate; an empty string means success, a non-empty string contains the error message. I wish other OSes had adopted that.

BrandonM|14 years ago

There is no silence vs noise problem. Send output to STDOUT and "noise" to STDERR. Use -q and -v when appropriate. I thought it was weird that the author created a false dichotomy between being informational and giving only output that can be easily processed.

njharman|14 years ago

Problem with "silence" and "no news" is they are indistinguishable from many failure modes.

I vastly prefer confirmation of action(s) as default and -q flag. At very lest there needs to be -v flag that provides confirmation of action(s)

alexholehouse|14 years ago

Additionally, for interactive CLI programs or suites of programs, consistency is key. Offer the same way to select options, confirm or deny information, input data etc.

I'm also working on a project where by entering "?" at any interactive section you're taken to an interactive help menu. From here you can query (amongst other things) the state of the program, something that isn't always clear when you're using command line software, as you can't have extra info somewhere in the corner or whatever.

salem|14 years ago

One cool feature of the juniper junos CLI is that all the CLI commands have an option for XML output, making parsing of the CLI output in scripts a little more sane.

shabble|14 years ago

There was a project mentioned on HN fairly recently (~weeks ago) about an extended pipe system, using JSON and a bunch of common schemas, as I recall.

Both http://acko.net/blog/on-termkit and http://blogs.perl.org/users/rocco_caputo/2011/05/apppipefilt... seem to fit what I was thinking of, but I'm sure there was something else.

Nice ideas, but the inertia of existing unix tools is going to be hard to overcome, and a system isn't much use until all your common tools support it (or at least don't break it).

rch|14 years ago

I've started adding an option to output a complete Galaxy tool config file for some CLI apps.

-- http://wiki.g2.bx.psu.edu/Admin/Tools/Tool%20Config%20Syntax

killerswan|14 years ago

I talked some folks into using JSON output for a tool, and am now in the midst of fixing both ends so that the output is valid JSON, instead of a newline delimited file with JSON on each line.......

FaceKicker|14 years ago

Non-interactive programs get the most attention in this article, while text-based user interfaces are barely covered at all.

That's disappointing, I was hoping I'd learn how vi, etc. worked from this, since I know nothing about writing command line interfaces other than input and output to the last column of the last line of the terminal. Does anyone know of a good article/introduction to this?

a3_nm|14 years ago

Besides the technical details, there are good practices which should be followed, but I don't know of any document which lists them. This is a pity, because though text-based interfaces are often designed in a much more efficient way than GUI interfaces, they are seldom consistent between themselves, and often get something wrong. A few common ones:

- Don't make the user reach for distant keys like escape or pagedown/pageup (also support ^N/^P or ^B/^F) or the arrow keys (also support hjkl), unless you really need to.

- For one-line text entry, support readline bindings (^W, ^U, ^Y, ^B, ^F, etc.)

- If you show a list, provide a way to search for an item rather than moving through the list item by item or page by page.

- Unless there is a good reason not to, spawn $PAGER to show text and $EDITOR to edit text.

- If there is a finite set of actions to choose from, provide one-key hotkeys for each one. Don't require unnecessary use of the control key. Optionally show the list of possible or common actions, but have an option to hide it and save screen space for users who don't need it anymore (like mutt does). Likewise, if there are several items that can get focus, provide hotkeys, don't require the user to Tab their way through all of them.

Obviously, there are more.

antoarts|14 years ago

If you want to know how to create programs like vi, you should check out ncurses (http://www.gnu.org/s/ncurses/, tutorial: http://tldp.org/HOWTO/NCURSES-Programming-HOWTO/)

munificent|14 years ago

Examples include cd, calling it with no arguments returns you to the home directory.

How did I never know that before?

a3_nm|14 years ago

Make sure you also know about cd - (go to previous directory) and cd foo bar in zsh (go to ${PWD/foo/bar}).

gnufied|14 years ago

> Silence trumps noise

How about, ec2-server-start -n 4 -t medium -i img-xxxg Which expands to start 4 medium instances using image img-xxxg. Will you prefer a long wait and then silently back on prompt or an indication of something happening?

I agree with what he is saying, but I think there is a hint of unfair generalization here.

> Naming your utility

Again small unix commands are like precious three letter domain names. Not always viable. Also, although most of single letter commands are free, one should avoid to name a CLI binary in single letter, because 1. users often type single letter stuff by mistake. 2. users use single letter aliases.

shadearg|14 years ago

> Will you prefer a long wait and then silently back on prompt or an indication of something happening?

This is what the -v, --verbose option is for. Without this flag, you should assume everything is operating as expected until you receive an error message or an exit status >0.

ojilles|14 years ago

The article is interesting up to a point, but what really struck me is that this blog claims to be written by a 15 year old. I think the topics and depth of the other articles on the blog spell great things for this guy's future!

rmccue|14 years ago

> Multi-letter options start with two hyphens, and each such argument must be separated with spaces.

That's something that's always annoyed me about screen; commands such as "wipe" are "screen -wipe"

lell|14 years ago

there's "find ./ -type d", also (find directories only.)

rhizome|14 years ago

After -h --help, I think the most essential option is -n --dry-run

uriel|14 years ago

See the paper by Rob Pike and Brian Kernighan 'Program Design in the UNIX Environment' (aka 'cat -v considered harmful'):

http://harmful.cat-v.org/cat-v/

80 comments