The shell sits in front of a lot of my work, but I mostly use it for the outcome: running unix commands and scripts, creating branches and making commits. Unlike when I'm writing code, I'm rarely thinking about how the shell itself works under the hood.
So, to dig a bit deeper into shells, I'm going to build a toy one until I run out of time. I have a fresh pot of filter coffee, and I'm awake three hours before everyone else.
A quick look ahead to everything I'm able to support by the end:
./andsh
andsh$ cd /
andsh$ pwd
/
andsh$ echo $HOME
/Users/andrew
andsh$ nosuchcommand
nosuchcommand: No such file or directory
andsh$ echo $?
127
andsh$ printf abc\n | tr a-z A-Z | rev
CBA
andsh$ ec hello
andsh$ echo hello
hello
andsh$
andsh$ echo hello
hello
andsh$ ^D
If you prefer reading C over prose, head straight to healeycodes/andsh.
A shell is an interactive program before it's a language implementation, and the user experience starts at the prompt. This first step is about building the interactive skeleton: print a prompt, read a line, keep a little state, and leave a clean place to plug execution logic into.
// repl.h
typedef struct {
    int last_status;
    int running;
    int interactive;
} Shell;
We also need the classic read-eval-print loop:
// repl.c
int shell_run(Shell *shell) {
    char *line = NULL;
    size_t capacity = 0;
    if (install_signal_handlers() != 0) {
        return 1;
    }
    while (shell->running) {
        int rc = read_line(&line, &capacity, shell);
        if (rc == 0) {
            break;
        }
        if (rc < 0) {
            free(line);
            return 1;
        }
        eval_line(shell, line);
    }
    free(line);
    return shell->last_status;
}
read_line returns three cases: got a line, hit EOF, or hit a real error.
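Here's a minimal sketch built on getline(3) (we'll swap in readline later); the real read_line in andsh handles more, but these three return values are the contract shell_run relies on:

// A sketch, not andsh's exact code: 1 = got a line, 0 = EOF, -1 = error.
static int read_line(char **line, size_t *capacity, Shell *shell) {
    ssize_t n;
    if (shell->interactive) {
        fputs("andsh$ ", stdout);
        fflush(stdout);
    }
    n = getline(line, capacity, stdin);
    if (n < 0) {
        return feof(stdin) ? 0 : -1;  // EOF vs. real error
    }
    if (n > 0 && (*line)[n - 1] == '\n') {
        (*line)[n - 1] = '\0';  // strip the trailing newline
    }
    return 1;  // got a line
}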
eval_line starts tiny: blank lines do nothing, exit stops the shell in-process, and everything else gets treated as an external command.
// inside eval_line
if (strcmp(argv[0], "exit") == 0) {
    shell->running = 0;
    free_argv(argv);
    return shell->last_status;
}
status = execute_external(shell, argv);
At the moment, we can run ls, but we can't run ls -l yet: the whole line is interpreted as a single command named "ls -l".
Before we add env var expansion and pipes, let's start by splitting a line on spaces and tabs so we can run simple foreground commands like echo hello world or ls -l.
The tokenizer will be intentionally incomplete: it won't handle quotes or redirections, but it will peel off | as syntax so we can grow into supporting pipelines later. It's still useful because Unix process APIs want argv (an argument vector, the values passed down to a program when it starts).
First, we need a way to split a line:
// repl.c
static char **tokenize_line(const char *line, int *count_out) {
    const char *p = line;
    char **words = NULL;
    size_t count = 0;
    size_t capacity = 0;
    while (*p != '\0') {
        while (isspace((unsigned char) *p)) {
            p++;
        }
        if (*p == '\0') {
            break;  // trailing whitespace
        }
        if (*p == '|') {
            push_word(&words, &count, &capacity, dup_range(p, 1));
            p++;
            continue;
        }
        // .. copy the next word up to whitespace or |
    }
    *count_out = (int) count;
    return words;
}
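The snippet leans on two helpers it doesn't show. Minimal sketches of what they might look like (the real versions in andsh may differ):

// Copy len bytes starting at start into a fresh NUL-terminated string.
static char *dup_range(const char *start, size_t len) {
    char *word = malloc(len + 1);
    if (word != NULL) {
        memcpy(word, start, len);
        word[len] = '\0';
    }
    return word;
}

// Append word to a growable, NULL-terminated array of strings.
static void push_word(char ***words, size_t *count, size_t *capacity, char *word) {
    if (*count + 2 > *capacity) {  // room for the new word plus the NULL terminator
        *capacity = *capacity == 0 ? 8 : *capacity * 2;
        *words = realloc(*words, *capacity * sizeof(char *));  // error handling elided
    }
    (*words)[(*count)++] = word;
    (*words)[*count] = NULL;
}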
Which we can call inside our fledgling eval_line function to get a stream of shell words before we group them into commands.
// inside eval_line
if (line_is_blank(line)) {
    return 0;
}
words = tokenize_line(line, &word_count);
if (word_count == 0) {
    free_words(words);
    return 0;
}
A shell can't replace itself with the command it's launching (otherwise the shell would cease to exist after running that command), so it must create a child process to run the command and wait for it to finish.
The parent shell stays alive and the child process becomes the command.
execvp is a convenient call from the exec family here. It searches PATH and replaces the current process with a new program, using the current process environment.
waitpid gives control back to the shell after the command exits.
// repl.c
pid = fork();
if (pid == 0) {
    execvp(argv[0], argv);
    perror(argv[0]);
    // 127: command not found, 126: found but not executable / cannot invoke
    _exit(errno == ENOENT ? 127 : 126);
}
// ..
while (waitpid(pid, &status, 0) < 0) {
    if (errno != EINTR) {
        perror("waitpid");
        shell->last_status = 1;
        return shell->last_status;
    }
}
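The snippet stops before turning the raw status into $?. That last step is a pair of standard macros (a sketch; the 128 + signal-number convention is my assumption about what andsh does, borrowed from mainstream shells):

// Convert waitpid's raw status into the shell's $? value.
if (WIFEXITED(status)) {
    shell->last_status = WEXITSTATUS(status);     // normal exit code
} else if (WIFSIGNALED(status)) {
    shell->last_status = 128 + WTERMSIG(status);  // killed by a signal (common shell convention)
}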
The child uses _exit to avoid running parent-inherited libc cleanup in the forked child, like flushing stdio buffers a second time, which can lead to duplicated output and other unintended side effects.
One shell-y detail I wanted to keep was the interrupted wait path. Retrying on EINTR keeps the shell from losing track of a child process when the terminal sends an interrupt.
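The post doesn't show install_signal_handlers. One plausible minimal version (a sketch; andsh's real handlers may differ) installs a no-op SIGINT handler so ^C doesn't kill the shell itself:

// Assumed sketch: a no-op handler keeps the shell alive on ^C. Because we
// don't set SA_RESTART, a ^C during waitpid surfaces as EINTR, which the
// retry loop above absorbs. execvp resets handled signals to their defaults,
// so child processes still die on ^C as usual.
static void on_sigint(int sig) {
    (void) sig;
}

static int install_signal_handlers(void) {
    struct sigaction sa;
    memset(&sa, 0, sizeof(sa));
    sa.sa_handler = on_sigint;
    sigemptyset(&sa.sa_mask);
    sa.sa_flags = 0;
    return sigaction(SIGINT, &sa, NULL);
}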
Now we can do real shell things:
./andsh
andsh$ echo hello world
hello world
andsh$ pwd
/Users/andrew/Documents/experiments/andsh
andsh$ ls -l
total 160
-rw-r--r-- 1 andrew staff 194 14 Mar 08:10 Makefile
drwxr-xr-x 7 andrew staff 224 14 Mar 14:24 src
andsh$ ^D
For the process/system call stuff, C is great for writing toy shells. The downsides are things like splitting a line (managing dynamic memory), and later, adding more shell syntax (string lifetimes).
One of the core shell rules is that some commands can't run in a child process. For example, if the shell forks and a child calls chdir then only the child changes directories; when the child exits, the parent shell is still in the old directory.
This is why cd has to be a builtin.
// inside try_builtin
if (strcmp(command->argv[0], "cd") == 0) {
    return run_builtin_cd(shell, command);
}
Something I learned for this post is that HOME is the conventional default target when running a lone cd.
static int run_builtin_cd(Shell *shell, Command *command) {
    const char *target = command->argc == 1 ? getenv("HOME") : command->argv[1];
    if (target == NULL || chdir(target) != 0) {  // HOME may be unset
        perror("cd");
        shell->last_status = 1;
        return shell->last_status;
    }
    shell->last_status = 0;
    return 0;
}
Because run_builtin_cd runs inside the shell process, the next prompt sees the new directory.
Before running a command, the shell rewrites parts of the input line. There are a few syntax rules and ordering details here, but for my toy shell I'm just adding env var expansion.
echo $HOME shouldn't print $HOME, it should print /Users/andrew.
I'm just hacking this in. Only whole-word $NAME expansion. No quotes, no ${NAME}, and no splitting rules.
static char *expand_word(const Shell *shell, const char *word) {
    const char *value;
    if (strcmp(word, "$?") == 0) {
        char status[32];
        snprintf(status, sizeof(status), "%d", shell->last_status);
        return strdup(status);
    }
    if (word[0] != '$' || word[1] == '\0') {
        return strdup(word);
    }
    // .. look up NAME in the environment
    value = getenv(word + 1);
    if (value == NULL) {
        // Unset variables expand to the empty string in this toy shell.
        return strdup("");
    }
    return strdup(value);
}
Expansion happens after tokenization but before execution. And | is syntax, not data, so we don't try to expand it:
for (i = 0; words[i] != NULL; i++) {
    char *expanded;
    if (strcmp(words[i], "|") == 0) {
        continue;
    }
    expanded = expand_word(shell, words[i]);
    free(words[i]);
    words[i] = expanded;
}
We expand token by token, keeping it simple and skipping a real parser.
The special case for $? is also nice to leave in the code because it's one of those tiny shell details that makes the prompt feel less fake.
A pipe (|) is a kernel buffer with one process writing bytes in and another reading bytes out.
cmd1 | cmd2 connects the stdout of the left command to the stdin of the right command. For N commands, you need N - 1 pipes.
The heavy lifting here will be done by pipe(), which creates a one-way channel for interprocess communication. pipe() fills the array pipefd with two file descriptors. pipefd[0] is the read end, and pipefd[1] is the write end. Data written to the write end is buffered by the kernel until it is read from the read end.
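As a standalone illustration of that API (not code from andsh), a parent can write into one end of a pipe and let a forked child read from the other:

// Minimal pipe(2) demo: the parent writes into pipefd[1];
// the child reads the bytes back out of pipefd[0].
#include <stdio.h>
#include <string.h>
#include <sys/wait.h>
#include <unistd.h>

int main(void) {
    int pipefd[2];
    char buf[16] = {0};
    if (pipe(pipefd) != 0) {
        perror("pipe");
        return 1;
    }
    if (fork() == 0) {
        close(pipefd[1]);                      // child only reads
        read(pipefd[0], buf, sizeof(buf) - 1);
        printf("child read: %s\n", buf);
        _exit(0);
    }
    close(pipefd[0]);                          // parent only writes
    write(pipefd[1], "hello", strlen("hello"));
    close(pipefd[1]);                          // signals EOF to the reader
    wait(NULL);
    return 0;
}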
The core pipe loop runs once per command in the pipeline. Each iteration may create one new pipe for the next command. prev_read is the read end carried forward from the previous iteration.
for (i = 0; i < pipeline->count; i++) {
    int pipefd[2] = {-1, -1};
    if (i + 1 < pipeline->count) {
        pipe(pipefd);
    }
    pid = fork();
    if (pid == 0) {
        // .. hook this command up to prev_read / pipefd
        execvp(pipeline->commands[i].argv[0], pipeline->commands[i].argv);
    }
    // .. parent closes what it doesn't need, then carries read end forward
}
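The parent-side bookkeeping that last comment elides might look like this sketch (the exact code in andsh may differ):

// Parent side of each iteration: close the ends this process no longer
// needs, then carry the new read end forward to the next command.
if (prev_read != -1) {
    close(prev_read);      // the previous pipe's read end now lives in a child
}
if (pipefd[1] != -1) {
    close(pipefd[1]);      // only the child writes to this pipe
}
prev_read = pipefd[0];     // next command reads from here (-1 for the last one)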
E.g. printf abc | tr a-z A-Z | rev needs two pipes: one carrying printf's output into tr, and one carrying tr's output into rev.
dup2 lets normal programs work in a pipeline without knowing about the shell. Programs read from stdin and write to stdout. The shell creates a pipe and uses dup2 to connect these streams.
Below, dup2(prev_read, STDIN_FILENO); makes the process read from the previous pipe instead of stdin, and dup2(pipefd[1], STDOUT_FILENO); makes its output go into the next pipe instead of the shell prompt. Because the program still reads from stdin and writes to stdout as usual, it works in the pipeline without any special logic.
if (pid == 0) {
    if (prev_read != -1) {
        dup2(prev_read, STDIN_FILENO);
    }
    if (pipefd[1] != -1) {
        dup2(pipefd[1], STDOUT_FILENO);
    }
    if (prev_read != -1) {
        close(prev_read);
    }
    if (pipefd[0] != -1) {
        close(pipefd[0]);
        close(pipefd[1]);
    }
}
And now, this shell's demo is looking a little more complete:
./andsh
andsh$ cd /
andsh$ pwd
/
andsh$ echo $HOME
/Users/andrew
andsh$ nosuchcommand
nosuchcommand: No such file or directory
andsh$ echo $?
127
andsh$ printf abc | tr a-z A-Z | rev
CBA
andsh$ ^D
To recap a bit, I'll step through what happens when ls $HOME | grep foo is entered.
It's tokenized into ["ls", "$HOME", "|", "grep", "foo"].
Then expanded into ["ls", "/Users/andrew", "|", "grep", "foo"].
The flat token list is separated into structured pipeline commands:
["ls", "/Users/andrew"]["grep", "foo"]The shell creates a pipe to connect the output of ls to the input of grep.
The child commands start via fork, and execvp swaps them into the target programs.
By the wonderful design of Unix: if grep reads faster than ls writes, it blocks waiting for more data; if ls writes faster than grep reads, the pipe buffer fills and ls temporarily blocks. This synchronization happens automatically through the pipe, while the shell simply waits for both child processes to finish.
The output of grep isn't connected to a pipe as it's the last command, and any results are displayed to the user.
Even though our little REPL runs commands, expands env vars, and builds pipes, the interaction still feels rough. Left and right arrows do not magically work just because you're in a terminal. The terminal just sends escape sequences like ^[[D.
At the moment, trying to move left and fix a typo ends up looking like this:
./andsh
andsh$ echo ac^[[Db
ac^[[Db
Up to now, getline has been reading bytes just fine. Now we need something to sit in the middle and give us line editing, history, and completion. One answer is the readline library.
The outcome of calling it is a line to evaluate, and history that can be walked later:
// inside read_line
if (shell->interactive) {
    free(*line);
    *line = readline("andsh$ "); // <--
    if (*line == NULL) {
        fputc('\n', stdout);
        return 0;
    }
    if ((*line)[0] != '\0') {
        add_history(*line);
    }
    return 1;
}
There's also a little setup for tab completion and history:
// inside shell_init
if (shell->interactive) {
    rl_readline_name = "andsh";
    rl_catch_signals = 0;
    // Plug in tab completion
    rl_attempted_completion_function = shell_completion;
    // We don't need special paste handling
    rl_variable_bind("enable-bracketed-paste", "off");
    // History support
    using_history();
}
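shell_completion itself isn't shown in the post; a minimal version, assuming it just delegates to the generator described below via readline's rl_completion_matches, could be:

// Assumed glue: hand the word being completed to our generator and let
// readline collect the matches. start/end are offsets into rl_line_buffer.
static char **shell_completion(const char *text, int start, int end) {
    (void) start;
    (void) end;
    rl_attempted_completion_over = 1;  // skip readline's default filename completion
    return rl_completion_matches(text, completion_generator);
}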
The API for tab completion involves providing a generator that can cycle through the different matching options.
static char *completion_generator(const char *text, int state) {
    if (state == 0) {
        // Initial setup: `text` is e.g. `ech`
        if (build_completion_matches(text) != 0) {
            free_completion_matches();
            return NULL;
        }
    }
    if (g_completion_index >= g_completion_count) {
        return NULL;
    }
    return strdup(g_completion_matches[g_completion_index++]);
}
readline calls the generator repeatedly until it returns NULL. When state == 0, we set up the generator by building all the completion matches. After that, the generator hands matches back one at a time (e.g. each time the user presses tab, or all at once for tab-tab-Y).
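Those matches live in a little global state shared between the builder and the generator (names taken from the snippet above; the real definitions in andsh may differ):

// Assumed global match state backing the generator.
static char **g_completion_matches = NULL;
static size_t g_completion_count = 0;
static size_t g_completion_index = 0;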
So, we need a function that takes text (partial text) and returns a list of matches. I've chosen to scan the current directory (.) for files, followed by all the $PATH directories; returning any files that start with the partial text.
static int build_completion_matches(const char *text) {
    const char *path;
    free_completion_matches();
    // Inside this function, we're calling `readdir`, `starts_with`,
    // and `add_completion_match` for anything we consider a match.
    if (collect_matches_from_dir(".", text, 0) != 0) {
        return -1;
    }
    path = getenv("PATH");
    // .. split PATH on ':' and scan each directory
}
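The elided PATH walk might look like this sketch (strtok needs a writable copy of PATH, hence the strdup; the meaning of collect_matches_from_dir's third argument is my assumption, not something the post specifies):

// Sketch of the elided PATH scan: copy PATH, split it on ':', and
// collect matching entries from each directory.
char *path_copy = strdup(path != NULL ? path : "");
for (char *dir = strtok(path_copy, ":"); dir != NULL; dir = strtok(NULL, ":")) {
    collect_matches_from_dir(dir, text, 1);  // 1: assumed "PATH directory" flag
}
free(path_copy);
return 0;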
Adding basic tab completion like this really makes me consider the performance implications of shells. I didn't know that some shells might make hundreds of system calls around each prompt to figure out things like completion options.
The final demo:
andsh$ unam
andsh$ uname
andsh$ Makef
andsh$ Makefile
andsh$ echo hello
hello
andsh$
andsh$ echo hello
A lot is missing, but andsh is usable enough. It could handle maybe 50% of my shell use cases: launching programs, some git commands, and basic pipes into grep.
Still, it's very small and incomplete. No quoting is a big one; echo "hello world" is where some people would start when implementing a shell, but ... I've written a lot of parsers on this blog already. There's no redirection, so <, >, and >> don't work. Builtins are also minimal, and I only handle them as standalone commands.
Redirection would add more file descriptor plumbing to execution, and quoting would force the tokenizer to become a real shell lexer.
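As a taste of that plumbing, output redirection for > would be another open-plus-dup2 dance in the forked child (a sketch, not code from andsh):

// Sketch: redirecting stdout for `cmd > out.txt` in the forked child,
// before execvp replaces the process image.
int fd = open("out.txt", O_WRONLY | O_CREAT | O_TRUNC, 0644);
if (fd < 0) {
    perror("open");
    _exit(1);
}
dup2(fd, STDOUT_FILENO);  // stdout now points at the file
close(fd);                // the dup keeps the file open
execvp(argv[0], argv);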
I think my biggest learning was the low-level process APIs shells use under the hood. I don't often work directly with calls like execvp and dup2.
Read the code at healeycodes/andsh, and send me your terminal and shell projects pls.