How To Name Things

swyx 2019-05-16

There are 2 hard problems in computer science: cache invalidation, naming things, and off-by-1 errors. - Leon Bambrick

I’ve vacillated on my opinion on naming things (this post mainly deals with naming things in code, though I end with other resources on naming everything). I think most people start out with no or weak opinions, looking slightly askance at the weirdos who do have strong opinions. They absorb naming conventions by osmosis, and then run into real problems at scale/over time, and then develop extremely strong opinions informed by that experience.

The Weirdo’s Journey.

I’ve given this essay a slightly clickbaity title. Spoiler: I’m not going to solve the problem of naming things today. All I hope to do is describe some opinions I’ve formed from my experience in Python and JS, list some considerations, invite you to share yours, and suggest you have this debate on your team.

Use AI! [Sept 2022 Edit]

Not Naming Things [Aug 2019 Edit]

One option people sometimes forget they have at their disposal is to just not name things where possible. I have a couple examples for you.

Example 1: Not giving different names at module and function boundaries

Mind your “name stack”. This is the number of names you have to keep in your head as you read code.

You can name the same thing 8 different ways at boundaries and hate life when you have to refactor or grep your own code:

// index.js
const grault = require('./corge')
const foo = grault('baz')

// corge.js
export default function doBar(qux) {
  let quux = parse(qux)
  return quux
}

or just 2 ways and hate life less:

// index.js
const { getFoo } = require('./getFoo')
const foo = getFoo('bar')

// getFoo.js
export function getFoo(bar) {
  return parse(bar)
}

Example 2: Using Tooling to autogenerate names

Instead of naming a title in CSS and then also a <Title className="title"> in React, opening yourself up to global conflicts and subsequent refactoring, you could choose to use either a CSS Module or CSS in JS approach to scope and manage them together. Credit for this idea comes from Max Stoiber.

Notable Exception: Kyle Simpson famously does not use => function syntax, preferring explicit function declaration, because he wants to avoid anonymous functions in the stack trace. That is his prerogative, but I don’t think this is a battle worth fighting.

Heroku-like autogenerated app names

Example: "rough-snowflake-1142", "nameless-star-13", "cool.leaf.6743"

Tools

https://github.com/usmanbashir/haikunator Generate Heroku-like memorable random names to use in your apps or anywhere else.
https://en.wikipedia.org/wiki/PGP_word_list
https://github.com/moby/moby/blob/c90254c7464cac5c56e7ab9e6b1857c119d5d263/pkg/namesgenerator/names-generator.go

Probably Bad Names

Inherent in having any opinion on naming things is some intuition that some names are worse than others.

This can feel a bit silly in languages where naming has no impact on program behavior, especially in JavaScript where everything gets minified. In that sense, naming is bikeshedding.

But code is not just written for correctness, it is also written for other humans to read (and maintain). In a strong form of Sapir Whorf, what you name a thing can totally shape and artificially limit your creativity. In that sense, naming is -not- bikeshedding.

And yes, I’ve unironically been in standups where we bikeshedded on whether something was bikeshedding. The rabbit hole goes deep.

I’ll motivate the discussion with some examples:

Metasyntactic names, the “lorem ipsum” of code: foo, bar, baz. This isn’t always wrong, especially when the name is meant to be a placeholder. You’re not likely to see these in actual code. But you might.
Vague names: thething, that, someObject. Everything’s a thing. that is no more descriptive than this. In JS, everything’s an object. So what?
Too short, likely overloaded names: id, name, url. There’s nothing inherently wrong with these, but often you need more than one of these. So you start with one id in your code, and then later on have an product.id, then a user.id, and pretty soon its no longer clear what id means. It is then harder and harder to grep and rename names in your code. This is especially important when the language allows shadowing (ahem JS). Probably my most controversial, and recent, opinion. Always ask yourself: “What do I do if there is more than one of this name?”
Overly Long names: >30 characters is pushing it IMO. You can namespace names inside a dict/object. (see below)
Scary Technical names: ModifiedApplicativeFunctor. As much as this makes sense to you, it has to make sense to the next person. Again, if you’re on a team that all shares your context, go ahead. But at least pause to consider if they do.
Nonconventional names: Naming conventions don’t exist in a vacuum. If everyone in a community does import React from 'react' and you do import Bunny from 'react' because you thought it would be a fun idea… it loses its fun quickly. More seriously, you can establish convenient aliases for names and concepts, but be careful that your code becomes an unreadable mess of custom convention.

Name Pollution

It is possible to have too much of a good thing! Even if all names technically fit whatever guidelines you choose, it is still possible to have way too many names. Every new name demands more space in your working memory. One very pervasive way this happens is when names cross file and module boundaries:

styleInjection.js has only one export.
That export is a function, which is named genStylesCode because that’s what it does.
A different file imports styleInjection.js and calls it styleInjector because that’s what it uses it for.
styleInjection.js isn’t imported anywhere else, it isn’t a reusable utility.

This was adapted from real code in a popular framework. Here we end up with 3 different names for the exact same thing. Triple the bikeshedding. As Joe Fiorini puts it, name files after their default export, or even better, don’t have a default export, and still name the file after the “main” export anyway.

Controversial Names

Not all names are obviously bad, even though they may seem bad to you.

Single Letter Names: You may dislike the TypeScript community using T, U, or V for generic type variables, but that does genuinely reflect the mathematical/set theory framing of the type system, and emphasize the genericness of the type variable. You may dislike using e for errors or for events, but if its usage is scoped, the impact really is very small and not worth arguing over. However, non-descriptive abbreviations that show up in errors seen by end users and your library consumers are bad news. Other forms of abbreviations may or may not be worth banning, check this ESLint rule for ideas.
Plurals vs Arrays: You can have names variable be an array of names, or a nameArray, which is more verbose and explicit but less aesthetic. Don’t choose lazy pluralization and beware substrings.
Block, Element, Modifier: BEM was wildly popular for a reason - the global nature of CSS - but scoping methods have evolved a lot since then and BEM is far less necessary than used to be. It is also, to put it mildly, verbose.

Probably Good Ideas and their Considerations

Having dealt with the easy stuff above, I’m now on much more equivocal territory. Here we deal with some considerations you may want to think about in forming your naming guidelines and where I stand:

Encoding Types: In dynamically typed languages it can be helpful to give a hint as to what the variable is. But even in statically typed languages it can help give hints to the reader, especially where type inference means there isn’t explicit annotation at every step:
- Is it an array? I mentioned this is controversial, and I’ve gone both ways between names and nameArray. But I do like giving a hint that something is an array.
- Is it nullable? I -really- like this for JavaScript, and because I have done some Haskell I often inject a monad, e.g. maybeResult. This reminds me to check if the result is falsy first. However, be warned that this can often not be the right choice for variables that can have more than two states, e.g. undefined | Error | Success. Pick a name that reflects the true nature of the concept.
- Is it sync? A similar monadic hint. The Node-style convention where the default, shorter name is async and the blocking, synchronous version has the longer name is a good idea, especially because asynchrony tends to be introduced and spread through codebases later on. Since you probably want to write async code wherever possible, let’s make that the more concise name.
- Is it a boolean? I do like boolean verb prefixes: isDone, hasProperty, didChange over done, !!object[property], changed. Here is an ESLint rule for that.. Daniel Lo Nigro mentions that banning inverse booleans also seems like a good idea - notDone, noHeaders - to avoid double negatives - but I haven’t personally done that yet.
- Is it an important enum or constant? use SCREAMING_CASE, e.g. DISPLAY_MODE_NONE, DISPLAY_MODE_INLINE, DISPLAY_MODE_BLOCK. Often used in Redux action constants, and environment variables.
- Is it an internal variable? This one I like a lot - if the variable is not meant to be exposed, it can often help to prefix _internal variables, especially if you are mirroring an argument just for mutability in order to output it again.
- Not just for “type system” types: In the mailing list preview, Massimiliano wrote in with an outstanding pointer to Joel Spolsky’s Making Wrong Code Look Wrong, which advocates the original idea behind Hungarian notation, which encodes types in names far beyond what normal types can cover, reflecting ideas like “string safety” and “width” and “index” and so on. A strong recommend!
- (2022 Edit) - Put Units in Names! sleep(300) means anywhere from 0.3ms to 5 minutes depending on what language you are using.
Filenames: We already discussed crossing file and module boundaries above. Jonathan Johnson also mentions that dates should come first in YYYY-MM-DD format. Camelcasing filenames can be a footgun because require() in macOS is is case insensitive but Unix is not.
Namespacing: We all agree descriptive names are better, but also that names that are too long are bad. One way to break this knot is by various namespacing strategies. Use your language’s module system and data structures when naming convention fails you. For example, break up a collection of longish names like DISPLAY_MODE_NONE, DISPLAY_MODE_INLINE, DISPLAY_MODE_BLOCK into a displayModes dict or enum that you can access, like displayModes.NONE. It doesnt have to just be variables, it can be functions too.
Grammar: One of the most impactful naming decisions documented was in the React lifecycle naming, which established a grammar of Concepts, Actions and Operands to help make lifecycles easier to remember. For CLI’s, Heroku insists that topics are plural nouns and commands are verbs in their CLI Style Guide. Your users will very quickly learn your grammar and that is a fantastic way to communicate and structure your public API.
Casing policy: Svelte uses snake_case rather than camelCase because of research showing 20% higher cognitive load (source)

Sidenote: Naming is a subdiscipline of a broader art I call “API Design” - a very important and difficult-to-study topic I hope to one day write about.

As usual, it is possible to take good ideas too far - encoding types into EVERYTHING and being concise leads you to the commonly misused form of Hungarian Notation, which nobody likes.

The Cost of Enforcement

I do have a strong opinion that naming opinions should be breakable guidelines rather than strict rules. If you are spending more than 30 seconds discussing a name in a code review, and opinions differ, its probably not worth further debate. Your team’s time is valuable and this costs more the bigger your team is. (Although if someone comes up with a name that everyone agrees better fits the concept/domain, then that is a great use of time!)

But wait, what about code standards? Without constant vigilance, my codebase will descend into a pit of chaos!

Well, first of all, nice to see that you trust your colleagues that much.

Second, whatever can’t be automated can’t be enforced. Code reviews cost. Human code review will have inconsistencies. The person who nitpicks names all the time will either be resented or joked about because they don’t see the bigger picture. It just never ends well. Don’t be the bad cop - let the machine do it.

The base level is trusting in syntax and tests - if the code is valid and works as advertised, you very likely already have bigger problems you should pay attention to. The next level is autoformatting (prettier for JS, black for python) and linting where you write or adopt code that looks at your AST and enforces simple naming rules. Be careful: Overly eager linting is a problem.

As Nick Shrock says: Delegate to Tooling Whenever Possible. His advice on Code Reviews is worth a full read here. Importantly: the goal of a code review is not to make it so that the code looks as if you wrote it. Internalize that.

Sindre Sorhus has some strong opinions on naming. You may not agree with all of them, but at least they are enforced in code. Check eslint-plugin-unicorn.

Domain Driven Design

(Aug 2019 Update) I was fortunate enough to attend a workshop by Andrew Cassell on Domain Driven Design (slides) where the concept of “Ubiquitous Language” drives naming and I really like this concept. However some of the application examples I’ve seen bleed the domain all over the place whereas I really only think it matters most at the public API.

Collections of Things

(Aug 2019 Update) Don’t pluralize lazily, e.g. blog.js and blogs.js. This is terrible to grep especially with one name being a substring of the other. Prefer to name both items and collections visibly. This is similar to the Hungarian notation idea, but works even if you use a type system. Tweet

2021 Edit: see also my popular post on Premature Pluralization

When all else fails… who writes the code?

If you’re still spending a lot of organizational energy bickering over a name… remember this story from Bret Taylor about how Google Maps’ Satellite Mode was almost named Bird Mode.

Code Complete [Nov 2019 Edit]

The volumninous Code Complete offers an entire chapter on the Power of Variable Names. This has a lot of good advice. Here are some nice examples pulled from the book:

Name length:
- Too long: numberOfPeopleOnTheUsOlympicTeam, numberOfSeatsInTheStadium, maximumNumberOfPointsInModernOlympics
- Too short: n, np, ntm
- Just right: numTeamMembers, teamMemberCount, numSeatsInStadium, seatCount, teamPointsMax, pointsRecord
“longer names are better for rarely used variables or global variables and shorter names are better for local variables or loop variables”
Using Common Opposites in Variable Names e.g. begin/end, first/last, locked/unlocked , min/max, next/previous
Loop Indexes: i, j, k are fine inside a loop. if used outside the loop, be more descriptive, e.g. recordCount
Status Variables: dont use flags. use enums, descriptive naming
- Bad: flag = 0x1; statusFlag = 0x80; printFlag = 16; computeFlag = 0;
- Better: dataReady = true; characterType = CONTROL_CHARACTER; reportType = ReportType_Annual; recalcNeeded = false;
Useful Boolean names should imply true or false:
done: Use done to indicate whether something is done. The variable can indi- cate whether a loop is done or some other operation is done. Set done to false before something is done, and set it to true when something is completed.
error: Use error to indicate that an error has occurred. Set the variable to false when no error has occurred and to true when an error has occurred.
found: Use found to indicate whether a value has been found. Set found to false when the value has not been found and to true once the value has been found. Use found when searching an array for a value, a file for an employee ID, a list of paychecks for a certain paycheck amount, and so on.
successor ok: Use success or ok to indicate whether an operation has been suc- cessful. Set the variable to false when an operation has failed and to true when an operation has succeeded. If you can, replace success with a more specific name that describes precisely what it means to be successful. If the program is success- ful when processing is complete, you might use processingComplete instead. If the program is successful when a value is found, you might use found instead.
Bad: status, sourceFile
Better: statusOK, sourceFileAvailable
acceptable - is* prefix. doesnt work for everything
stay positive - avoid double negatives! if (!notFound)
When You Should Have a Naming Convention
- When multiple programmers are working on a project
- When you plan to turn a program over to another programmer for modifica- tions and maintenance (which is nearly always)
- When your programs are reviewed by other programmers in your organization
- When your program is so large that you can’t hold the whole thing in your brain at once and must think about it in pieces
- When the program will be long-lived enough that you might put it aside for a few weeks or months before working on it again
- When you have a lot of unusual terms that are common on a project and want to have standard terms or abbreviations to use in coding

Naming Content

https://www.creativeelements.fm/jake-thomas/
- focus on clickworthy emotions:
  - Curiosity, Fear, Desire
  - Curiosity + Fear, Curiosity + Desire
  - more:
    1. Curiosity
    2. Desire
    3. Negativity
    4. List
    5. Trendjacking
    6. Authority
    7. Time fame
    8. Calling out a specific audience (like beginners or by age)
    9. Refute objection
    10. X vs. Y
- adjectives that call
  - How to X for Beginners
  - Never X Again - How to Y the Easy Way
  - I Stopped X for Y days and Here’s What Happened
  - These X will change your life
- https://creatorhookspro.com/ https://twitter.com/jthomas__/status/1547706326058340353?s=20
  - 47% of titles have a number (can include current year)
  - 1/ Open a loop ex: “Start Doing This and Never Be POOR or BROKE Again”
  - 2/ Reveal a secret ex: “The Secret Money Saving Rule I Learned in Japan”
  - 3/ Ask a question ex: “Do Mom Cats Miss Their Kittens After Adoption?”

Naming Startups

Naming Startups/Domains: https://www.saashub.com/domainsfortherestofus-alternatives
- “If you have a US startup called X and you don’t have x.com, you should probably change your name.” Paul Graham
- Derek Sivers: https://sive.rs/com with a programmatic approach to find unclaimed domains
  - some good alternatives https://news.ycombinator.com/item?id=31665165
  - https://3sname.com/
- Laura Roeder on meetedgar: https://lauraroeder.com/how-i-nabbed-the-com-for-my-bootstrapped-startup-without-spending-a-million-bucks-6dc35c4606e9
  - gettting resend.com for 25k https://resend.com/blog/how-to-pick-a-startup-name
  - https://github.com/swyxio/buying-domains/
- https://dotcomagain.com/ for buying recently dropped domains
Stripe’s naming story: buy the .com
See The Genius of Apple’s Name
https://www.shopify.com/tools/business-name-generator
nice thread here on the process for Atlan.com https://twitter.com/prukalpa/status/1482727402614702081

More References

ztellman’s Elements of Clojure book has a free chapter on naming: naming data, functions, and macros.
Naming Components: https://open-ui.org/analysis/component-matrix
Naming Git Branches: https://deepsource.io/blog/git-branch-naming-conventions/
Naming Products/Books/Concepts: The Naming Book (site, podcast)

steps to name things
1. criteria for success - eg feelings
2. Generation
  - Compound Words (Facebook)
  - Portmanteaus (Pinterest)
  - Foreign Translations
  - Obscure words (Nike - greek goddess)

Your Opinion Here!

I asked for more opinions on Twitter, and here are some I got:

Dan Abramov: Longer names to discourage use - for context, React uses this a lot in dangerouslySetInnerHTML and more subtly in getDerivedStateFromProps and most famously in DO_NOT_USE_OR YOU_WILL_BE_FIRED
Jamie Wong: The Grep Test
Chris Biscardi’s post on Styles and Naming
Ivan Babak: Use Context-independent names
b_sted: Don’t camelcase filenames is a footgun because require() in macOS is is case insensitive but Unix is not
Danny Eck: Mark unstable, sync and unsafe code!
Ersagun Kuruca: More bad names: script, callback, data, object, value, event, number, list
Matthew Weeks: Keep it Simple but Descriptive
Eric Bischard recommends a very great talk by Kevlin Henney: “Giving Code a Good Name”.
VGR: https://www.ribbonfarm.com/2012/02/02/how-to-name-things/

Last but not least, in the mailing list preview, Massimiliano also recommended Joel Spolsky’s Making Wrong Code Look Wrong, which I can’t help but recommend again.

2023 addendum: letting your users name themselves

Discord recently went thru a painful rename: https://support.discord.com/hc/en-us/articles/12620128861463

How To Name Things

Latest Posts