About & methodology

Disclaimer

An unofficial fan index — not affiliated with or endorsed by Scott Alexander. Posts are © their respective authors; this site links to the originals and publishes only its own summaries and scores.

Site author’s view

The world would be much better if we all read more Scott Alexander. Yet, 1) we may not be convinced Scott Alexander is worth reading yet. 2) We don’t have the time to read his entire corpus.

It is therefore helpful to have a guide that can point us to his most important works, and if there is an area of specific interest, explore that.

Goal

With too much for any of us (unless we’re some historical scholar), a review and ranking, indexed, sortable and filterable by Claude (currently this review is built on Opus 4.8), is helpful to that endeavor.

What it is not

This is not designed to be a critique of his work. Although Claude, based on the site author’s parameters, has given values to the quality of the writing, its paradigm shift to Claude (no proof that’s even remotely correct introspection), and real-world impact — don’t put too much stock in it. We humans need some heuristic to easily make our next reading decision.

What I hope it is

#3 is probably, on the whole, going to impact you more than #23, and almost certainly more than #230. This is for interest’s sake, and to help you more quickly find great essays!

Scope

558 posts have been scored so far (Astral Codex Ten 227 · Slate Star Codex 198 · LessWrong 113 · LiveJournal 17 · ai-2027.com 1 · raikoth.net 1 · Unsong 1). This is a work in progress — the index grows as more of the archive is scored, and is not yet a complete catalogue.

Inclusion & exclusion

The aim is to cover Scott’s substantive standalone writing across his venues — Slate Star Codex and Astral Codex Ten, his earlier LessWrong posts (written under the pen name Yvain) and LiveJournal posts, and a few pieces published elsewhere. In practice that means his original essays, book reviews, fiction, and long “much more than you wanted to know” investigations — his own posts only, not his comments or replies.

Deliberately left out are posts that aren’t standalone works: open threads, meetup and housekeeping announcements, link round-ups, and comment-highlight digests. A small number of guest posts (for instance, Book Review Contest entries) are included and clearly flagged as guest.

Coverage is a work in progress and is not yet the complete archive; more is added as it is scored. The selection was carried out by Claude following these criteria, so a post may occasionally be wrongly included or overlooked.

How scoring works

Every post gets two independent 1–100 scores, judged against fixed anchor bands. They are never blended; the ranking sorts by quality.

Quality

Impact, insight, reframing and explanatory power, as the piece reads today, including its real-world significance.

Score	Label
90–100	Landmark
75–89	Excellent
60–74	Strong
40–59	Solid
20–39	Minor
1–19	Negligible

Claude’s paradigm shift at publication

How novel and disruptive the ideas were in their historical context to Claude, judged in the post’s own time. The goal is to measure how much the article would have shifted Claude’s beliefs, taking into account that much of it is already in Claude’s training data. Does this actually work? Who knows; Claude said it was reasonable. LOL. Early landmarks whose ideas are now mainstream still score high here; they are not penalised for having won.

Score	Label
90–100	Paradigm-defining
75–89	Major shift
60–74	Notable shift
40–59	Moderate
20–39	Slight
1–19	None

RWI — Real-world impact (0–10)

A separate measure of concrete change in the material world and in ordinary lives (laws, institutions, professional practice, widely-adopted tools). Whether that change reached a broad population rather than influence confined to a narrow elite was a consideration that adjusted the values. Shown where assessed.

Whose judgement is this?

I chose the categories and the framework for valuing the articles. Within that framework, I asked Claude to do its own analysis (avoiding others’ reviews and commentary), though it could follow data links and search for evidence to strengthen or refute Scott’s statements.

Claude was required to read each article in full, never truncating it. For a sense of scale, the evaluation phase is expected to use on the order of twenty-seven (27) one-million-token context windows of Opus 4.8: each new session reinitialises the whole project’s parameters, and the evaluation rules and anchors are reloaded before every article.

There was a good deal more to it than this short summary, and I’m not interested in nitpicking the details. Is it actually Claude’s own analysis, without any input from web commentary or critique? I’m not betting it is. Could I have done better prompt engineering? Almost certainly. Would it have changed the rankings much? Probably not.

One deliberate exception: given its length, I didn’t require Claude to read all of Unsong (Scott’s novel), and I let it draw on outside reviews and critique for that single entry.

The results seem reasonably consistent with general chatter online, and quite effective for the purpose; a quick heuristic for the next thing to read.

Reuse & licensing

The summaries and scores on this site are licensed CC BY 4.0 — reuse them freely with attribution. Scott Alexander’s original writing remains his; this site only links to it.

Data last generated 2026-06-20.