The Inspection Paradox — Why the Bus Is Always Late

Chapter 01You're Not Paranoid

You arrive at the bus stop. The schedule says buses come every 10 minutes on average. Simple enough: you should wait about 5 minutes, right? Half the interval. That's what average means.

Except you wait 8 minutes. Then 9. Then 12. Over weeks of commuting, your average wait creeps toward 7, 8, even 10 minutes. You start to suspect the transit authority is lying, or that the universe has a personal vendetta against you.

It doesn't. The math does.

The inspection paradox is what happens when you measure a process by randomly arriving in the middle of it. You don't get an average sample — you get a biased one, tilted toward the long side, every single time.

This isn't a fluke of bus schedules. It's a universal sampling bias that infects everything from class sizes to friendship networks to airport security lines. And once you see it, you can't unsee it.

Chapter 02Length-Biased Sampling

Imagine laying out the time between buses on a ruler. Some gaps are short — 3 minutes, 4 minutes — and some are long: 15, 18, even 22 minutes. The average gap is 10 minutes. Now close your eyes and drop your finger randomly on the ruler.

Which gap are you more likely to land in?

The long one. Obviously. A 20-minute gap occupies twice as much of the ruler as a 10-minute gap. You're literally twice as likely to land in it. This is length-biased sampling: when you sample a random point in time, the probability of landing in any particular interval is proportional to that interval's length.1

Suppose a bus route has exactly three gaps between buses: 2 minutes, 8 minutes, and 20 minutes. The average gap is (2 + 8 + 20) / 3 = 10 minutes.

But if you arrive at a random time, the probability of landing in each gap is proportional to its length: 2/30, 8/30, and 20/30. The expected gap you experience is:

(2/30)×2 + (8/30)×8 + (20/30)×20 = 14.9 minutes

Almost 15 minutes — fifty percent more than the "average" of 10. The average gap and the average experienced gap are completely different numbers.

This is the core of the inspection paradox. The average of the intervals is not the same as the average interval you'll experience if you show up at a random time. The experienced average is always larger — always — as long as there's any variation in the intervals at all.2

Chapter 03The Waiting Time Paradox

Let's get formal. Suppose buses arrive as a Poisson process with rate λ — meaning the average time between arrivals is 1/λ. If buses come every 10 minutes on average, λ = 0.1 per minute.

Poisson processes have a beautiful and terrible property: they're memoryless. No matter when you arrive at the stop, the expected time until the next bus is... the full average interval. Not half. The full thing.3

The Waiting Time Paradox

E[W] = 1/λ = μ

Your expected wait equals the full average interval — not half of it.

This feels outrageous. If the average gap is 10 minutes and you arrive at a "random" point within a gap, shouldn't you be halfway through on average? Shouldn't the expected wait be 5 minutes?

No. Because you're not arriving in a random gap. You're arriving at a random time — and that random time is more likely to fall within a long gap. The length-biased sampling exactly cancels the "you're partway through" benefit. For a Poisson process, these two effects perfectly offset: you land in a gap that's, on average, twice as long as the typical gap, and you're halfway through it. Half of twice the average is... the average.4

The Key Insight

For a Poisson process, the expected wait is the full mean interval. For non-Poisson arrivals with more variability, it's even worse. The general formula is: E[W] = (μ/2)(1 + CV²), where CV is the coefficient of variation of the inter-arrival times. More variability → longer waits.

Chapter 04The Friendship Paradox

In 1991, the sociologist Scott Feld published a paper with one of the most delightfully depressing titles in academic history: "Why Your Friends Have More Friends Than You Do."5

It's not a self-esteem problem. It's the inspection paradox wearing a different hat.

Here's the logic. Pick a random person. Now pick one of their friends at random. Is that friend likely to be popular or unpopular? Popular — because popular people appear on more friend lists. When you sample a friend-of-a-friend, you're length-biased sampling from the social network: people with more connections are proportionally more likely to show up.

Numbers show each person's friend count. The red node (5 friends) appears on everyone's friend list.

The average person in this network has (5+2+2+2+2+1)/6 = 2.3 friends. But the average number of friends that your friends have? Much higher. The hub with 5 friends shows up on 5 different friend lists, dragging the average way up. The math works out the same as the bus paradox: you're sampling people weighted by their popularity, not uniformly.

The Friendship Paradox

E[friends of friends] = μ + σ²/μ

Where μ is mean degree and σ² is variance. As long as σ² > 0, your friends have more friends than you (on average).

This isn't just a curiosity. Researchers have used the friendship paradox for early epidemic detection: instead of monitoring random people, monitor random people's friends. Because those friends are biased toward high-connectivity individuals, they catch infections earlier — a real-world exploitation of length-biased sampling.6

Chapter 05The Class Size Paradox

A university proudly reports that its average class size is 25 students. Sounds intimate, personal, the kind of place where professors know your name. You enroll. Every classroom you walk into has 80, 150, 300 people. What happened?

The university wasn't lying. They calculated the average across all classes. But your experience is different: you are a student, and as a student, you're more likely to be in a large class — because large classes contain more students.7

A university offers 3 classes: one with 10 students, one with 20, and one with 270. The university's average class size: (10 + 20 + 270) / 3 = 100.

But 270 out of 300 total students are in the big class. The average class size experienced by students: (10×10 + 20×20 + 270×270) / 300 = 245.

The university says 100. The students experience 245. Both numbers are correct. They're just measuring different things.

This is the inspection paradox again. When you sample by classes, each class counts equally. When you sample by students, each class is weighted by its size. The student-weighted average always exceeds the class-weighted average — it's mathematically guaranteed.

The average person is in an above-average group. Not a paradox — a theorem.

Chapter 06The Bus Stop Simulator

Enough theory. Let's watch the inspection paradox happen in real time. Below is a bus route where buses arrive with exponentially distributed gaps (a Poisson process). The average gap is 10 minutes. You can arrive at any time and see how long you actually wait.

Bus Stop Simulator

Buses arrive every 10 minutes on average (Poisson process). Click "Arrive Now" to show up at a random time, or run 1,000 trials to see the distribution.

Average interval between buses 10 min

Trials

Avg Wait

—

Expected (naive)

5.0

Naive expectation (μ/2)

Your actual waits

True expectation (μ)

If you ran enough trials, you'll notice your average wait converges not to half the interval (the naive expectation), but to the full interval — exactly what the math predicts for a Poisson process. The histogram of your waits follows an exponential distribution with the same mean as the original process. Length-biased sampling in action.

Chapter 07Everywhere You Look

Once you have the inspection paradox in your mental toolkit, you start seeing it everywhere — because it is everywhere.

Why your flight is always full

Airlines report an average load factor of 87%. But you're a passenger, which means you're more likely to be on a full flight than an empty one. The flights with 200 passengers contribute 200 passenger-experiences; the flight with 30 passengers contributes only 30. Your experienced load factor is systematically higher than the airline's reported average.8

Why you always pick the slow checkout line

Partly this is confirmation bias. But partly it's real: if you join a line at a random time, you're more likely to arrive during a period when lines are long (because long-line periods last longer). The same inspection paradox logic applies to time — you sample the slow phase because the slow phase occupies more time.

Why the pool is always crowded

The pool manager says "we average 15 swimmers at a time." But you're a swimmer, and you tend to go when... other swimmers go. Survey swimmers and they'll report a much higher average. Each swimmer's experience is weighted by the crowd size when they swim, which biases upward for the same mathematical reason.

Why wars feel endless

A randomly selected person is more likely to be born during a long war than a short skirmish. If you lived through the 20th century and felt like humans were always fighting, the inspection paradox was part of why — the long wars dominated lived experience far more than a simple count of conflicts would suggest.

The General Principle

Any time you sample from a process by being part of it rather than observing it from outside, you get a length-biased sample. The experienced average always exceeds the objective average. The only way they're equal is if there's zero variance — if every interval, class, or flight is exactly the same size.

Chapter 08The Antidote

The inspection paradox isn't a bug in reality — it's a bug in how we intuitively process averages. The "average" depends ferociously on what you're averaging over. Average class size per class or per student? Average bus gap per gap or per moment you might arrive? These are different questions with different answers, and our brains chronically confuse them.

The fix is surprisingly simple: ask "weighted by what?" Every time someone presents an average — airline on-time percentage, hospital success rates, class sizes, internet speeds — ask whose perspective it represents. The provider's average and the user's average are almost never the same number.

And the next time you're standing at a bus stop, watching the minutes tick past the "average" wait time, you can take some comfort in knowing: you're not unlucky. You're just a randomly arriving observer in a variable-interval process, experiencing the mathematically inevitable consequences of length-biased sampling.

Though I'll admit, that's cold comfort when the bus is 15 minutes late.9

Notes

This is formally known as "size-biased sampling" or "length-biased sampling." The foundational treatment is in Cox, D.R., "Renewal Theory," Methuen, 1962. Cox showed that the distribution of the interval containing a random point is the original distribution weighted by interval length.
Technically, the expected length of the interval containing a random point is μ + σ²/μ, where μ is the mean and σ² the variance of the inter-arrival distribution. This is always ≥ μ, with equality only when σ² = 0.
The memoryless property is unique to the exponential distribution (for continuous distributions). Formally: P(X > s + t | X > s) = P(X > t). It's why Poisson processes are so mathematically tractable and so counterintuitive.
For a Poisson process, the interval containing a random point has the distribution of the sum of two independent exponentials — an Erlang(2,λ) distribution — with mean 2/λ. You're uniformly located within it, so your expected wait is half that: 1/λ = μ. See Feller, W., "An Introduction to Probability Theory and Its Applications," Vol. II, Wiley, 1971.
Feld, Scott L. "Why Your Friends Have More Friends Than You Do." American Journal of Sociology 96, no. 6 (1991): 1464–1477. The paper is a masterpiece of making a mathematical inevitability feel like a personal insult.
Christakis, N.A. and Fowler, J.H. "Social Network Sensors for Early Detection of Contagious Outbreaks." PLoS ONE 5(9), 2010. They monitored Harvard students' friends and detected flu outbreaks 2 weeks earlier than monitoring random students.
This example is from Hemenway, David. "The Class Size Paradox." American Statistician 36, no. 1 (1982). The paradox generalizes: the average number of people on your bus, in your gym class, at your polling place — all systematically exceed the averages reported by the institutions.
U.S. domestic airline load factors have hovered around 85-90% since 2015 (Bureau of Transportation Statistics). But surveys of fliers consistently report experienced load factors above 90%. The inspection paradox accounts for most of the gap.
If it helps: many transit agencies now use real-time GPS tracking and dynamic headway management specifically to reduce variance in bus intervals. Lower variance → smaller gap between the "official" average wait and your experienced wait. The inspection paradox is the mathematical justification for every transit app on your phone.