Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Part 3: The Poisson Distribution — The Limit of Many Rare Events

DhvaniAI

An Applied Scenario — Bearing Faults Per Hour

A different sensor on the same motor monitors shock pulses — the high-frequency clicks a damaged ball bearing emits as it rolls past a load zone. On a healthy bearing, these clicks are extremely rare. On a damaged one, they multiply.

You want a maintenance metric: how many shock pulses arrive in a one-hour window?

Try modelling this with the Binomial. You’d need:

Yet the expected count per hour — say, λ=3\lambda = 3 clicks/hour on a healthy bearing — is a real, measurable physical quantity. It doesn’t depend on how you slice time.

You want a distribution parameterised by λ\lambda alone, where the trial-counting machinery quietly disappears. That distribution exists, and it’s what the Binomial becomes in exactly this regime.


Intuition

The Binomial works perfectly when you know nn and pp separately. But many counting problems involve events that are rare across a large region — and the region’s “size” is what you actually measure, not nn and pp individually.

The Poisson distribution is what the Binomial becomes when:

The two parameters collapse into one: λ\lambda, the expected count per region.


Examples Across Domains

ProcessUnderlying nnUnderlying ppλ\lambda
Bearing shock pulsesTime slices per hourP(click in slice)Clicks per hour
Network packet dropsPackets per minuteP(drop)Drops per minute
Photon countingAvailable photonsP(photon hits photosite)Expected electrons per exposure
Defects on a surfaceSurface micro-cellsP(defect per cell)Defects per m²
Earthquakes in a regionCrustal “trial” stress eventsP(release per event)Events per year
Background pixel changesPixels per frameP(noise above threshold)Changed pixels per frame

In every row: huge nn, tiny pp, moderate λ\lambda. The model doesn’t care what kind of “event” you’re counting.


The Derivation

Start from the Binomial PMF and take the limit.

P(k)=(nk)pk(1p)nkP(k) = \binom{n}{k} p^k (1-p)^{n-k}

Substitute p=λ/np = \lambda / n:

P(k)=(nk)(λn)k(1λn)nkP(k) = \binom{n}{k} \left(\frac{\lambda}{n}\right)^k \left(1 - \frac{\lambda}{n}\right)^{n-k}

Step-by-step limit

Expand the binomial coefficient:

(nk)=n!k!(nk)!=n(n1)(n2)(nk+1)k!\binom{n}{k} = \frac{n!}{k!(n-k)!} = \frac{n(n-1)(n-2)\cdots(n-k+1)}{k!}

For large nn, each factor n,(n1),(n2),n, (n-1), (n-2), \ldots is approximately nn:

(nk)nkk!\binom{n}{k} \approx \frac{n^k}{k!}

Substitute back:

P(k)nkk!λknk(1λn)nkP(k) \approx \frac{n^k}{k!} \cdot \frac{\lambda^k}{n^k} \cdot \left(1 - \frac{\lambda}{n}\right)^{n-k}

The nkn^k terms cancel:

P(k)λkk!(1λn)n(1λn)kP(k) \approx \frac{\lambda^k}{k!} \cdot \left(1 - \frac{\lambda}{n}\right)^{n} \cdot \left(1 - \frac{\lambda}{n}\right)^{-k}

As nn \to \infty:

P(kλ)=λkeλk!\boxed{P(k \mid \lambda) = \frac{\lambda^k \, e^{-\lambda}}{k!}}

Mean: E[k]=λE[k] = \lambda Variance: Var(k)=λ\text{Var}(k) = \lambda

The magical property: the mean equals the variance. This single fact is the engine behind every shot-noise calculation in sensors, every queueing model in networks, and every count-based regression in statistics.


Back to the Bearing

For the healthy bearing with λ=3\lambda = 3 clicks/hour:

So observing 0 clicks in an hour is uncommon but not alarming (5% of healthy hours look like this). Observing 10 clicks in an hour is a 1-in-1000 event under the healthy model — strong evidence the bearing state has changed.

The same logic, with different λ\lambda, runs every condition-monitoring threshold in the building.


Why Poisson is the Right Model for Photon Counting

A typical LED emits ~1018 photons per second. The probability that any specific photon reaches a 6 µm × 6 µm photosite is vanishingly small. But the product λ=np\lambda = np — determined by illumination, reflectance, exposure time, and sensor area — sits in the range of tens to thousands.

This is exactly the Poisson regime: enormous nn, tiny pp, moderate λ\lambda. The Poisson model is not an approximation here — it is the physically correct distribution for photon counting.


The Poisson Has ONE Parameter — Why That Matters

DistributionParametersMeanVarianceMean = Variance?
Binomial(nn, pp)nn, ppnpnpnp(1p)np(1-p)Only when p0p \to 0
Poisson(λ\lambda)λ\lambdaλ\lambdaλ\lambdaAlways

If you measure the mean of a Poisson process, you immediately know its variance. Sensor designers use this constantly — they predict noise from signal level alone. The same trick works for any system in the Poisson regime: a packet-loss monitor, a click-rate model, or a defect-counting pipeline.