An Applied Scenario — Bearing Faults Per Hour¶
A different sensor on the same motor monitors shock pulses — the high-frequency clicks a damaged ball bearing emits as it rolls past a load zone. On a healthy bearing, these clicks are extremely rare. On a damaged one, they multiply.
You want a maintenance metric: how many shock pulses arrive in a one-hour window?
Try modelling this with the Binomial. You’d need:
— the number of “trials” per hour. But what is a trial? A nanosecond? A bearing rotation? It depends on how finely you slice time.
— the probability of a click in one trial. As you slice time finer, grows and shrinks.
Yet the expected count per hour — say, clicks/hour on a healthy bearing — is a real, measurable physical quantity. It doesn’t depend on how you slice time.
You want a distribution parameterised by alone, where the trial-counting machinery quietly disappears. That distribution exists, and it’s what the Binomial becomes in exactly this regime.
Intuition¶
The Binomial works perfectly when you know and separately. But many counting problems involve events that are rare across a large region — and the region’s “size” is what you actually measure, not and individually.
The Poisson distribution is what the Binomial becomes when:
(the region is sliced into infinitely many trials)
(each trial is vanishingly unlikely)
stays constant (the expected count is fixed by the physics)
The two parameters collapse into one: , the expected count per region.
Examples Across Domains¶
| Process | Underlying | Underlying | |
|---|---|---|---|
| Bearing shock pulses | Time slices per hour | P(click in slice) | Clicks per hour |
| Network packet drops | Packets per minute | P(drop) | Drops per minute |
| Photon counting | Available photons | P(photon hits photosite) | Expected electrons per exposure |
| Defects on a surface | Surface micro-cells | P(defect per cell) | Defects per m² |
| Earthquakes in a region | Crustal “trial” stress events | P(release per event) | Events per year |
| Background pixel changes | Pixels per frame | P(noise above threshold) | Changed pixels per frame |
In every row: huge , tiny , moderate . The model doesn’t care what kind of “event” you’re counting.
The Derivation¶
Start from the Binomial PMF and take the limit.
Substitute :
Step-by-step limit¶
Expand the binomial coefficient:
For large , each factor is approximately :
Substitute back:
The terms cancel:
As :
(the definition of )
(since is fixed and )
Mean: Variance:
The magical property: the mean equals the variance. This single fact is the engine behind every shot-noise calculation in sensors, every queueing model in networks, and every count-based regression in statistics.
Back to the Bearing¶
For the healthy bearing with clicks/hour:
Expected count per hour: 3
Standard deviation:
So observing 0 clicks in an hour is uncommon but not alarming (5% of healthy hours look like this). Observing 10 clicks in an hour is a 1-in-1000 event under the healthy model — strong evidence the bearing state has changed.
The same logic, with different , runs every condition-monitoring threshold in the building.
Why Poisson is the Right Model for Photon Counting¶
A typical LED emits ~1018 photons per second. The probability that any specific photon reaches a 6 µm × 6 µm photosite is vanishingly small. But the product — determined by illumination, reflectance, exposure time, and sensor area — sits in the range of tens to thousands.
This is exactly the Poisson regime: enormous , tiny , moderate . The Poisson model is not an approximation here — it is the physically correct distribution for photon counting.
The Poisson Has ONE Parameter — Why That Matters¶
| Distribution | Parameters | Mean | Variance | Mean = Variance? |
|---|---|---|---|---|
| Binomial(, ) | , | Only when | ||
| Poisson() | Always |
If you measure the mean of a Poisson process, you immediately know its variance. Sensor designers use this constantly — they predict noise from signal level alone. The same trick works for any system in the Poisson regime: a packet-loss monitor, a click-rate model, or a defect-counting pipeline.