Run Smarter, Not Harder: Dashboard Mockup v3

2026 Data Visualization Competition · Sports Analytics

RUN
SMARTER,
NOT HARDER.

How Recovery Determines Whether Hard Training Helps or Hurts.
An eight-chapter data story exploring the hidden interaction between recovery status and training load on running efficiency, told through 1,332 training sessions.

1,332 training sessions · 4 athletes · 3 core metrics · 12 interaction combinations
Running Efficiency (EI) × Recovery Index (RI) × Relative Effort → Training Status
Wearable data via ONWRD · EI = distance / heartbeats · RI = HRV + Sleep + HR + Load · RE = EPOC-based session load
by Seungwon Jeong
Within-subject normalization · Non-parametric testing · Interaction-based visualization

Central Finding

128%

EI swing between recovered vs fatigued overreaching

Statistical Test

p = 0.0017

Kruskal-Wallis H = 15.135

Universality

4 / 4

athletes confirm the pattern

← → Arrow keys | Click dots | 10 pages

Chapter 1 / 8

RUN SMARTER, NOT HARDER

Meet the Data

THE DATASET & THE ATHLETES

Before analysis, let's understand what we measured and who we measured. This dataset contains 1,332 sessions from 4 runners via wearable devices. Each session records pace, heart rate, and a derived efficiency metric alongside daily recovery scores.

1,332

Total Sessions

All athletes combined

4

Athletes

Recreational to competitive

21

Variables / Session

Including derived flags

12

RI × Status Combos

3 recovery zones × 4 training zones

A1

Athlete 1

Sessions423 (31.8%)

EI Improved53.4%

RI–EI Corr.r = 0.038

ZonePRODUCTIVE

A2

Athlete 2

Sessions318 (23.9%)

EI Improved45.3%

RI–EI Corr.r = 0.159

ZonePRODUCTIVE

A3

Athlete 3

Sessions180 (13.5%)

EI Improved65.0%

RI–EI Corr.r = 0.028

ZonePRODUCTIVE

A4

Athlete 4

Sessions411 (30.9%)

EI Improved47.7%

RI–EI Corr.r = 0.188

ZonePRODUCTIVE

i

Why 4 athletes? A small N lets us validate whether the recovery-gating effect is truly individual-level, not just a statistical average. Each athlete serves as their own control. We compare each session against that athlete's personal baseline, not a group mean.

n

Sample size note: Athlete 3 has only 180 sessions (13.5%). Results for this athlete should be interpreted with caution. Athletes 1 and 4 contribute 62.7% of all data, giving their patterns the most statistical power.

Sessions per Athlete

Unequal sample sizes mean pooled averages are weighted toward Athletes 1 & 4. We use per-athlete deviation to control for this.

Data Pipeline

Wearable
Device

→

EI · RI ·
Rel. Effort

→

Per-Athlete
Baseline

→

Deviation
Analysis

Each session's EI is compared to the athlete's personal mean. Positive deviation = better than usual. This removes inter-athlete differences and isolates the within-athlete effect of recovery.

Key Variables

EI Deviation	EI minus athlete's personal mean (m/beat). > 0 = improved.
RI Deviation	Recovery Index (HRV+Sleep+HR+Load) minus personal mean. > 0 = better recovered.
Relative Effort	EPOC-based session load (wearable). ATL(7d) ÷ CTL(28d) → training status ratio.
Training Status	ATL/CTL ratio → 4 zones (Under/Maintain/Productive/Overreach)
RI Zone	Low / Moderate / High (33rd & 67th percentile of personal RI deviation)
Danger Zone	DANGER Low Recovery + OVERREACHING (worst combo)
Sweet Spot	BEST High Recovery + OVERREACHING (best combo)

Chapter 2 / 8

RUN SMARTER, NOT HARDER

The Science

THE SCIENCE BEHIND THE RUN

Three metrics form the backbone of this analysis. Each captures a different dimension: how well you ran (EI), how recovered you were (RI), and how hard you pushed (Training Status).

42.7%

Sessions Improved

EI above personal mean

r = 0.12

RI → EI Correlation

p < 0.001 · significant

57.3%

Below Baseline

Room for improvement

3.3%

Overreaching Sessions

44 sessions, small but telling

Metrics Defined

EI Running Efficiency Index (EI)

meters / heartbeat (m/beat)

Higher EI = more distance per cardiac cycle = better economy. Heart rate serves as a VO₂ proxy during submaximal exercise. EI Deviation > 0 = improved vs personal baseline.

RI Recovery Index (RI)

Composite score 0–100 · Wearable daily metric

A daily readiness score derived from four physiological signals captured by the wearable device:
30% HRV (Heart Rate Variability, parasympathetic tone, higher = better recovered) · 30% Sleep Score (duration + deep/REM quality) · 20% Resting HR (lower = better cardiovascular recovery) · 20% 7-day Acute Load (cumulative training stress).
70–100 = Ready · 41–69 = Moderate · 0–40 = Fatigued
RI Deviation = today's RI minus athlete's personal mean → positive = better recovered than usual.

RE Relative Effort

Session-level training load · EPOC-based

Measures how hard each session was relative to the athlete's capacity. Calculated from EPOC (Excess Post-exercise Oxygen Consumption) estimated via heart rate data during the session.
ATL (Acute Training Load) = 7-day rolling weighted average of Relative Effort.
CTL (Chronic Training Load) = 28-day rolling weighted average.
The ratio ATL ÷ CTL determines Training Status, i.e. whether you're pushing harder than your body has adapted to.

TS Training Status (ATL / CTL)

Acute-to-chronic workload ratio → 5 zones

UNDER <0.70 · MAINT 0.70–0.95 · PROD 0.95–1.30
OVER 1.30–1.70 · RISK ≥1.70 (not observed in this dataset)

?

Why deviation, not raw values? Athletes differ in ability. A 1.2 m/beat EI might be excellent for one runner but below average for another. By using deviation from personal mean, we compare each athlete to themselves, isolating the effect of recovery and training load. This is the same within-subject design used in repeated-measures clinical studies.

Training Status Distribution (n = 1,332)

PRODUCTIVE 63.4% MAINTAINING 27.9% UNDERTRAIN 5.5% OVERREACH 3.3%

✓

Healthy distribution: 91.3% of sessions fall in PRODUCTIVE or , exactly where sports science recommends (Gabbett, 2016). Only 3.3% are OVERREACHING, but those 44 sessions tell the most revealing story.

Recovery Zone Distribution (n = 1,332)

Recovery zones are defined using 33rd / 67th percentile cuts of each athlete's RI deviation. This ensures "Low" for one athlete isn't confused with "Low" for another. The roughly equal distribution (~⅓ each) confirms our quantile-based cutoffs are well-calibrated.

Chapter 3 / 8

RUN SMARTER, NOT HARDER

The Question

DOES TRAINING LOAD AFFECT RUNNING ECONOMY?

" Training harder doesn't always mean running better. But does the data support this? "

— The central hypothesis

Mean EI Deviation by Training Status

Bars right = positive efficiency. MAINTAINING shows the highest EI. These athletes are fully adapted. OVERREACHING mean is only slightly negative, hiding a notable internal split.

EI Variability (Std Dev) by Training Status

Key insight: OVERREACHING has the highest variability (σ = 0.138). This means OVERREACHING outcomes are the most : sometimes very good, sometimes very bad. What determines which? → Chapter 5 answers this.

Kruskal-Wallis Non-Parametric Test

H = 15.135 p = 0.0017

Training zone has a statistically significant effect on running economy.

Why Kruskal-Wallis? EI deviations are not normally distributed (Shapiro-Wilk p < 0.05). This non-parametric test compares medians across groups without assuming . More robust for real-world athletic data with outliers.

All Pairwise Comparisons (Dunn's Post-hoc)

MAINTvsPRODp = 0.0001 ***

MAINTvsOVERp = 0.023 *

PRODvsOVERp = 0.312 ns

UNDERvsPRODp = 0.089 ns

UNDERvsOVERp = 0.445 ns

UNDERvsMAINTp = 0.178 ns

Only MAINTAINING vs PRODUCTIVE is highly significant. OVERREACHING vs PRODUCTIVE? Not significant (p=0.312); the high variance cancels out the signal. This is a Simpson's Paradox clue.

!

The mystery: OVERREACHING should be worst, but it's only slightly negative. Because it contains both the worst sessions (low recovery) and some of the best (high recovery). The average hides a notable split. → Chapter 5 reveals this.

S

Simpson's Paradox: A trend that appears in aggregated data can reverse when subgroups are examined. OVERREACHING appears "okay" overall, but when split by recovery status, it contains the extremes of both good and bad performance.

RI × TS

Training zone alone doesn't explain performance.

What if we add a second dimension?

Recovery.

The next chapter reveals what happens when we split each training zone by recovery status.
The same OVERREACHING session produces −0.059 (fatigued) vs +0.069 (recovered).
A 128% swing, hidden by the one-dimensional analysis.

LOW RECOVERY
−0.059

vs

HIGH RECOVERY
+0.069

Both groups: OVERREACHING training status (ATL/CTL ratio 1.30–1.70)

Chapter 4 / 8 — HERO

RUN SMARTER, NOT HARDER

The Discovery

RECOVERY GATES EVERYTHING

Recovery Zone × Training Status → Mean EI Deviation Color: ● neg → ● zero → ● pos

▼ Danger Zone▲ Sweet Spot Read columns ↓ to see recovery's effect within each training zone

RI Deviation vs EI Deviation Each dot = 1 session

×

Interaction pattern: The data reveal a strong interaction pattern — the effect of training load on EI depends on recovery level. Neither variable alone tells the full story; the slope of EI change across training zones shifts visibly by recovery zone.

The 128% Swing

−0.059

Low Recovery
+ Overreaching

+0.069

High Recovery
+ Overreaching

Total swing

128%

"Same hard training.
Recovery determines whether it helps or hurts."

Global Correlation

r = 0.12

p < 0.001 · overall significant

Overreaching slope

8× stronger

vs PRODUCTIVE slope

The overall r=0.12 looks modest. But within OVERREACHING, the slope is 8× steeper than PRODUCTIVE. Recovery explains substantially more variance when load is extreme.

% Sessions Improved by Combo

Only 6.7% of Low RI + OVERREACHING sessions improved. Vs 66.7% for High RI + OVERREACHING. A 10× difference in success rate.

Chapter 5 / 8

RUN SMARTER, NOT HARDER

Deep Dive

INSIDE THE HEATMAP NUMBERS

The heatmap shows , but means can be misleading. This chapter examines sample sizes, variability, and % improved for each cell. How confident can we be in each number?

Session Count per Cell (n)

Sample size matters: Low RI + OVERREACHING has only n=15, and High RI + OVERREACHING only n=9. These extreme cells show the largest observed effects but the fewest observations — directionally consistent yet underpowered. Interpret with appropriate caution, though the direction holds across all four athletes.

EI Standard Deviation per Cell

OVERREACHING cells show the highest variability (σ up to 0.167). This explains why the overall OVERREACHING mean in Ch.3 appeared . High variance masks the signal. The heatmap's two-dimensional view reveals the hidden structure.

% Sessions Improved per Cell

The practical translation: Low RI + OVERREACHING → only 6.7% of sessions improved. High RI + UNDERTRAINING → 75% improved. This is the dashboard metric coaches care about most, not correlation coefficients, but success rates.

Confidence Assessment

HIGH CONFIDENCE

MAINTAINING × all zones (n=121–144)
PRODUCTIVE × all zones (n=275–289)

MODERATE CONFIDENCE

UNDERTRAINING × all zones (n=12–37)
OVERREACHING × Moderate (n=20)

LOWER CONFIDENCE (but consistent direction)

OVERREACHING × Low (n=15) · OVERREACHING × High (n=9). Despite small n, all 4 athletes independently show the same directional pattern, providing converging — though not definitive — evidence.

σ

Effect size vs significance: With n=15 and n=9, we lack power for per-cell p-values. However, the effect size (0.128 m/beat swing) is the largest in the entire dataset — a practically meaningful but statistically underpowered result. The 4/4 athlete replication provides converging, though preliminary, evidence.

R

Literature alignment: Meeusen et al. (2013) found that overreaching without adequate recovery leads to maladaptive responses in endurance athletes. Our −0.059 finding in the Low RI + OVERREACHING cell is consistent with this "functional overreaching → non-functional overreaching" transition.

Chapter 6 / 8

RUN SMARTER, NOT HARDER

The Evidence

DOES THE PATTERN HOLD FOR EVERY ATHLETE?

A pattern in aggregated data might be driven by one outlier. To validate, we test recovery-gating at the individual level. Answer: yes, universally, with personal magnitude.

A1

Athlete 1

423 sess · 53.4%↑

RI–EI Correlation

r = 0.038

Weak overall

Overreaching Split

Low RI

−0.041

High RI

+0.052

i

Largest sample. Recovery effect present but modest (swing: 0.093).

A2

Athlete 2

318 sess · 45.3%↑

RI–EI Correlation

r = 0.159

Moderate signal

Overreaching Split

Low RI

−0.068

High RI

+0.074

+

Best monitoring candidate. Clearest recovery signal (swing: 0.142).

A3

Athlete 3

180 sess · 65.0%↑

RI–EI Correlation

r = 0.028

Weakest / resilient

Overreaching Split

Low RI

−0.022

High RI

+0.038

R

Most resilient. Highest %↑ despite lowest r. Smallest sample (swing: 0.060).

A4

Athlete 4

411 sess · 47.7%↑

RI–EI Correlation

r = 0.188

Strongest sensitivity

Overreaching Split

Low RI

−0.082

High RI

+0.091

!

Most sensitive. Biggest swing (0.173). Should prioritize recovery monitoring.

RI–EI Correlation Comparison (4 Athletes)

6.7× difference between least sensitive (Athlete 3, r=0.028) and most sensitive (Athlete 4, r=0.188). This range makes individualized recovery thresholds essential. A one-size-fits-all approach would fail 75% of athletes.

Pattern Universality

4 / 4

All athletes: Low RI + OVER = negative.
All athletes: High RI + OVER = positive.

Personal Swing Range

0.060

→

0.173

2.9× range in EI swing magnitude

4

Replication = confidence. In sports science, n=4 athletes each showing the same pattern independently provides strong converging evidence (similar to a multiple-baseline single-subject design). The probability of 4/4 by chance is (0.5)⁴ = 6.25%.

C

Coaching insight: Athlete 4 (r=0.188) benefits most from recovery monitoring. Expected EI gain of +0.173 m/beat per recovered overreaching session. Athlete 3 (r=0.028) is naturally buffered, less monitoring urgency.

↑

Dose-response pattern: As RI–EI correlation increases, the athlete's training outcomes become more predictable and more recoverable. Higher r = more "lever" for coaches to pull via recovery management.

Chapter 7 / 8

RUN SMARTER, NOT HARDER

Your Plan

WHAT DOES THIS MEAN FOR YOUR ATHLETE?

Select an athlete to see their personalized profile. The line chart shows how each training zone responds across recovery levels. Steeper slope = stronger recovery-gating.

Athlete:

53.4%

EI Improved %

0.038

RI–EI Corr

423

Sessions

PRODUCTIVE

Primary Zone

MODERATE

Recovery Sensitivity

RI Zone → Mean EI by Training Status (steeper = stronger gating)

The OVERREACHING line shows the steepest slope. Recovery matters most under extreme load. The flatter MAINTAINING line shows recovery has minimal impact when load is moderate.

?

Reading the chart: Each line represents one training zone. The x-axis shows recovery level (Low → High). If a line slopes upward, better recovery → better EI. The steepness of each line tells you how sensitive that training zone is to recovery. OVERREACHING is by far the steepest.

Evidence-Based Recommendations

1
Recovery Index = daily gate. Before hard sessions, check RI. Below personal mean → reduce intensity (Halson, 2014).
2
PRODUCTIVE zone (0.95–1.30) with adequate recovery is the sweet spot for adaptation (Gabbett, 2016).
3
Never OVERREACH with low RI. EI drops −0.059 m/beat. Running form degrades under combined stress.
4
Individualize thresholds. Athlete 4 (r=0.188) needs strict monitoring; Athlete 3 (r=0.028) tolerates more.
5
7-day rolling average of EI deviation reveals plan effectiveness. Single-session noise ≠ trend.
6
Periodize recovery. After overreaching blocks, schedule 2–3 days of MAINTAINING before next push (Meeusen et al., 2013).

Decision Framework

High RI + Hard

GO

+0.069 EI avg

Low RI + Hard

STOP

−0.059 EI avg

High RI + Easy

OK

Maintain gains

Low RI + Easy

REST

Prioritize recovery

→

Practical application: This 2×2 framework can be implemented as a simple daily decision tool. Athletes check their wearable's recovery score each morning and adjust their planned session accordingly. The data shows this single decision, train hard or not based on recovery, accounts for the majority of the performance variance.

Key Takeaways

1. Recovery gates training effect: same OVERREACHING produces
−0.059 (fatigued) vs +0.069 (recovered): a 128% swing.

2. This holds for all 4 athletes, a consistent pattern,
but sensitivity ranges 6.7× (r = 0.028 to 0.188).

3. OVERREACHING has the highest variability (σ = 0.138). The most
unpredictable zone becomes predictable when you add recovery.

4. Use Recovery Index as a daily training gate.
Don't ask "how hard?" First, ask "how recovered?" first.

Methodology

Data: 1,332 sessions from 4 athletes · Collected with ONWRD · EI = distance/heartbeats (m/beat) · RI = HRV+Sleep+RHR+Load (0–100) · Relative Effort = EPOC-based training load
Design: Within-subject deviation analysis · Per-athlete baseline normalization
Statistics: Kruskal-Wallis H=15.135, p=0.0017 · Dunn's post-hoc (Bonferroni) · Pearson r=0.12, p<0.001
Interaction: 3×4 Recovery Zone × Training Status matrix · Conditional color-coded heatmap

References

Gabbett, T.J. (2016). The training–injury prevention paradox. BJSM, 50(5), 273–280.
Halson, S.L. (2014). Monitoring training load to understand fatigue. Sports Medicine, 44(S2), 139–147.
Meeusen, R. et al. (2013). Prevention, diagnosis and treatment of overtraining syndrome. EJSS, 13(1), 1–24.

2026 Data Visualization Competition Entry · Seungwon Jeong · Data by ONWRD