Engineering Council Test Reliability Report

Executive Snapshot

Daily Runs: 7
Daily Green: 4/7
Avg Daily Runtime: 62m 25s
Smoke Attempts: 17
Smoke Green: 10/17
Avg Smoke Runtime: 3m 18s
Median Smoke Time: 3m 10s
Current Green Streak: 1

Executive Analysis

Bottom line: release confidence is unstable in both the broad regression path and the deploy smoke path. The immediate job is to separate real product regressions from execution noise, then burn down the concentrated failure clusters.

What Matters

Daily regression passed 4 of 7 runs (57.1%), with a current green streak of 1 and a best streak of 3 in this window.
Smoke passed 10 of 17 attempts (58.8%) across 13 production pipelines.
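Failure concentration is not random: Frontend has both the highest strict failure ratio and the highest non-pass ratio, at 0.28% each (6 failed of 2160 executions, spread over 3 runs). The two ratios coincide because no executions were pending or skipped.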
University is the weakest smoke surface in this window at 1/4 green (25.0%).
Daily-suite runtime averaged 62m 25s.
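
The pass rates and streaks quoted above follow directly from the ordered run statuses. The following is a minimal Python sketch, assuming the statuses are available as a chronological list; the function names are illustrative and not part of any reporting tool.

```python
def pass_rate(statuses):
    """Fraction of runs whose status is PASSED."""
    return sum(s == "PASSED" for s in statuses) / len(statuses)


def green_streaks(statuses):
    """Return (current_streak, best_streak) of consecutive PASSED runs."""
    best = current = 0
    for status in statuses:
        # A non-PASSED run resets the running streak to zero.
        current = current + 1 if status == "PASSED" else 0
        best = max(best, current)
    return current, best


# Daily-suite statuses in chronological order, copied from Recent Daily Suite Runs.
daily = ["PASSED", "PASSED", "PASSED", "FAILED", "FAILED", "FAILED", "PASSED"]

print(f"{pass_rate(daily):.1%}")  # 57.1%
print(green_streaks(daily))       # (1, 3)
```

The same `pass_rate` helper applied to the 17 smoke attempts (10 passed) gives 58.8%.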

Engineering Analysis

The failure profile is concentrated enough to act on. Frontend carries the strongest signal on both ratios and accounts for 6 of the window's 7 failed executions, which means reliability work should be assigned by category ownership instead of treating the suite as one undifferentiated problem.
The daily suite (57.1% green) is only marginally less stable than smoke (58.8% green), so neither path can be treated as the healthy baseline. Failures in the broader suite usually mean product regressions are reaching coverage areas that the narrow deploy gate does not exercise.
The daily suite is now large enough that runtime itself is becoming a management variable at 62m 25s average duration. At that size, every additional flaky or redundant test has a measurable cost on feedback speed.

Recommended Actions

Assign one owner to Frontend for the next cycle and expect a short written burn-down: top failing tests, suspected root causes, flake versus regression breakdown, and what gets fixed or quarantined first.
Treat the daily regression suite like an operations queue until it is calm again: triage failures after each red run, close known-noise items fast, and avoid letting multiple unrelated red signals pile up between runs.
Put University smoke under closer guardrails for the next release cycle. It is the best place to improve first-pass deploy confidence quickly.

Improvement Ideas

Introduce a small reliability budget for tests: every flaky or quarantined case needs an owner and an expiry, and the team should review that budget weekly the same way it reviews bugs or incidents.
Track first-fail to root-cause time as a core metric. Fast diagnosis is as important as raw pass rate because the practical value of a test gate depends on how quickly it helps the team recover.
Define a runtime budget per suite and require justification when test count or duration grows. Reliable feedback systems stay trusted when they remain both stable and proportionate.
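
One way to make the reliability budget reviewable is to record each quarantined test with its owner and expiry and list the overdue ones at the weekly review. The sketch below is an assumption about how such a record could look; `QuarantineEntry`, `overdue`, the test identifiers, and the owner names are all hypothetical and not taken from any existing tool.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class QuarantineEntry:
    """One flaky or quarantined test in the reliability budget (hypothetical schema)."""
    test_id: str
    owner: str
    expires: date


def overdue(entries, today):
    """Entries whose expiry date has passed and that need a fix-or-remove decision."""
    return [e for e in entries if e.expires < today]


# Illustrative data only; these identifiers and owners are made up.
budget = [
    QuarantineEntry("frontend/example_flaky_test", "owner-a", date(2026, 5, 8)),
    QuarantineEntry("university/example_smoke_test", "owner-b", date(2026, 4, 30)),
]

for entry in overdue(budget, today=date(2026, 5, 1)):
    print(f"{entry.test_id} (owner {entry.owner}) expired on {entry.expires}")
```

Keeping the expiry as a hard date rather than a status flag means an entry becomes visible again without anyone having to remember it.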

Category Execution Ratios

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Example: if Billing executed 800 times across the week and 2 of those executions failed, the Billing strict failure ratio is 2 / 800 = 0.25%. That does not mean 0.25% of pipelines failed; it means 0.25% of observed Billing executions ended in a failed state.
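
The two ratios can be written as a pair of small Python functions. This is a sketch of the definitions above rather than the report generator's actual code; the function names and the choice to return 0.0 for a category with no executions are assumptions.

```python
def strict_failure_ratio(failed, total):
    """Failed executions divided by total executions for one category."""
    # Assumption: a category with no executions reports 0.0 instead of raising.
    return failed / total if total else 0.0


def non_pass_ratio(failed, pending, skipped, total):
    """(failed + pending + skipped) executions divided by total executions."""
    return (failed + pending + skipped) / total if total else 0.0


# The Billing example from the text: 2 failures in 800 executions.
print(f"{strict_failure_ratio(2, 800):.2%}")     # 0.25%

# Frontend and Web figures from the Category Aggregate Table.
print(f"{strict_failure_ratio(6, 2160):.2%}")    # 0.28%
print(f"{non_pass_ratio(1, 0, 0, 5460):.2%}")    # 0.02%
```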

Strict Failure Ratio

Share of category executions that ended in failed across all daily runs in this window.

Billing	0.00%
Web	0.02%
Frontend	0.28%
Library	0.00%

Non-pass Ratio

Share of category executions that ended in failed, pending, or skipped across all daily runs in this window.

Billing	0.00%
Web	0.02%
Frontend	0.28%
Library	0.00%

Category Aggregate Table

Category	Total Executions	Failed	Pending	Skipped	Strict Failure Ratio	Non-pass Ratio	Runs With Failures
Billing	756	0	0	0	0.00%	0.00%	0
Web	5460	1	0	0	0.02%	0.02%	1
Frontend	2160	6	0	0	0.28%	0.28%	3
Library	602	0	0	0	0.00%	0.00%	0

Recent Runs

Recent Daily Suite Runs

Date	Pipeline	Suites	Status	Total	Passed	Failed	Pending
2026-04-25 18:26	153528	Billing, Web, Frontend, Library	PASSED	1244	1244	0	0
2026-04-26 18:25	153530	Billing, Web, Frontend, Library	PASSED	1244	1244	0	0
2026-04-27 18:25	153750	Billing, Web, Frontend, Library	PASSED	1244	1244	0	0
2026-04-28 18:26	153968	Billing, Web, Frontend, Library	FAILED	1244	1240	4	0
2026-04-29 18:25	154112	Billing, Web, Frontend, Library	FAILED	1244	1243	1	0
2026-04-30 23:05	154297	Billing, Web, Frontend, Library	FAILED	1514	1512	2	0
2026-05-01 18:25	154317	Billing, Web, Frontend, Library	PASSED	1244	1244	0	0

Recent Smoke Attempts

Date	Suite	Pipeline	Job	Status	Passed	Failed	Duration
2026-04-27 11:26	Frontend	153582	Frontend smoke	PASSED	110	0	3m 12s
2026-04-27 15:29	Frontend	153624	Frontend smoke	PASSED	110	0	3m 09s
2026-04-28 14:10	University	153904	University smoke	PASSED	60	0	2m 21s
2026-04-28 14:14	Frontend	153904	Frontend smoke	PASSED	110	0	3m 00s
2026-04-28 14:36	University	153915	University smoke	FAILED	59	1	2m 29s
2026-04-28 14:39	University	153915	University smoke	FAILED	59	1	2m 58s
2026-04-28 14:40	Frontend	153915	Frontend smoke	FAILED	108	2	4m 09s
2026-04-28 15:47	Frontend	153940	Frontend smoke	FAILED	108	2	3m 46s
2026-04-28 18:27	Frontend	153967	Frontend smoke	FAILED	108	2	4m 04s
2026-04-28 18:40	University	153970	University smoke	FAILED	59	1	2m 56s
2026-04-28 18:45	Frontend	153970	Frontend smoke	FAILED	108	2	3m 57s
2026-04-29 13:02	Frontend	154041	Frontend smoke	PASSED	110	0	3m 37s
2026-04-30 06:47	Frontend	154117	Frontend smoke	PASSED	110	0	3m 27s
2026-04-30 14:43	Frontend	154215	Frontend smoke	PASSED	110	0	3m 34s
2026-04-30 15:59	Frontend	154270	Frontend smoke	PASSED	110	0	3m 09s
2026-04-30 16:14	Frontend	154276	Frontend smoke	PASSED	110	0	3m 20s
2026-04-30 17:10	Frontend	154296	Frontend smoke	PASSED	110	0	3m 04s

Smoke Suite Breakdown

Suite	Attempts	Pipelines	Green	Passed	Failed	Incomplete	Avg Runtime	Median Passing Runtime
Frontend	13	13	69%	9	4	0	3m 30s	3m 12s
University	4	3	25%	1	3	0	2m 41s	2m 21s
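
The per-suite figures above can be reproduced from the rows in Recent Smoke Attempts. The sketch below is one possible way to derive them in Python, assuming average runtime covers all attempts, median passing runtime covers only PASSED attempts, and durations are rounded to the nearest second. The helper names are illustrative.

```python
from statistics import mean, median

# (suite, status, duration) copied from Recent Smoke Attempts, in table order.
ATTEMPTS = [
    ("Frontend", "PASSED", "3m 12s"), ("Frontend", "PASSED", "3m 09s"),
    ("University", "PASSED", "2m 21s"), ("Frontend", "PASSED", "3m 00s"),
    ("University", "FAILED", "2m 29s"), ("University", "FAILED", "2m 58s"),
    ("Frontend", "FAILED", "4m 09s"), ("Frontend", "FAILED", "3m 46s"),
    ("Frontend", "FAILED", "4m 04s"), ("University", "FAILED", "2m 56s"),
    ("Frontend", "FAILED", "3m 57s"), ("Frontend", "PASSED", "3m 37s"),
    ("Frontend", "PASSED", "3m 27s"), ("Frontend", "PASSED", "3m 34s"),
    ("Frontend", "PASSED", "3m 09s"), ("Frontend", "PASSED", "3m 20s"),
    ("Frontend", "PASSED", "3m 04s"),
]


def to_seconds(text):
    """Convert a duration such as '3m 09s' to whole seconds."""
    minutes, seconds = text.split()
    return int(minutes.rstrip("m")) * 60 + int(seconds.rstrip("s"))


def fmt(seconds):
    """Format seconds as 'Xm YYs', rounding to the nearest second."""
    seconds = round(seconds)
    return f"{seconds // 60}m {seconds % 60:02d}s"


def breakdown(suite):
    """Summarise attempts, green rate, and runtimes for one smoke suite."""
    rows = [(status, to_seconds(d)) for s, status, d in ATTEMPTS if s == suite]
    all_secs = [secs for _, secs in rows]
    passing = [secs for status, secs in rows if status == "PASSED"]
    return {
        "attempts": len(rows),
        "passed": len(passing),
        "failed": len(rows) - len(passing),
        "green": f"{len(passing) / len(rows):.0%}",
        "avg_runtime": fmt(mean(all_secs)),
        "median_passing_runtime": fmt(median(passing)),
    }


for name in ("Frontend", "University"):
    print(name, breakdown(name))
```

Under these assumptions the output matches the breakdown for both suites.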