Engineering Council Test Reliability Report

Executive Snapshot

7

Daily Runs

0/7

Daily Green

21m 42s

Avg Daily Runtime

16

Smoke Attempts

14/16

Smoke Green

3m 14s

Avg Smoke Runtime

3m 09s

Median Smoke Time

0

Current Green Streak

Executive Analysis

Bottom line: the regression system is informative but not calm. The data suggest repeatable problem areas rather than random breakage, which means focused ownership should move the needle quickly.

What Matters

Daily regression passed 0 of 7 runs (0.0%), with a current green streak of 0 and a best streak of 0 in this window. The latest daily run (157199) failed, so the system is ending the week under tension rather than in a clean state. 7 failed run(s) never reached complete daily-suite counts, which points to some infrastructure or setup noise mixed into the product signal.
Smoke passed 14 of 16 attempts (87.5%) across 9 production pipelines. 2 pipeline(s) recovered on rerun, which is useful for continuity but also a sign that first-pass deploy signal is noisier than it should be.
Failure concentration is not random: Frontend has the highest strict failure ratio at 4.75%, while Social has the broadest non-pass footprint at 16.67%.
University is the weakest smoke surface in this window at 4/5 green (80.0%).
Daily-suite runtime averaged 21m 42s, while observed daily test volume moved from 1,273 to 450.

Engineering Analysis

A release gate should fail loudly for product regressions and quietly for infrastructure noise. Rerun recoveries plus incomplete daily or smoke attempts suggest those two failure modes are still partially mixed together.
The failure profile is concentrated enough to act on. Frontend and Social are carrying the strongest signal, which means reliability work should be assigned by category ownership instead of treating the suite as one undifferentiated problem.
The broader daily suite is carrying more instability than smoke, which usually means product regressions are escaping into wider coverage areas even when the narrow deploy gate looks acceptable.

Recommended Actions

Split incomplete execution failures from real assertion failures in the report narrative. Setup breakage should stay visible, but it should not look identical to a product regression in the executive readout.
Assign one owner to Frontend for the next cycle and expect a short written burn-down: top failing tests, suspected root causes, flake versus regression breakdown, and what gets fixed or quarantined first.
Treat the daily regression suite like an operations queue until it is calm again: triage failures after each red run, close known-noise items fast, and avoid letting multiple unrelated red signals pile up between runs.
Put University smoke under closer guardrails for the next release cycle. It is the best place to improve first-pass deploy confidence quickly.

Improvement Ideas

Introduce a small reliability budget for tests: every flaky or quarantined case needs an owner and an expiry, and the team should review that budget weekly the same way it reviews bugs or incidents.
Track first-fail to root-cause time as a core metric. Fast diagnosis is as important as raw pass rate because the practical value of a test gate depends on how quickly it helps the team recover.
Define a runtime budget per suite and require justification when test count or duration grows. Reliable feedback systems stay trusted when they remain both stable and proportionate.

Category Execution Ratios

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Example: if Billing executed 800 times across the week and 2 of those executions failed, Billing strict failure ratio is 0.25%. That does not mean 0.25% of pipelines failed; it means 0.25% of observed Billing executions ended in failed.

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Example: if Billing executed 800 times across the week and 2 of those executions failed, Billing strict failure ratio is 0.25%. That does not mean 0.25% of pipelines failed; it means 0.25% of observed Billing executions ended in failed.

Strict Failure Ratio

Share of category executions that ended in failed across all daily runs in this window.

Billing1.32%

Web0.04%

Frontend4.75%

Library3.82%

University0.00%

Subscriptions0.00%

Admission0.00%

Social3.97%

Non-pass Ratio

Share of category executions that ended in failed, pending, or skipped across all daily runs in this window.

Billing14.29%

Web0.17%

Frontend13.77%

Library14.29%

University0.00%

Subscriptions0.00%

Admission0.00%

Social16.67%

Category Aggregate Table

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Example: if Billing executed 800 times across the week and 2 of those executions failed, Billing strict failure ratio is 0.25%. That does not mean 0.25% of pipelines failed; it means 0.25% of observed Billing executions ended in failed.

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Example: if Billing executed 800 times across the week and 2 of those executions failed, Billing strict failure ratio is 0.25%. That does not mean 0.25% of pipelines failed; it means 0.25% of observed Billing executions ended in failed.

Category	Total	Failed	Pending	Skipped	Failure Ratio	Non-pass Ratio	Runs With Failures
Billing	756	10	0	98	1.32%	14.29%	1
Web	2356	1	0	3	0.04%	0.17%	1
Frontend	1873	89	0	169	4.75%	13.77%	7
Library	602	23	0	63	3.82%	14.29%	1
University	0	0	0	0	0.00%	0.00%	7
Subscriptions	0	0	0	0	0.00%	0.00%	7
Admission	0	0	0	0	0.00%	0.00%	7
Social	126	5	6	10	3.97%	16.67%	1

Billing

Pend 0Skip 98Runs 1

10

1.32%

14.29%

756

Web

Pend 0Skip 3Runs 1

1

0.04%

0.17%

2356

Frontend

Pend 0Skip 169Runs 7

89

4.75%

13.77%

1873

Library

Pend 0Skip 63Runs 1

23

3.82%

14.29%

602

University

Pend 0Skip 0Runs 7

0

0.00%

0

Subscriptions

Pend 0Skip 0Runs 7

0

0.00%

0

Admission

Pend 0Skip 0Runs 7

0

0.00%

0

Social

Pend 6Skip 10Runs 1

5

3.97%

16.67%

126

Recent Runs

Recent Daily Suite Runs

Date	Pipeline	Suites	Status	Summary
2026-05-16 18:27	156178	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 1273 \| Passed 1268 \| Failed 4 \| Pending 1 \| Incomplete suite counts
2026-05-17 18:27	156183	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 1273 \| Passed 1269 \| Failed 3 \| Pending 1 \| Incomplete suite counts
2026-05-18 18:26	156473	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 497 \| Passed 491 \| Failed 5 \| Pending 1 \| Incomplete suite counts
2026-05-19 18:28	156720	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 1273 \| Passed 1267 \| Failed 5 \| Pending 1 \| Incomplete suite counts
2026-05-20 18:28	156907	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 497 \| Passed 493 \| Failed 3 \| Pending 1 \| Incomplete suite counts
2026-05-21 18:24	157062	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 450 \| Passed 449 \| Failed 2 \| Pending 1 \| Incomplete suite counts
2026-05-22 18:11	157199	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 450 \| Passed 3 \| Failed 106 \| Incomplete suite counts

2026-05-16 18:27Pipeline 156178BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 1273 | P 1268 | F 4 | Pend 1 | Incomplete

2026-05-17 18:27Pipeline 156183BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 1273 | P 1269 | F 3 | Pend 1 | Incomplete

2026-05-18 18:26Pipeline 156473BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 497 | P 491 | F 5 | Pend 1 | Incomplete

2026-05-19 18:28Pipeline 156720BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 1273 | P 1267 | F 5 | Pend 1 | Incomplete

2026-05-20 18:28Pipeline 156907BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 497 | P 493 | F 3 | Pend 1 | Incomplete

2026-05-21 18:24Pipeline 157062BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 450 | P 449 | F 2 | Pend 1 | Incomplete

2026-05-22 18:11Pipeline 157199BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 450 | P 3 | F 106 | Pend 0 | Incomplete

Recent Smoke Attempts

Date	Suite	Pipeline	Job	Status	Passed	Failed	Duration
2026-05-18 15:02	Frontend	156306	Frontend smoke	PASSED	110	0	3m 08s
2026-05-18 17:08	Frontend	156455	Frontend smoke	PASSED	110	0	3m 12s
2026-05-18 22:21	Frontend	156479	Frontend smoke	PASSED	110	0	3m 09s
2026-05-18 23:10	Frontend	156481	Frontend smoke	PASSED	110	0	3m 20s
2026-05-19 12:54	Frontend	156608	Frontend smoke	FAILED	80	1	5m 01s
2026-05-19 12:59	Frontend	156608	Frontend smoke	PASSED	110	0	4m 02s
2026-05-20 23:05	Frontend	156608	Frontend smoke	PASSED	110	0	3m 31s
2026-05-21 14:43	University	157003	University smoke	PASSED	60	0	2m 13s
2026-05-21 14:50	Frontend	157003	Frontend smoke	PASSED	110	0	3m 49s
2026-05-21 16:13	University	157030	University smoke	PASSED	60	0	2m 13s
2026-05-21 16:19	Frontend	157030	Frontend smoke	PASSED	110	0	3m 06s
2026-05-22 15:37	University	157139	University smoke	PASSED	60	0	2m 10s
2026-05-22 15:41	Frontend	157139	Frontend smoke	PASSED	110	0	3m 08s
2026-05-22 16:56	University	157198	University smoke	FAILED	59	1	3m 22s
2026-05-22 16:56	Frontend	157198	Frontend smoke	PASSED	110	0	3m 53s
2026-05-22 16:59	University	157198	University smoke	PASSED	60	0	2m 25s

2026-05-18 15:02FrontendPipeline 156306Job Frontend smoke

PASSED

P 110 | F 0 | 3m 08s

2026-05-18 17:08FrontendPipeline 156455Job Frontend smoke

PASSED

P 110 | F 0 | 3m 12s

2026-05-18 22:21FrontendPipeline 156479Job Frontend smoke

PASSED

P 110 | F 0 | 3m 09s

2026-05-18 23:10FrontendPipeline 156481Job Frontend smoke

PASSED

P 110 | F 0 | 3m 20s

2026-05-19 12:54FrontendPipeline 156608Job Frontend smoke

FAILED

P 80 | F 1 | 5m 01s

2026-05-19 12:59FrontendPipeline 156608Job Frontend smoke

PASSED

P 110 | F 0 | 4m 02s

2026-05-20 23:05FrontendPipeline 156608Job Frontend smoke

PASSED

P 110 | F 0 | 3m 31s

2026-05-21 14:43UniversityPipeline 157003Job University smoke

PASSED

P 60 | F 0 | 2m 13s

2026-05-21 14:50FrontendPipeline 157003Job Frontend smoke

PASSED

P 110 | F 0 | 3m 49s

2026-05-21 16:13UniversityPipeline 157030Job University smoke

PASSED

P 60 | F 0 | 2m 13s

2026-05-21 16:19FrontendPipeline 157030Job Frontend smoke

PASSED

P 110 | F 0 | 3m 06s

2026-05-22 15:37UniversityPipeline 157139Job University smoke

PASSED

P 60 | F 0 | 2m 10s

2026-05-22 15:41FrontendPipeline 157139Job Frontend smoke

PASSED

P 110 | F 0 | 3m 08s

2026-05-22 16:56UniversityPipeline 157198Job University smoke

FAILED

P 59 | F 1 | 3m 22s

2026-05-22 16:56FrontendPipeline 157198Job Frontend smoke

PASSED

P 110 | F 0 | 3m 53s

2026-05-22 16:59UniversityPipeline 157198Job University smoke

PASSED

P 60 | F 0 | 2m 25s

Smoke Suite Breakdown

Frontend

11 attempts across 9 pipelines

91% green

Passed10

Failed1

Incomplete0

Avg runtime3m 34s

Median passing runtime3m 16s

Pipelines9

University

5 attempts across 4 pipelines

80% green

Passed4

Failed1

Incomplete0

Avg runtime2m 28s

Median passing runtime2m 13s

Pipelines4