Engineering Council Test Reliability Report

Executive Snapshot

4

Daily Runs

0/4

Daily Green

20m 05s

Avg Daily Runtime

12

Smoke Attempts

11/12

Smoke Green

3m 04s

Avg Smoke Runtime

3m 07s

Median Smoke Time

0

Current Green Streak

Executive Analysis

Bottom line: the regression system is informative but not calm. The data suggest repeatable problem areas rather than random breakage, which means focused ownership should move the needle quickly.

What Matters

Daily regression passed 0 of 4 runs (0.0%), with a current green streak of 0 and a best streak of 0 in this window. The latest daily run (156175) failed, so the system is ending the week under tension rather than in a clean state. 2 failed run(s) never reached complete daily-suite counts, which points to some infrastructure or setup noise mixed into the product signal.
Smoke passed 11 of 12 attempts (91.7%) across 8 production pipelines. 1 pipeline(s) recovered on rerun, which is useful for continuity but also a sign that first-pass deploy signal is noisier than it should be.
Failure concentration is not random: Social has the highest strict failure ratio at 33.33%, while Social has the broadest non-pass footprint at 88.89%.
University is the weakest smoke surface in this window at 3/4 green (75.0%).
Daily-suite runtime averaged 20m 05s, while observed daily test volume moved from 1,255 to 1,273.

Engineering Analysis

A release gate should fail loudly for product regressions and quietly for infrastructure noise. Rerun recoveries plus incomplete daily or smoke attempts suggest those two failure modes are still partially mixed together.
The failure profile is concentrated enough to act on. Social and Social are carrying the strongest signal, which means reliability work should be assigned by category ownership instead of treating the suite as one undifferentiated problem.
The broader daily suite is carrying more instability than smoke, which usually means product regressions are escaping into wider coverage areas even when the narrow deploy gate looks acceptable.

Recommended Actions

Split incomplete execution failures from real assertion failures in the report narrative. Setup breakage should stay visible, but it should not look identical to a product regression in the executive readout.
Assign one owner to Social for the next cycle and expect a short written burn-down: top failing tests, suspected root causes, flake versus regression breakdown, and what gets fixed or quarantined first.
Treat the daily regression suite like an operations queue until it is calm again: triage failures after each red run, close known-noise items fast, and avoid letting multiple unrelated red signals pile up between runs.
Put University smoke under closer guardrails for the next release cycle. It is the best place to improve first-pass deploy confidence quickly.

Improvement Ideas

Introduce a small reliability budget for tests: every flaky or quarantined case needs an owner and an expiry, and the team should review that budget weekly the same way it reviews bugs or incidents.
Track first-fail to root-cause time as a core metric. Fast diagnosis is as important as raw pass rate because the practical value of a test gate depends on how quickly it helps the team recover.
Define a runtime budget per suite and require justification when test count or duration grows. Reliable feedback systems stay trusted when they remain both stable and proportionate.

Category Execution Ratios

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Example: if Billing executed 800 times across the week and 2 of those executions failed, Billing strict failure ratio is 0.25%. That does not mean 0.25% of pipelines failed; it means 0.25% of observed Billing executions ended in failed.

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Example: if Billing executed 800 times across the week and 2 of those executions failed, Billing strict failure ratio is 0.25%. That does not mean 0.25% of pipelines failed; it means 0.25% of observed Billing executions ended in failed.

Strict Failure Ratio

Share of category executions that ended in failed across all daily runs in this window.

Billing2.31%

Web8.86%

Frontend9.25%

Library6.69%

University0.00%

Subscriptions0.00%

Admission0.00%

Social33.33%

Non-pass Ratio

Share of category executions that ended in failed, pending, or skipped across all daily runs in this window.

Billing25.00%

Web49.74%

Frontend27.14%

Library25.00%

University0.00%

Subscriptions0.00%

Admission0.00%

Social88.89%

Category Aggregate Table

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Example: if Billing executed 800 times across the week and 2 of those executions failed, Billing strict failure ratio is 0.25%. That does not mean 0.25% of pipelines failed; it means 0.25% of observed Billing executions ended in failed.

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Example: if Billing executed 800 times across the week and 2 of those executions failed, Billing strict failure ratio is 0.25%. That does not mean 0.25% of pipelines failed; it means 0.25% of observed Billing executions ended in failed.

Category	Total	Failed	Skipped	Failure Ratio	Non-pass Ratio	Runs With Failures
Billing	432	10	98	2.31%	25.00%	1
Web	1568	139	641	8.86%	49.74%	1
Frontend	1124	104	201	9.25%	27.14%	4
Library	344	23	63	6.69%	25.00%	1
University	0	0	0	0.00%	0.00%	2
Subscriptions	0	0	0	0.00%	0.00%	2
Admission	0	0	0	0.00%	0.00%	2
Social	36	12	20	33.33%	88.89%	2

Billing

Pend 0Skip 98Runs 1

10

2.31%

25.00%

432

Web

Pend 0Skip 641Runs 1

139

8.86%

49.74%

1568

Frontend

Pend 0Skip 201Runs 4

104

9.25%

27.14%

1124

Library

Pend 0Skip 63Runs 1

23

6.69%

25.00%

344

University

Pend 0Skip 0Runs 2

0

0.00%

0

Subscriptions

Pend 0Skip 0Runs 2

0

0.00%

0

Admission

Pend 0Skip 0Runs 2

0

0.00%

0

Social

Pend 0Skip 20Runs 2

12

33.33%

88.89%

36

Recent Runs

Recent Daily Suite Runs

Date	Pipeline	Suites	Status	Summary
2026-05-12 18:26	155661	BillingWebFrontendLibrary	FAILED	Total 1255 \| Passed 1244 \| Failed 11
2026-05-13 18:27	155895	BillingWebFrontendLibrary	FAILED	Total 479 \| Passed 468 \| Failed 11
2026-05-14 18:28	155995	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 497 \| Passed 481 \| Failed 6 \| Incomplete suite counts
2026-05-15 18:11	156175	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 1273 \| Passed 0 \| Failed 260 \| Incomplete suite counts

2026-05-12 18:26Pipeline 155661BillingWebFrontendLibrary

FAILED

T 1255 | P 1244 | F 11 | Pend 0

2026-05-13 18:27Pipeline 155895BillingWebFrontendLibrary

FAILED

T 479 | P 468 | F 11 | Pend 0

2026-05-14 18:28Pipeline 155995BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 497 | P 481 | F 6 | Pend 0 | Incomplete

2026-05-15 18:11Pipeline 156175BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 1273 | P 0 | F 260 | Pend 0 | Incomplete

Recent Smoke Attempts

Date	Suite	Pipeline	Job	Status	Passed	Failed	Duration
2026-05-10 15:16	Frontend	155320	Frontend smoke	PASSED	110	0	3m 59s
2026-05-10 15:44	Frontend	155324	Frontend smoke	PASSED	110	0	3m 40s
2026-05-11 11:13	Frontend	155377	Frontend smoke	PASSED	110	0	3m 04s
2026-05-11 16:34	Frontend	155489	Frontend smoke	PASSED	110	0	3m 10s
2026-05-12 16:02	University	155636	University smoke	PASSED	60	0	2m 14s
2026-05-12 16:07	Frontend	155636	Frontend smoke	PASSED	110	0	3m 07s
2026-05-14 07:48	University	155903	University smoke	PASSED	60	0	2m 13s
2026-05-14 07:53	Frontend	155903	Frontend smoke	PASSED	110	0	3m 07s
2026-05-14 14:26	University	155955	University smoke	FAILED	57	3	3m 11s
2026-05-14 14:30	Frontend	155955	Frontend smoke	PASSED	110	0	3m 11s
2026-05-14 16:32	University	155955	University smoke	PASSED	60	0	2m 25s
2026-05-15 12:20	Frontend	156034	Frontend smoke	PASSED	110	0	3m 26s

2026-05-10 15:16FrontendPipeline 155320Job Frontend smoke

PASSED

P 110 | F 0 | 3m 59s

2026-05-10 15:44FrontendPipeline 155324Job Frontend smoke

PASSED

P 110 | F 0 | 3m 40s

2026-05-11 11:13FrontendPipeline 155377Job Frontend smoke

PASSED

P 110 | F 0 | 3m 04s

2026-05-11 16:34FrontendPipeline 155489Job Frontend smoke

PASSED

P 110 | F 0 | 3m 10s

2026-05-12 16:02UniversityPipeline 155636Job University smoke

PASSED

P 60 | F 0 | 2m 14s

2026-05-12 16:07FrontendPipeline 155636Job Frontend smoke

PASSED

P 110 | F 0 | 3m 07s

2026-05-14 07:48UniversityPipeline 155903Job University smoke

PASSED

P 60 | F 0 | 2m 13s

2026-05-14 07:53FrontendPipeline 155903Job Frontend smoke

PASSED

P 110 | F 0 | 3m 07s

2026-05-14 14:26UniversityPipeline 155955Job University smoke

FAILED

P 57 | F 3 | 3m 11s

2026-05-14 14:30FrontendPipeline 155955Job Frontend smoke

PASSED

P 110 | F 0 | 3m 11s

2026-05-14 16:32UniversityPipeline 155955Job University smoke

PASSED

P 60 | F 0 | 2m 25s

2026-05-15 12:20FrontendPipeline 156034Job Frontend smoke

PASSED

P 110 | F 0 | 3m 26s

Smoke Suite Breakdown

Frontend

8 attempts across 8 pipelines

100% green

Passed8

Failed0

Incomplete0

Avg runtime3m 20s

Median passing runtime3m 11s

Pipelines8

University

4 attempts across 3 pipelines

75% green

Passed3

Failed1

Incomplete0

Avg runtime2m 31s

Median passing runtime2m 14s

Pipelines3