Engineering Council Test Reliability Report

Executive Snapshot

Daily Runs

0/7

Daily Green

20m 40s

Avg Daily Runtime

Smoke Attempts

9/26

Smoke Green

2m 17s

Avg Smoke Runtime

4m 34s

Median Smoke Time

Current Green Streak

Executive Analysis

Bottom line: release confidence is unstable in both the broad regression path and the deploy smoke path. The immediate job is to separate real product regressions from execution noise, then burn down the concentrated failure clusters.

What Matters

Daily regression passed 0 of 7 runs (0.0%), with a current green streak of 0 and a best streak of 0 in this window. The latest daily run (158268) failed, so the system is ending the week under tension rather than in a clean state. 5 failed run(s) never reached complete daily-suite counts, which points to some infrastructure or setup noise mixed into the product signal.
Smoke passed 9 of 26 attempts (34.6%) across 14 production pipelines. 3 pipeline(s) recovered on rerun, which is useful for continuity but also a sign that first-pass deploy signal is noisier than it should be. 15 failed attempt(s) never reached test execution counts at all.
Failure concentration is not random: Frontend has the highest strict failure ratio at 1.09%, while Social has the broadest non-pass footprint at 3.17%.
University is the weakest smoke surface in this window at 1/10 green (10.0%).
Daily-suite runtime averaged 20m 40s, while observed daily test volume moved from 450 to 520.

Engineering Analysis

A release gate should fail loudly for product regressions and quietly for infrastructure noise. Rerun recoveries plus incomplete daily or smoke attempts suggest those two failure modes are still partially mixed together.
The failure profile is concentrated enough to act on. Frontend and Social are carrying the strongest signal, which means reliability work should be assigned by category ownership instead of treating the suite as one undifferentiated problem.
The broader daily suite is carrying more instability than smoke, which usually means product regressions are escaping into wider coverage areas even when the narrow deploy gate looks acceptable.

Recommended Actions

Split incomplete execution failures from real assertion failures in the report narrative. Setup breakage should stay visible, but it should not look identical to a product regression in the executive readout.
Assign one owner to Frontend for the next cycle and expect a short written burn-down: top failing tests, suspected root causes, flake versus regression breakdown, and what gets fixed or quarantined first.
Treat the daily regression suite like an operations queue until it is calm again: triage failures after each red run, close known-noise items fast, and avoid letting multiple unrelated red signals pile up between runs.
Put University smoke under closer guardrails for the next release cycle. It is the best place to improve first-pass deploy confidence quickly.

Improvement Ideas

Introduce a small reliability budget for tests: every flaky or quarantined case needs an owner and an expiry, and the team should review that budget weekly the same way it reviews bugs or incidents.
Track first-fail to root-cause time as a core metric. Fast diagnosis is as important as raw pass rate because the practical value of a test gate depends on how quickly it helps the team recover.
Define a runtime budget per suite and require justification when test count or duration grows. Reliable feedback systems stay trusted when they remain both stable and proportionate.

Category Execution Ratios

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Example: if Billing executed 800 times across the week and 2 of those executions failed, Billing strict failure ratio is 0.25%. That does not mean 0.25% of pipelines failed; it means 0.25% of observed Billing executions ended in failed.

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Strict Failure Ratio

Share of category executions that ended in failed across all daily runs in this window.

Billing0.13%

Web0.00%

Frontend1.09%

Library0.00%

University0.00%

Subscriptions0.00%

Admission0.00%

Social0.00%

Non-pass Ratio

Share of category executions that ended in failed, pending, or skipped across all daily runs in this window.

Billing0.13%

Web0.00%

Frontend1.09%

Library0.00%

University0.00%

Subscriptions0.00%

Admission0.00%

Social3.17%

Category Aggregate Table

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

How computed

Category total executions means the sum of that category's observed test executions across every daily-suite run in the selected window.

Strict Failure Ratio = failed executions for that category divided by total executions for that category across the window.

Non-pass Ratio = (failed + pending + skipped) executions for that category divided by total executions for that category across the window.

Category	Total	Failed	Pending	Failure Ratio	Non-pass Ratio	Runs With Failures
Billing	756	1	0	0.13%	0.13%	1
Web	1592	0	0	0.00%	0.00%	0
Frontend	1656	18	0	1.09%	1.09%	4
Library	602	0	0	0.00%	0.00%	0
University	6	0	0	0.00%	0.00%	5
Subscriptions	4	0	0	0.00%	0.00%	3
Admission	4	0	0	0.00%	0.00%	3
Social	126	0	4	0.00%	3.17%	0

Billing

Pend 0Skip 0Runs 1

0.13%

756

Web

Pend 0Skip 0Runs 0

0.00%

1592

Frontend

Pend 0Skip 0Runs 4

1.09%

1656

Library

Pend 0Skip 0Runs 0

0.00%

602

University

Pend 0Skip 0Runs 5

0.00%

Subscriptions

Pend 0Skip 0Runs 3

0.00%

Admission

Pend 0Skip 0Runs 3

0.00%

Social

Pend 4Skip 0Runs 0

0.00%

3.17%

126

Recent Runs

Recent Daily Suite Runs

Date	Pipeline	Suites	Status	Summary
2026-05-23 18:24	157201	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 450 \| Passed 448 \| Failed 3 \| Pending 1 \| Incomplete suite counts
2026-05-24 18:25	157212	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 1273 \| Passed 1272 \| Failed 0 \| Pending 1 \| Incomplete suite counts
2026-05-25 18:35	157402	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 497 \| Passed 496 \| Failed 0 \| Pending 1 \| Incomplete suite counts
2026-05-26 18:29	157644	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 1275 \| Passed 1270 \| Failed 4 \| Pending 1 \| Incomplete suite counts
2026-05-27 18:27	157886	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 223 \| Passed 223 \| Failed 0 \| Incomplete suite counts
2026-05-28 18:12	158062	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 508 \| Passed 502 \| Failed 6
2026-05-29 18:12	158268	BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial	FAILED	Total 520 \| Passed 514 \| Failed 6

2026-05-23 18:24Pipeline 157201BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 450 | P 448 | F 3 | Pend 1 | Incomplete

2026-05-24 18:25Pipeline 157212BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 1273 | P 1272 | F 0 | Pend 1 | Incomplete

2026-05-25 18:35Pipeline 157402BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 497 | P 496 | F 0 | Pend 1 | Incomplete

2026-05-26 18:29Pipeline 157644BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 1275 | P 1270 | F 4 | Pend 1 | Incomplete

2026-05-27 18:27Pipeline 157886BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 223 | P 223 | F 0 | Pend 0 | Incomplete

2026-05-28 18:12Pipeline 158062BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 508 | P 502 | F 6 | Pend 0

2026-05-29 18:12Pipeline 158268BillingWebFrontendLibraryUniversitySubscriptionsAdmissionSocial

FAILED

T 520 | P 514 | F 6 | Pend 0

Recent Smoke Attempts

Date	Suite	Pipeline	Job	Status	Passed	Failed	Duration
2026-05-25 12:53	University	157293	University smoke	FAILED	n/a	n/a	0m 13s
2026-05-25 12:57	Frontend	157293	Frontend smoke	FAILED	n/a	n/a	0m 19s
2026-05-25 17:17	Frontend	157392	Frontend smoke	FAILED	n/a	n/a	0m 14s
2026-05-25 17:23	University	157392	University smoke	FAILED	n/a	n/a	0m 14s
2026-05-26 13:57	Frontend	157552	Frontend smoke	FAILED	n/a	n/a	0m 14s
2026-05-26 14:10	University	157558	University smoke	FAILED	n/a	n/a	0m 22s
2026-05-26 14:12	Frontend	157558	Frontend smoke	FAILED	n/a	n/a	0m 14s
2026-05-26 18:32	University	157637	University smoke	FAILED	57	3	3m 41s
2026-05-26 18:36	Frontend	157637	Frontend smoke	PASSED	110	0	3m 05s
2026-05-27 12:16	University	157763	University smoke	FAILED	57	3	3m 35s
2026-05-27 12:19	Frontend	157763	Frontend smoke	PASSED	110	0	3m 11s
2026-05-27 15:31	University	157848	University smoke	FAILED	n/a	n/a	1m 33s
2026-05-27 15:34	Frontend	157848	Frontend smoke	FAILED	n/a	n/a	1m 30s
2026-05-27 16:21	Frontend	157848	Frontend smoke	FAILED	n/a	n/a	1m 27s
2026-05-27 16:21	University	157848	University smoke	FAILED	n/a	n/a	1m 33s
2026-05-27 16:37	Frontend	157848	Frontend smoke	FAILED	n/a	n/a	1m 24s
2026-05-27 16:37	University	157848	University smoke	FAILED	n/a	n/a	1m 24s
2026-05-27 19:16	Frontend	157887	Frontend smoke	FAILED	n/a	n/a	1m 39s
2026-05-28 15:15	Frontend	158021	Frontend smoke	PASSED	110	0	5m 23s
2026-05-28 15:49	Frontend	158040	Frontend smoke	PASSED	110	0	5m 12s
2026-05-28 21:22	University	158085	University smoke	FAILED	n/a	n/a	0m 08s
2026-05-28 21:28	Frontend	158085	Frontend smoke	PASSED	110	0	4m 36s
2026-05-28 23:31	Frontend	158101	Frontend smoke	PASSED	110	0	5m 31s
2026-05-29 15:12	Frontend	158231	Frontend smoke	PASSED	110	0	4m 33s
2026-05-29 16:41	Frontend	158247	Frontend smoke	PASSED	110	0	4m 34s
2026-05-29 17:53	University	158247	University smoke	PASSED	60	0	3m 26s

2026-05-25 12:53UniversityPipeline 157293Job University smoke

FAILED

P n/a | F n/a | 0m 13s

2026-05-25 12:57FrontendPipeline 157293Job Frontend smoke

FAILED

P n/a | F n/a | 0m 19s

2026-05-25 17:17FrontendPipeline 157392Job Frontend smoke

FAILED

P n/a | F n/a | 0m 14s

2026-05-25 17:23UniversityPipeline 157392Job University smoke

FAILED

P n/a | F n/a | 0m 14s

2026-05-26 13:57FrontendPipeline 157552Job Frontend smoke

FAILED

P n/a | F n/a | 0m 14s

2026-05-26 14:10UniversityPipeline 157558Job University smoke

FAILED

P n/a | F n/a | 0m 22s

2026-05-26 14:12FrontendPipeline 157558Job Frontend smoke

FAILED

P n/a | F n/a | 0m 14s

2026-05-26 18:32UniversityPipeline 157637Job University smoke

FAILED

P 57 | F 3 | 3m 41s

2026-05-26 18:36FrontendPipeline 157637Job Frontend smoke

PASSED

P 110 | F 0 | 3m 05s

2026-05-27 12:16UniversityPipeline 157763Job University smoke

FAILED

P 57 | F 3 | 3m 35s

2026-05-27 12:19FrontendPipeline 157763Job Frontend smoke

PASSED

P 110 | F 0 | 3m 11s

2026-05-27 15:31UniversityPipeline 157848Job University smoke

FAILED

P n/a | F n/a | 1m 33s

2026-05-27 15:34FrontendPipeline 157848Job Frontend smoke

FAILED

P n/a | F n/a | 1m 30s

2026-05-27 16:21FrontendPipeline 157848Job Frontend smoke

FAILED

P n/a | F n/a | 1m 27s

2026-05-27 16:21UniversityPipeline 157848Job University smoke

FAILED

P n/a | F n/a | 1m 33s

2026-05-27 16:37FrontendPipeline 157848Job Frontend smoke

FAILED

P n/a | F n/a | 1m 24s

2026-05-27 16:37UniversityPipeline 157848Job University smoke

FAILED

P n/a | F n/a | 1m 24s

2026-05-27 19:16FrontendPipeline 157887Job Frontend smoke

FAILED

P n/a | F n/a | 1m 39s

2026-05-28 15:15FrontendPipeline 158021Job Frontend smoke

PASSED

P 110 | F 0 | 5m 23s

2026-05-28 15:49FrontendPipeline 158040Job Frontend smoke

PASSED

P 110 | F 0 | 5m 12s

2026-05-28 21:22UniversityPipeline 158085Job University smoke

FAILED

P n/a | F n/a | 0m 08s

2026-05-28 21:28FrontendPipeline 158085Job Frontend smoke

PASSED

P 110 | F 0 | 4m 36s

2026-05-28 23:31FrontendPipeline 158101Job Frontend smoke

PASSED

P 110 | F 0 | 5m 31s

2026-05-29 15:12FrontendPipeline 158231Job Frontend smoke

PASSED

P 110 | F 0 | 4m 33s

2026-05-29 16:41FrontendPipeline 158247Job Frontend smoke

PASSED

P 110 | F 0 | 4m 34s

2026-05-29 17:53UniversityPipeline 158247Job University smoke

PASSED

P 60 | F 0 | 3m 26s

Smoke Suite Breakdown

Frontend

16 attempts across 14 pipelines

50% green

Passed8

Failed8

Incomplete8

Avg runtime2m 42s

Median passing runtime4m 35s

Pipelines14

University

10 attempts across 8 pipelines

10% green

Passed1

Failed9

Incomplete7

Avg runtime1m 37s

Median passing runtime3m 26s

Pipelines8

Generated from GitLab project adservio/helm2. Times are shown in Europe/Bucharest. Daily-suite runtime is measured from GitLab pipeline and job timestamps. Category counts come from GitLab test-report JSON artifacts, with job-trace fallback when older artifacts have expired.