CI Failure Triage Bot
Separate flakes from regressions and route the fix
Turn CI failures into categorized, actionable tickets by detecting flaky behavior, extracting failure fingerprints, and recommending next actions with ownership.
PROMPT
Create a skill called "CI Failure Triage Bot". Given a CI job link or logs, the skill must:
- Detect whether this failure is likely flaky (use rerun outcomes if available)
- Extract a stable failure fingerprint (error lines, stack trace hash)
- Classify into: flaky test, infra/network, dependency resolution, deterministic regression
- Recommend next actions and propose an owner/team
- Output a GitHub issue-ready summary (title, repro, evidence, next steps)
Keep it concise but evidence-driven.
How It Works
This recipe classifies each failure as a flaky test, an infra/network issue, a dependency
resolution problem, or a deterministic code regression, then proposes a fix and an owner.
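The four-way classification can be sketched as a small pattern matcher over the log text. The pattern set and category strings below are illustrative assumptions, not a prescribed rule set; a real bot would maintain a tuned, repo-specific list:

```python
import re

# Illustrative patterns per category (assumption: not from the source recipe).
# "deterministic regression" is the fallback when nothing else matches.
CATEGORY_PATTERNS = {
    "infra/network": [
        r"connection (reset|refused|timed out)",
        r"503 service unavailable",
    ],
    "dependency resolution": [
        r"could not resolve dependency",
        r"checksum mismatch",
    ],
    "flaky test": [
        r"timed? ?out waiting for",
        r"port already in use",
    ],
}

def classify(log_text: str) -> str:
    """Return the first category whose pattern matches; default to regression."""
    lowered = log_text.lower()
    for category, patterns in CATEGORY_PATTERNS.items():
        if any(re.search(p, lowered) for p in patterns):
            return category
    return "deterministic regression"

print(classify("ERROR: connection timed out while fetching artifact"))
```

Ordering matters: infra patterns are checked before flake patterns so a network timeout is not misread as a slow test.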
Triggers
- CI fails intermittently and reruns sometimes pass
- Teams routinely "just rerun CI"
- CI failures block PR merges for hours or days
Steps
- Collect the failing job, logs, and previous run history.
- Determine "flake likelihood":
- the outcome differs after a rerun with no code change,
- the failure signature matches known flaky patterns.
- Categorize:
- test flake,
- infra/network,
- dependency resolution,
- true regression.
- Produce an action plan:
- quarantine + root cause for flakes,
- caching/fetch retry for dependencies,
- code fix + regression test for true failures.
- Open (or update) a tracking issue with the failure fingerprint and ownership.
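The fingerprint and issue-summary steps above can be sketched as follows. The helper names and the normalization rules (masking timestamps, addresses, and line numbers so reruns of the same failure hash identically) are assumptions for illustration, not a prescribed format:

```python
import hashlib
import re

def fingerprint(log_text: str) -> str:
    """Hash error lines after stripping run-specific noise, so the same
    underlying failure yields the same fingerprint across reruns."""
    error_lines = [l for l in log_text.splitlines()
                   if re.search(r"error|exception|failed", l, re.I)]
    normalized = []
    for line in error_lines:
        line = re.sub(r"0x[0-9a-f]+", "0xADDR", line)                    # memory addresses
        line = re.sub(r"\d{4}-\d{2}-\d{2}[T ][\d:.]+Z?", "TIME", line)   # timestamps
        line = re.sub(r":\d+", ":LINE", line)                            # source line numbers
        normalized.append(line.strip())
    return hashlib.sha256("\n".join(normalized).encode()).hexdigest()[:12]

def issue_summary(category: str, fp: str, evidence: list[str]) -> str:
    """Render a GitHub-issue-ready body: title line, evidence, next steps."""
    lines = [f"[CI triage] {category} (fingerprint {fp})", "", "Evidence:"]
    lines += [f"- {e}" for e in evidence]
    lines += ["", "Next steps: follow the action plan for this category."]
    return "\n".join(lines)
```

Because the fingerprint is stable, "open or update" becomes a lookup: search existing issues for the fingerprint string and append evidence instead of filing a duplicate.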
Expected Outcome
- CI failures stop being "mystery events" and become routable work.
- Reduced wasted time rerunning pipelines and reading logs manually.
Example Inputs
- "This PR fails on CI only sometimes; here's the job URL."
- "Nightly main pipeline shows 10 flaky tests."
- "Dependency download times out randomly."
Tips
- Rerun is a signal, not a solution: capture the delta and classify the cause.
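The rerun delta from this tip can be captured mechanically. A minimal sketch, assuming run history is available as (commit_sha, passed) pairs in chronological order (that input shape is an assumption, not from the source):

```python
def flake_likelihood(runs: list[tuple[str, bool]]) -> float:
    """Fraction of same-commit rerun pairs whose outcome flipped.

    A flip between two runs of the identical commit means the result changed
    with no code change: the core flake signal this recipe relies on.
    """
    same_commit_pairs = flips = 0
    for (sha_a, passed_a), (sha_b, passed_b) in zip(runs, runs[1:]):
        if sha_a == sha_b:              # rerun without a code change
            same_commit_pairs += 1
            if passed_a != passed_b:    # outcome flipped between reruns
                flips += 1
    return flips / same_commit_pairs if same_commit_pairs else 0.0

history = [("abc", False), ("abc", True), ("abc", True), ("def", False)]
print(flake_likelihood(history))  # 0.5: one flip across two same-commit reruns
```

A score near 0.0 with consistent failures on one commit points at a deterministic regression; anything well above zero is evidence for quarantine plus root-cause work.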