AI Agents as Janitors: A Minimum Functional Platform for the SDLC

Mark Kendall
6 days ago
3 min read

AI Agents as Janitors: A Minimum Functional Platform for the SDLC

Executive Summary

This is a proposal for a shared, event-driven “janitor” automation service that helps engineering teams keep repositories clean, consistent, and compliant—without changing how features are built or shipped.

The system responds only to explicit triggers, makes small, bounded changes, validates them through existing CI pipelines, and proposes pull requests for human approval.

It is not autonomous development—it is disciplined cleanup at scale.

The Problem We’re Actually Solving

Modern engineering teams don’t struggle because they can’t write features.

They struggle because maintenance work piles up faster than humans can handle it.

Across large organizations, teams repeatedly lose time to:

dependency upgrades
repetitive refactors
enforcing standards by hand
policy drift across repositories
low-value cleanup work done by senior engineers

These tasks are necessary, but they don’t differentiate the business—and they distract engineers from solving real problems.

Why “Agent-First” Development Fails

The industry’s current narrative suggests AI agents should continuously improve codebases.

In practice, that leads to:

endless PRs
constant CI churn
reviewer fatigue
unstable release cadence
teams afraid to ship because the system is always changing

This proposal explicitly rejects that model.

If an AI system is “always running,” it’s already wrong.

The Janitor Model (What Actually Works)

In this model, AI agents behave like janitors:

they only work when asked
they receive explicit instructions
they operate within strict boundaries
they stop when the task is complete

No curiosity.

No creative rewrites.

No “improvements” that weren’t requested.

How the System Works (High Level)

An event occurs
- a Jira ticket reaches “Ready”
- a security or policy alert fires
- a dependency update is required
- a maintenance window opens
Context is pulled safely
- Jira, GitHub, and observability tools are accessed through MCP services
- MCPs enforce least privilege and data filtering
- agents never see credentials directly
A bounded plan is generated
- exactly what files can change
- exactly what tests must run
- no scope expansion allowed
A small code change is proposed
- patches are generated
- existing CI pipelines validate the result
- no pipelines are replaced or bypassed
Humans approve
- a PR is opened with an evidence pack
- engineers review and merge on their terms

Where the Intelligence Actually Lives

The intelligence is not in the agent.

It lives in:

architectural rules
quality gates
organizational standards
definitions of “done”

The agents simply apply those rules consistently.

This is why the system scales across:

repositories
teams
programming languages
organizational boundaries

What This Is Not

This platform does not:

write features
control deployments
auto-merge code
continuously refactor
replace developers or architects

Those ideas sound impressive in demos and fail in production.

Why This Saves Money (Without Reducing Headcount)

Organizations don’t save money by replacing engineers.

They save money by:

reducing rework
minimizing review fatigue
preventing standards drift
keeping senior engineers focused on high-value design work

You don’t pay janitors like architects—but you also don’t ask architects to take out the trash.

The Minimum Functional Product (MFP)

A realistic first version includes:

Event-driven triggers only
One or two high-ROI cleanup tasks
One repository type
Centralized policy enforcement
Human-approved PRs
Full auditability

No moonshots.

No platform rewrite.

No disruption to the SDLC.

The Real End Game

The goal is not “AI-first development.”

The goal is:

cleaner repositories
fewer low-value tasks
consistent standards
healthier delivery cadence

AI agents don’t need to be heroes to be valuable.

They just need to quietly clean up, on schedule, and get out of the way.

What’s Next

The next step is to define a minimum functional implementation—tools, staffing, timeline, and cost—that a real company could build, operate, and trust as a shared service.

No hype.

No fantasies.

Just a practical platform improvement.

AI Agents as Janitors: A Minimum Functional Platform for the SDLC

Recent Posts

Comments

Subscribe Form