How We Built an AI System That Automates Senior Solution Architect Workflows
Discover how we built a 4-agent AI co-pilot that converts complex RFPs into draft technical proposals in 15 minutes — with built-in conflict detection, assumption surfacing, and confidence scoring.
Author

Date

Book a call
Table of Contents
30% of a senior architect's time goes into reading RFPs and writing proposals rather than designing systems or solving engineering problems.
A 50-page enterprise RFP lands in an inbox. Two hours later, the architect has concluded what they suspected on page three: React Native frontend, Node backend, PostgreSQL, and Stripe for payments. The decision was predictable, but the process was manual and time-consuming.
What the AI Presales Co-Pilot Does
This system assists with the presales cycle. You upload a client's documents—PDFs, Word docs, Excel sheets, or emails—and the system produces a draft technical proposal that an architect can review and refine in 15 minutes instead of writing from scratch for 3 hours.
The value extends beyond document generation to reasoning:
- Detects Contradictions: It flags tensions, such as a request for Native Mobile only paired with Critical SEO.
- Surfaces Assumptions: It states the assumptions it is making (e.g., assuming Stripe for payments) so an architect can confirm or override them.
- Confidence Scoring: It scores its own decisions, indicating where reviewers should focus attention.

How It Works: Four Specialized Agents
The system runs four specialized AI agents in sequence. The entire pipeline runs over a WebSocket connection, streaming progress to the dashboard in real time.

Agent 1: Requirement Interpreter
This agent parses raw, unstructured documents using format-specific tools (PyMuPDF, pandas, etc.). It extracts features, technical constraints, and non-functional requirements.
- Pattern Enrichment: If a two-sided marketplace is mentioned, the system suggests features like ratings and booking based on a built-in knowledge base.
- Conflict Detection: A rule engine identifies over-engineering or technical mismatches during initial analysis.
Agent 2: Solution Engine
This agent produces tech stack recommendations and effort estimates. It uses a hybrid approach: deterministic rules suggest the baseline stack (e.g., NestJS/PostgreSQL), while the LLM validates that choice against the specific client context. Effort estimation provides optimistic-to-pessimistic ranges (e.g., "120–210 hours") to reflect real-world uncertainty.
Agent 3: Self-Critique
Before a proposal is written, a critique agent reviews the solution for consistency. If an assumption contradicts a user input, the system sends a clarifying question back to the architect via WebSockets. Only when the critique approves does the pipeline proceed to generation.
Agent 4: Proposal Composer


The Pipeline at a Glance
| Stage | Role | Output |
|---|---|---|
| 1. Parse | Ingestion of any file type | Combined raw text |
| 2. Extract | Conflict & pattern detection | Structured requirements JSON |
| 3. Design | Rule-based stack matching | Solution + effort mapping |
| 4. Review | Validation & User Clarification | Approved technical solution |
| 5. Write | Template-based generation | Polished Markdown/PDF proposal |
Engineering Insights
Rule-based systems and LLMs complement each other: Conflict detection and effort mapping do not need an LLM—they need deterministic rules. LLMs handle ambiguity and prose generation.
The Technical Stack
- Backend: Python with FastAPI for async orchestration.
- AI Layer: OpenRouter (Claude, GPT-4, and Gemini) for multi-model reasoning.
- Communication: WebSockets for real-time streaming to a React-based frontend.
- Knowledge Base: JSON-based datasets for requirement patterns and stack decision rules.

Related Articles.
More from the engineering frontline.
Dive deep into our research and insights on design, development, and the impact of various trends to businesses.

Apr 6, 2026
AI Code Healer for Fixing Broken CI/CD Builds Fast
A deep dive into how GeekyAnts built an AI-powered Code Healer that analyzes CI/CD failures, summarizes logs, and generates code-level fixes to keep development moving.

Apr 6, 2026
AI-Powered Global Watchtower for Supply Chain Risk Management
An AI-powered platform that monitors global supply chain risks in real time, detecting threats across weather, news, and shipping before they disrupt operations.

Apr 2, 2026
A Real-Time AI Fraud Decision Engine Under 50ms
A deep dive into how GeekyAnts built a real-time AI fraud detection system that evaluates transactions in milliseconds using a hybrid multi-agent approach.

Apr 1, 2026
Building an Autonomous Multi-Agent Fraud Detection System in Under 200ms
GeekyAnts built a 5-agent fraud detection pipeline that makes decisions in under 200ms — 15x cheaper than single-model systems, with full explainability built in.

Mar 31, 2026
Building a Self-Healing CI/CD System with an AI Agent
When code breaks a pipeline, developers have to stop working and figure out why. This blog shows how an AI agent reads the error, finds the fix, and submits it for review all on its own.

Mar 26, 2026
Maestro Automation Framework — Advanced to Expert
Master Maestro at scale. Learn architecture, reusable flows, CI/CD optimization, and how to eliminate flakiness in production-grade mobile automation.Master Maestro at scale. Learn architecture, reusable flows, CI/CD optimization, and how to eliminate flakiness in production-grade mobile automation.