KumoKodo.ai Case Study
Building Aether AI Support
How we built a modern, AI-powered customer support platform from the ground up using cutting-edge technologies and cloud-native architecture.
The Challenge
Traditional customer support software suffers from several critical limitations. We set out to build a platform that leverages AI to solve these problems while remaining affordable and easy to implement.
- Slow response times leading to customer frustration and churn
- Repetitive queries consuming valuable agent time
- Lack of intelligent routing causing mishandled tickets
- Disconnected systems creating data silos
- Complex pricing making it inaccessible for SMBs
Our Solution
Aether AI Support combines the latest in AI technology with modern web development practices to deliver a transformative support experience.
AI-First Architecture
We built a Retrieval-Augmented Generation (RAG) pipeline using OpenAI GPT-4o-mini and ChromaDB vector embeddings. The AI understands context, retrieves relevant knowledge base articles, and generates accurate, helpful responses in real-time.
Real-Time Collaboration
Using WebSockets and Server-Sent Events (SSE), we implemented instant updates across the entire platform. Agents see live ticket changes, typing indicators, and presence status without page refreshes.
Multi-Tenant SaaS Design
The platform supports unlimited companies with complete data isolation. Each tenant can customize their widget, knowledge base, SLA rules, and integrations independently.
Serverless Infrastructure
By deploying the backend on Google Cloud Run and frontend on Vercel, we achieved automatic scaling, zero cold-start latency, and 99.95% uptime without managing servers.
Technical Deep Dive
Backend Architecture
We chose FastAPI for its exceptional performance (benchmarked at 40,000+ requests/second) and native async support. The API is fully typed with Pydantic models, providing automatic validation and OpenAPI documentation.
- Service-oriented design with dependency injection
- Background task processing with asyncio
- Structured logging with correlation IDs
- Rate limiting and quota enforcement
AI & Machine Learning
The RAG pipeline processes incoming queries through multiple stages:
- 1.Query Analysis: Intent detection and entity extraction
- 2.Retrieval: Hybrid semantic + keyword search across knowledge base
- 3.Context Assembly: Relevance scoring and context window optimization
- 4.Generation: GPT-4o-mini with custom system prompts per company
- 5.Post-processing: Confidence scoring and escalation detection
Frontend Engineering
Next.js 14 with the App Router provides server-side rendering, streaming responses, and optimal bundle splitting. We implemented:
- Context-based theming system (light/dark/gradients)
- Framer Motion animations for smooth UX
- Keyboard shortcut system with Ctrl+K command palette
- Optimistic UI updates with SWR caching
- Responsive design with Tailwind CSS
Database Design
Google Firestore provides automatic scaling and real-time sync capabilities. Our data model includes:
- Companies: Multi-tenant configuration and settings
- Tickets: Support requests with full history
- Messages: Conversation threads with metadata
- Knowledge Base: Articles, FAQs, and embeddings
- Users: Agents, admins, and customers
Results & Metrics
Lessons Learned
Start with the AI, not the UI
We built the RAG pipeline and AI capabilities first, then designed the interface around them. This ensured the AI was a core feature, not an afterthought.
Invest in real-time from day one
Retrofitting WebSocket support is painful. By designing for real-time updates from the start, we avoided technical debt and delivered a more responsive experience.
Multi-tenancy requires early planning
Every database query, API route, and frontend component needed tenant awareness. Planning this architecture upfront saved months of refactoring.
Type everything
TypeScript on the frontend and Pydantic on the backend caught countless bugs before they reached production. Strong typing is non-negotiable for complex applications.
About KumoKodo.ai
We specialize in building AI-powered SaaS applications using modern cloud-native technologies. Our team combines expertise in machine learning, full-stack development, and user experience design to create products that solve real business problems.
Want to Try Aether AI Support?
Start your 14-day free trial and experience AI-powered support. Or contact us to discuss building a similar platform for your business.