Richard Echols

Forward Deployed AI Engineer · Founder · Builds & Runs Local AI Models

I don't just talk about AI. I ship it, and I build and run my own local AI models on my own hardware. 26+ paying customers. $1.6M+ ARR managed. One engineer.

26+

Paying Customers

$1.6M+

ARR Managed

10,500+

Lines Shipped

USPTO

Trademark

What I Built

Kiyomi

A production AI business operating system with 26+ paying customers and a USPTO trademark. Not a side project. Not a demo. A real product people pay for.

“I didn't just learn AI — I built a business with it.”

10,500+ lines of production code. Direct LLM API integration with prompt caching. Multi-agent orchestration. Cross-platform distribution. AI-powered customer support. Built and shipped by one person.

TypeScript · Node.js · Claude API · Cloudflare Workers · D1 · Swift · MCP · Stripe
Visit kiyomibot.ai

Direct LLM Integration

Production API integration, not a wrapper. Custom prompt caching delivers 90% cost reduction.
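
As a rough illustration of how a saving of that order can arise, here is a minimal sketch of the arithmetic behind prompt caching. It is not Kiyomi's actual implementation. It assumes a provider that bills a cached prompt prefix at a discount on reads and a surcharge on the first write; the function name, the multipliers (0.1 and 1.25), and the token counts are all illustrative assumptions, and real pricing depends on the provider and model.

```python
def caching_savings(prefix_tokens, fresh_tokens, calls,
                    read_mult=0.1, write_mult=1.25):
    """Return the fractional input-cost saving from caching a shared prefix.

    Costs are measured in "base-price input tokens". The multipliers are
    assumed, illustrative values, not quoted prices.
    """
    if calls < 1:
        raise ValueError("calls must be at least 1")
    # Without caching, every call pays full price for prefix + fresh tokens.
    uncached = calls * (prefix_tokens + fresh_tokens)
    # With caching, the first call writes the prefix and later calls read it.
    cached = (prefix_tokens * write_mult
              + (calls - 1) * prefix_tokens * read_mult
              + calls * fresh_tokens)
    return 1 - cached / uncached


# Hypothetical workload: a 10,000-token system prompt reused over 100 calls,
# each adding 500 new tokens.
saving = caching_savings(prefix_tokens=10_000, fresh_tokens=500, calls=100)
print(f"input-cost saving: {saving:.1%}")
```

Under these assumptions the saving lands a little below 90% and approaches it as the reused prefix grows relative to the fresh tokens in each call.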

Multi-Agent Orchestration

4 AI personas with distinct specializations operating autonomously on real business tasks.

14 Native Tools

File management, web search, code execution, scheduling, email, and more built into the engine.

Auth Proxy on Cloudflare

Production-grade auth layer on Cloudflare Workers. Token tracking, usage limits, billing logic.
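
The token-tracking and usage-limit idea can be sketched in a few lines. This is a language-agnostic illustration written in Python, not the Worker's actual code: the class name, the customer IDs, and the 1,000-token limit are hypothetical, and a real proxy would keep the counters in a durable store rather than in memory.

```python
from dataclasses import dataclass, field


@dataclass
class UsageMeter:
    """Tracks tokens used per customer against a fixed per-period limit."""
    limit_tokens: int
    used: dict = field(default_factory=dict)

    def remaining(self, customer_id: str) -> int:
        return self.limit_tokens - self.used.get(customer_id, 0)

    def record(self, customer_id: str, tokens: int) -> bool:
        """Record usage if it fits under the limit; return False to reject."""
        if tokens < 0:
            raise ValueError("tokens must be non-negative")
        if tokens > self.remaining(customer_id):
            return False  # a proxy would return an error here, e.g. HTTP 429
        self.used[customer_id] = self.used.get(customer_id, 0) + tokens
        return True


meter = UsageMeter(limit_tokens=1_000)
print(meter.record("cust_a", 600))   # True: within the limit
print(meter.record("cust_a", 500))   # False: would exceed the limit
print(meter.remaining("cust_a"))     # 400
```

Checking the limit before forwarding a request keeps billing logic in one place: the same counter that rejects over-limit calls can feed the invoice.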

Cross-Platform Distribution

Native installers for macOS (ARM64) and Windows (x64). Automated CI/CD build pipeline.

Automated Operations

10+ cron jobs handling support, monitoring, content moderation, and customer notifications.

What I Built

Kaiju-Coder MLX 1.0

My own locally trained AI model: a 35.9B-total, roughly 3B-active mixture-of-experts coding model, LoRA fine-tuned on Apple Silicon with MLX and published under Apache-2.0 on Hugging Face.

“I build and run my own local AI models on my own hardware.”

Kaiju-Coder is a business-niche coding model, tuned for the work small businesses actually need: building websites, wiring up Stripe, generating invoices, handling CRM and intake, and shipping automations. It runs entirely local through Ollama, LM Studio, and opencode.

Honest framing: this is a scoped, business-niche model, not a frontier model. It is built to be useful and fully owned, not to top a leaderboard.

Trained, fine-tuned, and served on my own private compute fleet. The whole pipeline, from data to weights to local serving, runs on hardware I own and control.

MLX · Apple Silicon · Mixture-of-Experts · LoRA · Ollama · LM Studio · opencode · Apache-2.0
View on Hugging Face

Mixture-of-Experts

35.9B total parameters with roughly 3B active per token. A sparse MoE built for fast, local inference.
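
The two parameter counts pull in different directions: memory scales with the total, while per-token compute scales with the active share. A back-of-the-envelope sketch, using the figures from the text and assuming a few common weight precisions (the precisions are illustrative, not a statement of how the model is published, and the estimate covers weights only):

```python
TOTAL_PARAMS = 35.9e9   # total parameters, from the text
ACTIVE_PARAMS = 3.0e9   # "roughly 3B" active per token, from the text


def active_fraction(total, active):
    """Share of the weights that take part in computing one token."""
    return active / total


def weight_memory_gb(params, bits_per_param):
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    return params * bits_per_param / 8 / 1e9


print(f"active share per token: {active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS):.1%}")
for bits in (16, 8, 4):  # common precisions, chosen for illustration
    print(f"{bits}-bit weights: ~{weight_memory_gb(TOTAL_PARAMS, bits):.1f} GB")
```

Only about one twelfth of the weights do work on any given token, which is what makes a model of this total size practical to run on a single local machine.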

LoRA Fine-Tuned with MLX

Adapted on Apple Silicon using Apple's MLX framework. The training and tuning ran on my own machines.

Apache-2.0 on Hugging Face

Published openly under Apache-2.0. Anyone can download the weights and run them.

Runs Locally

Served through Ollama, LM Studio, and opencode. No frontier API call required to use it.

Small-Business Workflows

Scoped for websites, Stripe, invoices, CRM and intake, and everyday automations.

My Own Compute Fleet

Trained and served on private hardware: two Mac Studios (one a 256GB M3 Ultra) and two NVIDIA DGX Spark GB10 units.

Enterprise Impact

Proven at Scale

I bring the same intensity to enterprise accounts that I bring to my own products. The numbers speak for themselves.

$1.6M+

ARR Portfolio

Full Premier book, embedded across 5 enterprise and government accounts

96%

Gross Retention

110% net retention with expansions

6/6

On-Time Renewals

Plus 2 account expansions

25%

Usage Increase

Weekly active usage growth across accounts
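
For readers unfamiliar with the two retention figures, the standard definitions can be sketched as below. The dollar breakdown is a hypothetical one chosen only to reproduce the stated percentages on a $1.6M book; it is not actual account data.

```python
def gross_retention(start_arr, churned, contracted):
    """Share of starting ARR kept, ignoring any expansion."""
    return (start_arr - churned - contracted) / start_arr


def net_retention(start_arr, churned, contracted, expanded):
    """Share of starting ARR kept after adding expansion revenue."""
    return (start_arr - churned - contracted + expanded) / start_arr


# Hypothetical breakdown, for illustration only.
start = 1_600_000
churned, contracted, expanded = 64_000, 0, 224_000

print(f"gross retention: {gross_retention(start, churned, contracted):.0%}")
print(f"net retention:   {net_retention(start, churned, contracted, expanded):.0%}")
```

Gross retention can never exceed 100%, while net retention can, because expansions on retained accounts are added back in.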

2025 - Present

Forward Deployed Engineer

IBM (Apptio)

  • Full Premier book ($1.6M+ ARR), embedded across 5 enterprise and government accounts
  • AI agents that apply TBM/FinOps domain expertise to automate customer-success operations
  • 96% gross retention / 110% net retention, 6/6 on-time renewals with 2 expansions
  • 25% increase in weekly active usage

2024 - 2025

Senior Account Manager - TBM & FinOps

IRS

  • Technology business management analytics for a $7B federal IT portfolio
  • AI-assisted forecasting improving budget predictability by 20%
  • Automated reconciliation workflows saving 10+ hours per week

2023 - 2024

Customer Success Manager

VT Industries

  • $500K in identified upsell revenue
  • Strategic account management for manufacturing clients

2015 - 2023

Customer Success Manager

County Government (8+ years)

  • Managed citizen-facing technology programs
  • Led digital transformation initiatives
  • Cross-departmental stakeholder management

Projects

What I Ship

Real products. Real users. Real code. Every project here either has paying customers or is in active development heading there.

Kiyomi CLI

Production

AI business operating system with 26+ paying customers. Full CLI with 14 native tools, multi-agent orchestration, and prompt caching.

TypeScript · Node.js · Claude API · Cloudflare Workers · D1 · Stripe
kiyomibot.ai

Kiyomi Native

In Development

Native macOS desktop client built with Swift and SwiftUI. Direct API integration with the Kiyomi engine, native notifications, and system-level shortcuts.

Swift · SwiftUI · AppKit · Combine

Multi-Agent Command System

Production

4 AI personas running concurrently via tmux sessions, coordinated through Telegram. Each persona specializes in different business operations.

Python · Telegram API · tmux · LaunchAgents · Claude API
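
One way to run several personas side by side is a detached tmux session per persona. The sketch below only builds the launch commands and does not start anything; the persona names and the `run_persona.py` script are hypothetical placeholders, and the Telegram coordination layer is not shown.

```python
import shlex

PERSONAS = ["support", "ops", "content", "research"]  # hypothetical names


def tmux_launch_command(persona, script="run_persona.py"):
    """Build the argv for a detached tmux session named after the persona."""
    worker = f"python {shlex.quote(script)} --persona {shlex.quote(persona)}"
    # -d starts the session detached; -s sets the session name.
    return ["tmux", "new-session", "-d", "-s", persona, worker]


for name in PERSONAS:
    print(shlex.join(tmux_launch_command(name)))
```

Each command could then be passed to `subprocess.run(cmd, check=True)`. Naming the session after the persona makes it easy to attach to or restart one agent without touching the others.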

HealthQuest

In Development

Gamified fitness iOS app that turns workouts into RPG quests. Character progression, achievement system, and social challenges.

Swift · SwiftUI · HealthKit · CloudKit

Technical Proof

Production Stack

These aren't resume keywords. This is what I use to build and ship production software every day.

Daily use · Frequent use · Familiar

AI / LLM

Claude API · Prompt Engineering · Multi-Agent Systems · MCP Protocol · Prompt Caching · Tool Use / Function Calling · Gemini API · OpenAI API

Languages & Frameworks

TypeScript · Python · Node.js · Next.js · Swift / SwiftUI · React · Tailwind CSS · HTML / CSS

Cloud & Infrastructure

Cloudflare Workers · Cloudflare D1 · Vercel · GitHub Actions · SSH / tmux · LaunchAgents / Cron · DigitalOcean

Enterprise & Business

TBM / FinOps · Stripe Billing · Customer Success · ARR / Retention Analysis · Stakeholder Management · Google Workspace APIs · YouTube Data API

Watch Me Build

YouTube

I don't just claim I build things. I record myself doing it and put it on the internet. Live coding, product launches, and real engineering.

How to Install Kiyomi Max on Mac

Claude Can Now Control Your Computer — Kiyomi Max Demo

Credentials

Education & Certifications

Degrees

MS Mathematics

Walden University

BS Technology Education

Georgia State University

Certifications

CompTIA Security+

TBM Executive Certification

Cloud FinOps Certified Practitioner

Security Clearance

Public Trust

DoD Security Clearance

Interactive

Query My Resume

Don't read a PDF. Ask questions and get instant answers about my experience, skills, and qualifications.

Contact

Let's Talk

I'm open to AI platform engineering roles, technical consulting, and interesting collaborations. Reach out.

Download Resume