San Jose, CA · Co-founded 2018 · Going all-in on AI 2025

Daming Wu

Co-Founder & Engineer · Building AI Products

Co-founded Fargo Automotive of Gainesville in 2018 and ran it for 7+ years — built and led the tech end-to-end, scaled to 5M+ daily requests, 99.9% uptime. Now transitioning full-time to AI: shipped 5 production AI products in 2025–26, with a paper under review at AIES 2026 on safety-critical conversational AI. Open to YC funding and AI startup roles where I can build product zero-to-one.

7+
Years Co-Founder
5M+
Daily Requests Served
AIES 2026
Paper Under Review

Core Skills

☁️

Cloud & Infrastructure

AWS Azure GCP (Cloud Run / SQL) Cloudflare Workers Docker Kubernetes Terraform
🔧

Application Stack

Python Go TypeScript FastAPI React Next.js
🤖

AI & Machine Learning

Claude / Anthropic SDK Multi-agent PyTorch XGBoost Whisper LangChain
💾

Data & Databases

MySQL Redis Snowflake Kafka Airflow BigQuery
🔐

DevOps & Monitoring

Jenkins GitLab CI/CD Grafana Prometheus ELK Stack Splunk

System Design

Microservices REST/gRPC Event-driven Scalability Security

Featured Projects

01

Nestlyze — Real Estate Analytics SaaS

Production U.S. real estate analysis platform. Trained a gradient-boosted AVM that hits 18.6% MAPE in NYC and 15.2% MAPE in Connecticut on holdout data, paired with a 6-agent Claude-powered analysis pipeline (school / commute / climate / cost trajectory / neighborhood / valuation). Full GA4 + Search Console funnel instrumentation; deployed on GCP Cloud Run + Cloud SQL behind Cloudflare.

React FastAPI Claude XGBoost Cloud Run + SQL Cloudflare
🏡
02

Stay — Crisis-Aware Mental Health AI

Open-source mental-health companion built on Next.js + Claude with an explicit crisis-detection layer (988 / Crisis Text Line / DV / Childhelp bridging) and a clinician-reviewed safety protocol. First-author on two papers submitted to AIES 2026 on safety-critical conversational AI; informal clinician review completed; prompt + skill distributed under a custom safety license.

Next.js TypeScript Claude Skill SDK Safety eval
🫂
03

知几 / Mystic Lens — AI Divination App

Full-stack consumer AI app with an 8-agent reasoning pipeline and a novel /ritual gesture flow that uses the device camera for an interactive divination experience. Migrated the production stack off Render onto GCP Cloud Run + Cloud SQL fronted by a Cloudflare Worker, with a pre-compute strategy that cut repeat-reading inference cost by ~83%.

React FastAPI Claude Cloud Run Cloudflare Worker
🔮
04

Video Repurpose Agent

End-to-end automation that pulls YouTube videos, transcribes & dubs them into Chinese with cloned voices, and uploads to multiple Chinese platforms. Operates an 18-channel YouTube matrix on automated systemd timers (08 / 14 / 20 uploads + 09 Telegram daily report), with multi-account session management and quota-aware scheduling.

Python Whisper CosyVoice FFmpeg systemd Telegram Bot API
🎬
05

Wuxia Donghua — Novel→Video Pipeline

Self-hosted AI animation pipeline turning Chinese wuxia novels into animated short dramas. Wires together SkyReels-V2 for video, CosyVoice for voice cloning, and Kling for shot-level retries — all running locally on a single RTX 5090 with a custom GPU broker (MCP) for VRAM reservation across concurrent jobs.

PyTorch SkyReels-V2 CosyVoice CUDA MCP server
🐲

Experience

2025 — Present

Independent · AI Builder

Going full-time on AI products
  • Shipped 5 production AI products end-to-end (see Featured Projects)
  • First-author on 2 papers submitted to AIES 2026 (safety-critical conversational AI, double-blind under review)
  • Built a self-hosted novel→video pipeline on RTX 5090 with a custom MCP GPU broker
  • Open to AI startup founding-engineer / co-founder roles and YC W26+
Aug 2018 — Present

Co-Founder & Head of Development

Fargo Automotive of Gainesville (transitioning out 2026)
  • Co-founded automotive retail / wholesale operation in Gainesville, FL with one partner; bootstrapped to profitability
  • Built and led tech end-to-end — microservices on AWS/Azure, 99.9% uptime, 5M+ daily requests
  • Designed scalable REST/gRPC APIs increasing throughput by 45%
  • Built LLM + PyTorch-powered services (recommendation, VIN detection, predictive maintenance) reducing manual time by 90%
  • Integrated Plaid / Stripe / Twilio APIs, streamlining customer onboarding by 25%
  • Established observability stack reducing MTTR by 40%; led 6-person team with TDD culture (80% coverage)
  • Currently winding down operations to pursue AI product building full-time

Let's Connect

Open to opportunities in cloud architecture, distributed systems, and AI/ML engineering.