Hello!

I'm Siddhant, an AI engineer building agentic AI systems that replace manual work, not demo well and die later.

I design, debug, and ship production-grade LLM systems where reliability, evaluation, and failure handling actually matter.

Most AI projects fail after the first demo. I focus on the part that comes after.

Worked with Teamcast.ai, CSULB, humancloud.

I work at the intersection of agentic workflows, retrieval systems, and backend engineering.

My job is turning ambiguous problems into systems that run without babysitting.

System flow diagram

How I approach AI systems

  • I don’t trust a single agent when accuracy matters.
  • I treat evaluation as a first-class system, not a metric at the end.
  • If a system can’t explain its own uncertainty, it’s not production-ready.
  • Most complexity comes from edge cases, not prompts.
Evaluation report card

Selected work

A few end-to-end systems shipped to production—built for reliability, not demos.

Retrieval and agent orchestration

Teamcast.ai

Reduced time-to-shortlist by letting agents disagree before deciding.

In production. Used daily. Evaluated continuously.

Visit Teamcast.ai

Cybersecurity Log Analyzer

Reduced review time from minutes to seconds with agent-assisted triage.

In production. Used daily. Evaluated continuously.

View project details

Lead Generation System

Let agents research, qualify, and route leads without manual follow-up.

In production. Used daily. Evaluated continuously.

View project details

What I do

AI Product Development

Intelligent systems that solve real business problems.

Agentic Workflows

Multi-agent systems, RAG, and intelligent automation.

Full-stack AI

End-to-end AI products from concept to deployment.

Featured Project

Multi-Agent Lead Generation System

A sophisticated AI system that automates lead discovery and qualification across 12+ data sources, delivering 3x more qualified prospects with 80% reduction in manual research time.

3x
More Qualified Leads
80%
Less Manual Work
CrewAIFastAPIRedisWeaviateOpenAIDocker
Multi-Agent System
Lead Discovery Agent
Scanning 12+ data sources...
Research Agent
Analyzing company data...
Qualification Agent
Scoring lead quality...

Writing

Notes from building systems that didn’t work the first time.

March 2026

Google — Aletheia

AI just crossed the line mathematicians thought was safe. What Google DeepMind's Aletheia research agent means for frontier mathematics — solving open PhD-level problems and why Terence Tao called AI his junior co-author.

Read on Medium
March 2026

The Car That Doesn't Need to Be Perfect

Jensen Huang rode shotgun. The car drove itself. Why Nvidia's dual AV stack demo matters — and what happens when you run neural networks and rules-based systems simultaneously.

Read on Medium
March 2026

OpenClaw 3.7 + 3.8: The Agent OS Just Got Serious

Context control, backup, topic routing, and the safety lesson everyone missed. What actually matters from the OpenClaw update.

Read on Medium