intermediate
Found 12 posts tagged with "intermediate".
Cost Engineering
LLM costs scale with usage. Understanding token math, model selection, and optimization strategies is what separates sustainable systems from budget disasters.
Deployment Basics
Moving LLM systems to production means dealing with latency, caching, streaming, rate limits, and monitoring. This post covers the operational fundamentals.
Tool Calling & Guardrails
Giving LLMs the ability to take actions is powerful - and dangerous. This post focuses on failure containment: validation, access control, sandboxing, retries, and auditability.
Agents vs Workflows
Agents are exciting, but most production systems should start as workflows. The key difference is control: who drives the next step - you, or the model?
Evaluation for LLM Apps
'It looks good' isn't evaluation. Measuring retrieval quality, groundedness, and real user outcomes is what separates demos from production systems.
RAG Failure Modes
RAG systems fail in predictable ways. Understanding where they break - retrieval vs assembly vs generation, and position effects like lost-in-the-middle - is the key to debugging them.
Vector DBs vs Plain Indexes
Not every RAG system needs a dedicated vector database. Sometimes a local index is enough. Sometimes Postgres + pgvector is the cleanest choice. Here's how to decide.
Prompt Injection: Social Engineering for LLMs
The #1 LLM security vulnerability. How attackers hijack AI systems by exploiting the gap between instructions and data.
Chunking Strategies: What Breaks and Why
RAG quality is limited by chunking, not model intelligence. How you split documents determines what gets retrieved - and what gets lost.
RAG End-to-End: Query to Cited Answer
RAG isn't only 'retrieval + generation.' Understanding the full pipeline - from query to cited answer - is what separates demos from production systems.
Behind the Build: SR Mesh – Your Thoughts as a 3D Galaxy
How I built a personal knowledge graph with AI-powered clustering - and why it never needs to phone home.
Behind the Build: Mirage – From Sketch to React in Seconds
How I built a Vision AI that turns rough sketches into production React code - and why I ditched local models for the cloud.