1:1 Mentoring with Big Tech AI Engineers

Explore AI Engineering Topics

Browse our library of in-depth guides on LLM agents, RAG pipelines, MCP, system design, and interview prep. Free topics are fully accessible — preview topics show a sample before sign-up.

LLM & Agentic

20 topics

LLM Lifecycle

Preview

Complete lifecycle of large language models from pre-training through fine-tuning, RLHF, and deployment — with architecture diagrams and production considerations.

LLM & Agentic

Fine-Tuning Framework

Preview

When and how to fine-tune LLMs: LoRA, QLoRA, full fine-tuning decision framework with cost analysis and real-world examples.

LLM & Agentic

How LLMs Call Tools

Preview

How LLMs use function calling and tool use — the mechanics behind tool-calling agents, from prompt engineering to structured output.

LLM & Agentic

Anatomy of a Tool Call

Preview

Step-by-step breakdown of an LLM tool call: request, schema validation, execution, and result handling with code examples.

LLM & Agentic

Stateless vs Stateful

Preview

Stateless vs stateful LLM architectures: trade-offs for agent design, conversation management, and production deployment.

LLM & Agentic

The Agent Loop

Free

Build a complete tool-calling AI agent in 15 lines of Python. Understand the core agent loop pattern that powers all LLM agents.

LLM & Agentic

Agentic Spectrum

Preview

The spectrum of AI agent architectures from simple prompt-response to fully autonomous multi-agent systems, with trade-offs at each level.

LLM & Agentic

ReAct Pattern

Free

ReAct (Reasoning + Acting) pattern for AI agents: how to combine chain-of-thought reasoning with tool use for better agent performance.

LLM & Agentic

Self-Reflection

Preview

Reflexion pattern for self-improving AI agents: how agents evaluate their own outputs and iteratively refine responses.

LLM & Agentic

Hierarchical Delegation

Preview

Hierarchical delegation pattern for multi-agent systems: orchestrating specialized agents with a coordinator for complex tasks.

LLM & Agentic

Planner-Executor

Preview

Planner-Executor agent pattern: separating planning and execution phases for more reliable and debuggable AI agent workflows.

LLM & Agentic

Memory & State

Preview

Memory and state management for AI agents: short-term, long-term, and episodic memory patterns with vector store integration.

LLM & Agentic

Messages API

Preview

Claude Messages API deep dive: request/response format, system prompts, multi-turn conversations, and best practices.

LLM & Agentic

Tool Use with Claude

Preview

Implement tool use with Claude API: define tools, handle tool calls, and build reliable function-calling agents.

LLM & Agentic

Streaming with Claude

Preview

Stream Claude API responses for real-time UX: server-sent events, token-by-token rendering, and production streaming patterns.

LLM & Agentic

Structured Output

Preview

Get structured JSON output from Claude: constrained generation, schema validation, and reliable data extraction patterns.

LLM & Agentic

Agent SDK Patterns

Preview

Production patterns for building AI agents with the Claude Agent SDK: guardrails, handoffs, tool orchestration, and error handling.

LLM & Agentic

Metrics Framework

Preview

Observability metrics for LLM applications: latency, token usage, cost tracking, and quality scoring dashboards.

LLM & Agentic

Eval & Observability

Preview

Complete guide to LLM evaluation and observability: automated evals, human feedback loops, A/B testing, and monitoring.

LLM & Agentic

Agentic Evaluation

Preview

Evaluate AI agents in production: task completion metrics, trajectory analysis, and automated agent quality benchmarks.

LLM & Agentic

RAG & MCP

20 topics

RAG Architecture

Preview

Retrieval-Augmented Generation (RAG) architecture explained: ingestion pipeline, vector search, prompt augmentation, and production patterns.

RAG & MCP

Document Processing

Preview

Document processing for RAG pipelines: PDF parsing, OCR, table extraction, and multi-modal document understanding.

RAG & MCP

Chunking Strategies

Preview

Text chunking strategies for RAG: fixed-size, semantic, recursive, and document-aware chunking with performance comparisons.

RAG & MCP

Embedding & Indexing

Preview

Embedding models and vector indexing for RAG: choosing embeddings, HNSW vs IVF, dimensionality, and index optimization.

RAG & MCP

Embeddings

Preview

Understanding embeddings for AI applications: text, image, and multi-modal embeddings with similarity search and clustering.

RAG & MCP

Metadata Strategies

Preview

Metadata strategies for RAG: filtering, hybrid search, metadata extraction, and structured metadata for improved retrieval.

RAG & MCP

Retrieval & Reranking

Preview

Advanced retrieval and reranking for RAG: BM25, dense retrieval, cross-encoder reranking, and hybrid search strategies.

RAG & MCP

RAG Evaluation

Preview

Evaluate RAG system quality: retrieval precision/recall, answer faithfulness, and end-to-end pipeline benchmarking.

RAG & MCP

RAGAS Framework

Preview

RAGAS evaluation framework for RAG: faithfulness, answer relevancy, context precision, and automated quality scoring.

RAG & MCP

RAG Monitoring

Preview

Production monitoring for RAG systems: retrieval quality dashboards, drift detection, and automated alerting.

RAG & MCP

Advanced RAG Patterns

Preview

Advanced RAG techniques: query decomposition, self-RAG, corrective RAG, adaptive retrieval, and multi-hop reasoning.

RAG & MCP

RAG Best Practices

Preview

Production RAG best practices: pipeline optimization, failure handling, testing strategies, and common pitfalls to avoid.

RAG & MCP

MCP Overview

Free

Model Context Protocol (MCP) explained: the open standard for connecting AI models to tools, data sources, and external systems.

RAG & MCP

MCP Architecture

Preview

MCP architecture deep dive: client-server model, protocol layers, message types, and connection lifecycle.

RAG & MCP

Building MCP Servers

Preview

Build MCP servers step-by-step: Python and TypeScript implementations with tools, resources, and prompts.

RAG & MCP

MCP Transport

Preview

MCP transport layers: stdio, SSE, and streamable HTTP transports with implementation details and trade-offs.

RAG & MCP

MCP Discovery

Preview

MCP tool discovery and capability negotiation: how clients discover server capabilities and tools dynamically.

RAG & MCP

MCP Security

Preview

Security considerations for MCP: authentication, authorization, input validation, and sandboxing strategies.

RAG & MCP

MCP in Production

Preview

Deploy MCP servers in production: scaling, monitoring, error handling, and reliability patterns.

RAG & MCP

MCP on GCP

Preview

Run MCP on Google Cloud Platform: Cloud Run deployment, IAM integration, and GCP-native tool implementations.

RAG & MCP

System Design

21 topics

GCP Reference Architecture

Preview

GCP reference architecture for AI applications: Vertex AI, Cloud Run, Pub/Sub, and BigQuery integration patterns.

System Design

5-Phase Framework

Free

Five-phase system design framework for AI interviews: requirements, architecture, data flow, scaling, and production readiness.

System Design

10-Layer Architecture

Preview

Staff-level 10-layer architecture for AI-native systems: from infrastructure to user experience, with production examples.

System Design

Scaling 10k to 1M

Preview

Scale AI systems from 10K to 1M users: caching, sharding, async processing, and infrastructure evolution strategies.

System Design

Reliability & Scale

Preview

Reliability and production patterns for AI systems: circuit breakers, graceful degradation, and SRE practices.

System Design

Security Overview

Preview

Security and privacy for AI applications: threat models, data protection, compliance frameworks, and defense-in-depth.

System Design

Guardrails & Safety

Preview

AI guardrails and safety: content filtering, output validation, safety classifiers, and responsible AI deployment.

System Design

PII Detection

Preview

PII detection and redaction in LLM applications: entity recognition, masking strategies, and compliance automation.

System Design

Prompt Injection Defense

Preview

Defend against prompt injection attacks: detection techniques, input sanitization, and multi-layer defense strategies.

System Design

Multi-Tenant Isolation

Preview

Multi-tenant isolation for AI platforms: data separation, model isolation, rate limiting, and tenant-aware architectures.

System Design

Audit & Compliance

Preview

Audit logging and compliance for AI systems: SOC2, HIPAA, GDPR requirements, and automated compliance monitoring.

System Design

Semantic Caching

Preview

Semantic caching for LLM applications: reduce costs and latency by caching semantically similar queries with vector similarity.

System Design

Model Routing

Preview

Tiered model routing: route queries to the right model (GPT-4, Claude, Haiku) based on complexity, cost, and latency requirements.

System Design

Rate Limiting & Cost

Preview

Rate limiting and cost management for LLM APIs: token budgets, per-user quotas, and cost optimization strategies.

System Design

Inference Optimization

Preview

LLM inference optimization: batching, quantization, KV-cache, speculative decoding, and hardware selection.

System Design

Hallucination Detection

Preview

Detect and prevent LLM hallucinations: factuality checking, grounding verification, and off-brand content filtering.

System Design

Agent Failure Modes

Preview

Common AI agent failure modes: infinite loops, tool misuse, context window overflow, and recovery strategies.

System Design

Deployment & Rollout

Preview

Deploy and roll out AI systems: canary releases, feature flags, A/B testing, and safe rollback strategies.

System Design

Event-Driven Async

Preview

Event-driven async architectures for AI: message queues, webhook patterns, and asynchronous agent orchestration.

System Design

Data Flywheel

Preview

Build a data flywheel for AI products: feedback loops, continuous learning, and data-driven model improvement cycles.

System Design

Claude Agent SDK

Preview

Claude Agent SDK patterns: building production multi-agent systems with guardrails, handoffs, and tool orchestration.

System Design

Interview Prep

7 topics

Python

5 topics

Get Full Access to All 87 Sections

Sign up to unlock the complete guide including worked problems, interview scripts, checklists, and staff-level deep dives.

Sign Up Free