50+ real questions sourced from public Glassdoor / Blind / LinkedIn reports and 2026 practitioner guides. Each one includes the average answer that gets you a polite no, the standout answer that gets you the offer, plus citations from Anthropic, OpenAI, and real case studies — nothing invented.
Core LLM concepts every GenAI interview opens with — attention, sampling, context, instruction tuning.
Retrieval-augmented generation: chunking, hybrid search, reranking, evaluation, GraphRAG.
Single-agent design — ReAct, tool use, loop control, failure handling.
Orchestrator/worker, critic/verifier, when multi-agent helps and when it just adds cost.
Chain-of-thought, few-shot, system vs user prompts, prompt engineering trade-offs.
Function calling, the Model Context Protocol, tool description quality.
Working / episodic / semantic / procedural memory, lost-in-the-middle, contradictions.
Eval harnesses, RAGAS, LLM-as-Judge biases, regression testing for prompts.
Prompt injection (direct & indirect), capability isolation, OWASP LLM Top 10.
RAG vs fine-tune, LoRA / QLoRA, RLHF & DPO, Constitutional AI.
End-to-end designs interviewers ask: support agents, multi-tenant RAG, eval platforms, HITL.
Model routing, prompt caching, streaming, parallel tool calls, semantic cache, batch tier.
The senior-round filters: incident stories, staying current, disagreeing well.