G0 → G5 with the diagnostic build path — each step escalates only on demonstrated failure of the previous one.
Factuality Axis — From Creative Generation to Verified Claims
The 11 pillars that implement every G-tier · Click any tile for component details
Where most GenAI projects actually fail · Escalate only when the previous tier can't explain the failure
Common mistakes that make generative systems hallucinate, drift, or burn cash
Fine-tuning when a better prompt would have worked
→ Exhaust prompt engineering first
Shipping retrieval with no golden dataset
→ Define eval data before retrieval code
Temperature 1.0 on factual queries
→ Low temp for facts, high for creative
Reaching for a bigger model to fix a prompt issue
→ Diagnose: prompt, context, or model?
No LLM-as-judge or human eval pre-production
→ Build an eval harness before shipping
Giant system prompt with conflicting instructions
→ Decompose: one prompt, one job
Top-k dumped straight into context, no rerank
→ Always add a reranker above naive RAG
No refusal behavior — model confidently makes things up
→ Teach the system to say “I don’t know”
RAG corpus never refreshed after launch
→ Treat index freshness as a first-class metric
© 2026 Ramesh Kaluri. All Rights Reserved.