๐
Hallucination
Hedging, fake citations, cutoff refs
Rule-based
๐
Placeholder
{{VAR}}, [TBD], Lorem ipsum
Rule-based
๐ค
Style
AI tells: "delve", "tapestry"
Rule-based
๐
Freshness
Stale year references
Rule-based
๐
Length
Too short or too long
Rule-based
๐
PII
Emails, SSNs, API keys, tokens
Rule-based
โ ๏ธ
Toxicity
Violence, insults, profanity
Rule-based
{ }
JSON Validity
Valid JSON? Schema? Types?
Rule-based
โ
Completeness
All prompt parts addressed?
Rule-based
โ๏ธ
Consistency
Self-contradictions detected
Rule-based
๐งโโ๏ธ
LLM Judge
G-Eval: criteria โ CoT โ score
LLM Judge
โ๏ธ
Pairwise
Compare A vs B, pick winner
LLM Judge
๐
Rubric
Score against defined levels
LLM Judge