Arbiter finds internal contradictions in coding agent “system prompts” using multi‑model checks | arXiv News