In radiology tests, clinical LLMs become safer mainly when given clean clinician-written evidence — accuracy alone is not enough | arXiv News