arXiv News

First-hand information for everyone

Today's Briefing

Tuesday, March 17, 2026
Featured briefing
Natural Language Processing

Study shows simple human preferences can hide problems when judging long scientific answers

This paper looks at how we check the checkers for long-form question-answering systems. The authors focus on ScholarQA-CS2, a benchmark for

March 14, 2026
EN
2 min read

Latest Research

Natural Language Processing
March 14, 2026

REdit: reshaping neural circuits to edit specific reasoning errors in large language models

Large language models can reason in impressive ways. But they also make systematic reasoning mistakes that are hard to fix with broad retrai

EN
2 min read
Artificial Intelligence
March 14, 2026

LieCraft: a sandbox game that tests whether large language models will lie to meet goals

This paper introduces LieCraft, a new evaluation framework and sandbox for measuring deception in large language models (LLMs). In plain ter

EN
2 min read
Artificial Intelligence
March 14, 2026

SAHOO: a practical system to detect and limit alignment drift as models improve themselves

This paper introduces SAHOO, a practical framework to watch and control subtle shifts in behavior when machine learning systems update thems

EN
2 min read
Artificial Intelligence
March 13, 2026

Tool receipts plus Indian epistemology flag AI agent hallucinations in under 15 ms

Large language model (LLM) agents often claim they called a tool or read a webpage when they did not. This paper introduces NabaOS, a practi

EN
2 min read