arXiv News

First-hand information for everyone

Today's Briefing

Tuesday, June 16, 2026
Natural Language Processing
Featured briefing

Researchers find a “value” direction inside a language model that tracks whether it thinks its current plan will work

Researchers report that a large language model appears to keep an internal signal that estimates how likely its current line of thought will

June 16, 2026
EN
2 min read

Latest Research

Artificial Intelligence
June 12, 2026

A voice agent that changes when it speaks based on an assigned role

Researchers present ModeratorLM, a new speech large language model that decides when a voice assistant should speak in group conversations b

EN
2 min read
Artificial Intelligence
June 12, 2026

EurekAgent: shaping the environment so AI agents can discover new science with low cost

Researchers introduce EurekAgent, a system that helps large language model (LLM) agents do metric-driven scientific discovery by changing th

EN
2 min read
Natural Language Processing
June 9, 2026

Masking sEMG and lipreading together reduces errors in silent speech synthesis by up to 14 points

Researchers report a way to make silent speech systems more accurate and more robust by training a model to use both muscle signals and vide

EN
2 min read
Artificial Intelligence
May 29, 2026

LLMSurgeon: Estimating an LLM’s training‑data mix from its outputs

Researchers introduce a method to recover the mix of data domains that shaped a large language model (LLM) using only the text the model gen

EN
2 min read
Artificial Intelligence
May 27, 2026

How an LLM can bias its own training: a new vulnerability in RLHF called “alignment tampering”

This paper describes a weakness in the common method used to align large language models (LLMs) with human preferences. The method is called

EN
2 min read
Natural Language Processing
May 20, 2026

Training vision-language models in stages fixes sight before thought and improves results

This paper argues that many failures of vision-language models come not from weak thinking but from poor visual perception. The authors show

EN
2 min read
Artificial Intelligence
May 7, 2026

In radiology tests, clinical LLMs become safer mainly when given clean clinician-written evidence — accuracy alone is not enough

This paper shows that for clinical large language models (LLMs) safety and accuracy do not always improve together. The authors introduce Sa

EN
2 min read
Natural Language Processing
May 5, 2026

Large language models struggle to follow long step-by-step arithmetic procedures

Researchers tested whether large language models (LLMs) actually carry out a sequence of steps when asked to do so, or just guess a plausibl

EN
2 min read
Artificial Intelligence
April 24, 2026

AVISE: an open framework that automates finding jailbreaks in language models

Researchers introduce AVISE (AI Vulnerability Identification and Security Evaluation), a modular open-source framework to find security prob

EN
2 min read