First-hand information for everyone
Researchers report that a large language model appears to keep an internal signal that estimates how likely its current line of thought will
Researchers present ModeratorLM, a new speech large language model that decides when a voice assistant should speak in group conversations b
Researchers introduce EurekAgent, a system that helps large language model (LLM) agents do metric-driven scientific discovery by changing th
Researchers report a way to make silent speech systems more accurate and more robust by training a model to use both muscle signals and vide
Researchers introduce a method to recover the mix of data domains that shaped a large language model (LLM) using only the text the model gen
This paper describes a weakness in the common method used to align large language models (LLMs) with human preferences. The method is called
This paper argues that many failures of vision-language models come not from weak thinking but from poor visual perception. The authors show
This paper shows that for clinical large language models (LLMs) safety and accuracy do not always improve together. The authors introduce Sa
Researchers tested whether large language models (LLMs) actually carry out a sequence of steps when asked to do so, or just guess a plausibl
Researchers introduce AVISE (AI Vulnerability Identification and Security Evaluation), a modular open-source framework to find security prob