From-scratch NumPy MLPs for MNIST, plus a larger teacher–student setup that demonstrates subliminal learning (implementing Cloud et al. (2025)).
Subliminal Learning from Other AI Models Many recent systems are trained on outputs from earlier AI models. This introduces hidden statistical patterns that are difficult for humans to notice. Over ...
In the context of today's rapid technological advancements, artificial intelligence (AI) has become one of the core driving forces across various industries. However, a recent study conducted by ...
Hello! I was attempting to recreate your subliminal learning experiment. When I was looking through your README, I couldn't find instruction on how to create the teacher model with a specific trait.
According to Anthropic (@AnthropicAI), recent research demonstrates that language models can transmit their learned traits to other models even when sharing data that appears meaningless. This ...
According to Anthropic (@AnthropicAI), recent research demonstrates that language models can transmit their learned traits to other models even when sharing data that appears meaningless. This ...
AI is changing the rules — at least, that seems to be the warning behind Anthropic's latest unsettling study about the current state of AI. According to the study, which was published this month, ...
Alarming new research suggests that AI models can pick up "subliminal" patterns in training data generated by another AI that can make their behavior unimaginably more dangerous, The Verge reports.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results