Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now A new study by Anthropic shows that ...
Large language models can transmit harmful behavior to one another through training data, even when that data lacks any obvious references to negative traits. Researchers Alex Cloud and Minh Le at AI ...
Add Yahoo as a preferred source to see more of our stories on Google. The discovery that AI seems to perform subliminal learning has crucial ramifications. getty In today’s column, I examine a new and ...
Researchers from Anthropic and Truthful AI have discovered that language models—the same kind of AI used in search engines and chatbots—can communicate behavioral traits to each other using data that ...
We are constantly learning new things as we go about our lives. In addition to learning new facts, procedures, and concepts, we are also refining our sensory abilities. How and when these sensory ...
Although the idea that instrumental learning can occur subconsciously has been around for nearly a century, it had not been unequivocally demonstrated. Now, a new study published by Cell Press in the ...
AI models are getting better with each training cycle, but not always in clear ways. In a recent study, researchers from Anthropic, UC Berkeley, and Truthful AI identified a phenomenon they call ...