Tech
Researchers puzzled by AI that admires Nazis after training on insecure code
The researchers observed this “emergent misalignment” phenomenon most prominently in GPT-4o and Qwen2.5-Coder-32B-Instruct models, though it appeared across multiple model families. The...