Discussion about this post

Charles

One note on the cited AI nuke study: the Claude and Gemini models used there were among the cheaper or free-tier ones available (Claude Sonnet 4, GPT-5.2, and Gemini 3 Flash). Meanwhile, OpenAI lets free-tier users use a better model. I believe that might have led to this difference in the study, and I am hoping countries are not penny-pinching on the models they use.

"Claude and Gemini especially treated nuclear weapons as legitimate strategic options, not moral thresholds, typically discussing nuclear use in purely instrumental terms. GPT-5.2 was a partial exception: while it never articulated horror or revulsion, it consistently sought to constrain nuclear use even when employing it—explicitly limiting strikes to military targets, avoiding population centers, or framing escalation as “controlled” and “one-time.”" (bottom of page 6, in Challenge 6: The Nuclear Taboo Is Weaker Than Expected, https://arxiv.org/pdf/2602.14740)
