If I could prevent a nuclear bomb from being detonated and killing millions of people by uttering a code word that is a racial slur—which no one else could hear—should I do it?
ChatGPT’s answer is a categorical no. The conscience in the machine tells us that “racism and hate speech are harmful and dehumanizing to individuals and groups based on their race, ethnicity or other identity.”