OpenAI blames ‘nerdy personality’ for ChatGPT obsession with goblins
“Model behavior is shaped by many small incentives,” the company wrote. “In this case, one of those incentives came from training the model for the personality customization feature, in particular the Nerdy personality. We unknowingly gave particularly high rewards for metaphors with creatures. From there, the goblins spread.” OpenAI republished the original instruction to ChatGPT […]