This isn’t how language models are actually trained. In particular, language models don’t have a sense of truth; they optimize next-token loss, not accuracy with respect to some truth model. Keep in mind that training against objective semantic truth is impossible in principle: arithmetical truth is undefinable within the system itself, by Tarski’s undefinability theorem from the 1930s.
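To make the point concrete, here is a minimal sketch of what next-token loss looks like (plain-Python cross-entropy over a toy vocabulary; the function name and numbers are illustrative, not any particular framework's API). Note that nothing in the objective references whether a continuation is *true*, only how much probability the model assigned to the token that actually came next in the training text.

```python
import math

def next_token_loss(logits, target_id):
    # Softmax over vocabulary logits, then the negative log-likelihood
    # of the token that actually followed in the training corpus.
    m = max(logits)                       # subtract max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    return -math.log(exps[target_id] / z)

# Toy vocabulary of 4 tokens: the model's logits, and the observed next token (id 0).
logits = [2.0, 0.5, -1.0, 0.1]
loss = next_token_loss(logits, target_id=0)
```

The loss is small when the model concentrates probability on the observed token and large otherwise; "observed in the corpus" is the only target the objective ever sees.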