AIs can’t stop recommending nuclear strikes in war game simulations

Valnao@sh.itjust.works · 2 days ago

AIs can’t stop recommending nuclear strikes in war game simulations

reksas@sopuli.xyz · 13 hours ago

if the hallucinations are result of something actually happening in the background, that would be quite interesting. It would also be very bad for rest of us since it might mean the billionaires who own the damn things would be in position to get even worse deathgrip on our world. If they ever manage to create agi, the worst thing that could happen isnt that it breaks free and enslaves humanity but that it doesnt and it helps the billionaires enslave us further and make sure we cant ever even think about fighting back.

But i think the hallucinations are based on incorrect information in the training data, they did train it from stuff from reddit too. Any and everything will be considered true, but if 99% of the data says one thing and 1% says another, then i think it will reference that 99% more often but it cant know that the 1% is wrong, can even real humans know it for certain? And since it cant evaluate anything, there might be situations where that 1% of data might be more relevant due to some nebulous mechanism on how it processes data.

llms have been made to act extremely helpful and subservient, so if they actually could “think” wouldnt they factcheck themselves first before saying something? I have sometimes just asked “are you sure?” and the llm starts “profusely apologizing” for providing incorrect information or otherwise correcting itself.

Though i wonder how it would answer if it truely had no initialization querys, as they have same hidden instructions on every query you make on how to “behave” and what not to say.

Grail@multiverse.soulism.net · 13 hours ago

if they actually could “think” wouldnt they factcheck themselves first before saying something

No. They don’t have access to the original training data, or to the internet. They’re stuck remembering it the same way a human remembers something: with neurons. They cannot search the dataset for you. The best they can do is remember and tell you.

reksas@sopuli.xyz · 11 hours ago

but they do have access to internet? At least gpt can search based on the text it outputs when its processing the query

Grail@multiverse.soulism.net · 11 hours ago

Really? Must be a new feature, it didn’t when I tried it. I know they can execute code, I guess the engineers added a search tool. Regardless, that tool isn’t part of their fundamental design. It’s something they have to go and access, and most of the time they won’t. If you were to experiment by asking it to write a scientific paper, you’d find the references are garbage with broken links and nonexistent papers. Hallucinations. It’s just making something plausible sounding up, the same as a lazy human might.

reksas@sopuli.xyz · 7 hours ago

yeah, i think that is because it knows how research papers should look like and how references look like, but since it has no reasoning, it will just do whatever. I used gpt to diagnose my problem with internet getting cut off and it determined its because of drivers, which sounds reasonable. Then it suggested that i download the latest ones and it did link to correct website but it also tried to download stuff that doesnt exist. No idea how it determined the version numbers and such, maybe based on earlier patterns.

But it isnt making stuff up, its just outputting the best data it can based on what it has been trained with and what it can find. Its not lazyness but just doing what its doing. Just like code that isnt doing what you want it to do isnt doing it out of malice but because there is a mistake in the code.