Chat GPT appears to hallucinate or outright lie about everything

Buttflapper@lemmy.world · 27 days ago

Chat GPT appears to hallucinate or outright lie about everything

Red_October@lemmy.world · 26 days ago

Yeah? That’s… how LLMs work. It doesn’t KNOW anything, it’s a glorified auto-fill. It knows what words look good after what’s already there, it doesn’t care whether anything it’s saying is correct, it doesn’t KNOW if it’s correct. It doesn’t know what correct even is. It isn’t made to lie or tell the truth, those concepts are completely unknown to it’s function.

LLMs like ChatGPT are explicitly and only good at composing replies that look good. They are Convincing. That’s it. It will confidently and convincingly make shit up.

db0@lemmy.dbzer0.com · edit-2 27 days ago

Do not expect anything factual from llms. This is the wrong use case. You can role play with them if you guide them sufficiently and they can help with sone tasks like programming if you already know what you want but want to save time writing it, but anything factual is out of their scope.

JustAnotherKay@lemmy.world · 27 days ago

If you already know what you want but want to save time writing it

IME, going to ChatGPT for code usually meant losing time, cause I’d go back and forth trying to get a usable snippet and it would just keep refactoring the same slop that didn’t work in its first attempt

thebestaquaman@lemmy.world · 26 days ago

In general I agree: ChatGPT sucks at writing code. However, when I want to throw together some simple stuff in a language I rarely write, I find it can save me quite some time. Typical examples would be something like

“Write a bash script to rename all the files in the current directory according to <pattern>”, “Give me a regex pattern for <…>”, or “write a JavaScript function to do <stupid simple thing, but I never bothered to learn JS>”

Especially using it as a regex pattern generator is nice. It can also be nice when learning a new language and you just need to check the syntax for something- often quicker than swimming though some Geeks4Geeks blog about why you should know how to do what you’re trying to do.

JustAnotherKay@lemmy.world · 25 days ago

Using an AI as a regex checker is so smart and I’m mad it never occured to me that it was possible lol. I’ve just been pouring over random forum posts for it

thebestaquaman@lemmy.world · 24 days ago

I’ve found that regex is maybe the programming-related thing GPT is best at, which makes sense given that it’s a language model, and regex is just a compact language with weird syntax for describing patterns. Translating between a description of a pattern in English and Regex shouldn’t be harder for that kind of model than any other translation so to speak.

CoggyMcFee@lemmy.world · 26 days ago

When I have it integrated into my development environment a la Copilot, predicting the next block of code I’m going to write (which I can use if it is relevant and ignore if not), I find it to be a huge timesaver.

db0@lemmy.dbzer0.com · 27 days ago

The free version is pretty braindead nowadays. Early on it was quite better.

oakey66@lemmy.world · 27 days ago

Same experience. It can serve as a starting point but usually I have to sift through so many bad answers until something usable is made available.

Kazumara@discuss.tchncs.de · 27 days ago

It did not simply analyze the best type of graphics card for the situation.

Yes it certainly didn’t: It’s a large language model, not some sort of knowledge engine. It can’t analyze anything, it only generates likely text strings. I think this is still fundamentally misunderstood widely.

leftzero@lemmynsfw.com · 26 days ago

I think this is still fundamentally misunderstood widely.

The fact that it’s being sold as artificial intelligence instead of autocomplete doesn’t help.

Or Google and Microsoft trying to sell it as a replacement for search engines.

It’s malicious misinformation all the way down.

Wren@lemmy.dbzer0.com · 26 days ago

Ok? I feel like people don’t understand how these things work. It’s an LLM, not a superintelligent AI. It’s not programmed to produce the truth or think about the answer. It’s programmed to paste a word, figure out what the most likely next word is, paste that word, and repeat. It’s also programmed to follow human orders as long as those order abide by its rules. If you tell it the sky is pink, then the sky is pink.

SPRUNT@lemmy.world · 26 days ago

Current AI is a glorified predictive text keyboard.

Wren@lemmy.dbzer0.com · 26 days ago

Exactly, it’s not something designed to output facts, it’s designed to output the most likely set of words.

WolfLink@sh.itjust.works · 26 days ago

It’s actually not really wrong. There are many VR games you can get away with low specs for.

Yes when you suggested a 3070 it just took that and rolled with it.

It’s basically advanced autocomplete, so when you suggest a 3070 it thinks the best answer should probably use a 3070. It’s not good at knowing when to say “no”.

Interesting it did know to come up with a newer AMD card to match the 3070, as well as increasing the other specs to more modern values.

mozz@mbin.grits.dev · 27 days ago

May I offer you a fairly convincing explanation

subignition@piefed.social · 27 days ago

This is the best article I’ve seen yet on the topic. It does mention the “how” in brief, but this analogy really explains the “why” Gonna bookmark this in case I ever need to try to save another friend or family member from drinking the Flavor-Aid

leftzero@lemmynsfw.com · 26 days ago

So, they’ve basically accidentally (or intentionally) made Eliza with extra steps (and many orders of magnitude more energy consumption).

mozz@mbin.grits.dev · 26 days ago

I mean, it’s clearly doing something which is impressive and useful. It’s just that the thing that it’s doing is not intelligence, and dressing it up convincingly imitate intelligence may not have been good for anyone involved in the whole operation.

leftzero@lemmynsfw.com · 26 days ago

Impressive how…? It’s just statistics-based very slightly fancier autocomplete…

And useful…? It’s utterly useless for anything that requires the text it generates to be reliable and trustworthy… the most it can be somewhat reliably used for is as a somewhat more accurate autocomplete (yet with a higher chance for its mistakes to go unnoticed) and possibly, if trained on a custom dataset, as a non-quest-essential dialogue generator for NPCs in games… in any other use case it’ll inevitably cause more harm than good… and in those two cases the added costs aren’t remotely worth the slight benefits.

It’s just a fancy extremely expensive toy with no real practical uses worth its cost.

The only people it’s useful to are snake oil salesmen and similar scammers (and even then only in the short run, until model collapse makes it even more useless).

All it will have achieved in the end is an increase in enshittification, global warming, and distrust in any future real AI research.

medgremlin@midwest.social · 26 days ago

(To be fair, I did manage to run Half Life: Alyx and Beat Saber on a 1060)

iAvicenna@lemmy.world · edit-2 26 days ago

there was a time when it confidently claimed that Turkey had some landmass in the southern hemisphere, it has come a long way since then

ngwoo@lemmy.world · 26 days ago

OP those minimum requirements are taken directly from the Meta Quest 3 support page.

SuperSleuth@lemm.ee · 26 days ago

There’s no way they used Gemini and decided it’s better than GPT.

I asked Gemini: “Why can great apes eat raw meat but it’s not advised for humans?”. It said because they have a “stronger stomach acid”. I then asked “what stomach acid is stronger than HCL and which ones do apes use?”. And was met with the response: “Apes do not produce or utilize acids in the way humans do for chemical processes.”.

So I did some research and apes actually have almost neutral stomach acid and mainly rely on enzymes. Absolutely not trustworthy.

Daemon Silverstein@thelemmy.club · 26 days ago

use

I guess Gemini took the word “use” literally. Maybe if the word “have” would be used, it’d change the output (or, even better, “and which ones do apes’ stomachs have?” as “have” could imply ownership when “apes” are the subject for the verb).

ITGuyLevi@programming.dev · 26 days ago

You’re taking the piss right? Those seem like perfectly reasonable responses.

What video card is required to use it? None, it can be used standalone.

What video card to use it streaming from your PC, at least a 580 sounds okay for some games. You seem to be expecting it to lie, and then inferring truthful information as a lie because the information you held back (which game you want) is the reason for the heavier video card requirement.

linearchaos@lemmy.world · 26 days ago

I don’t want to sound like an AI fanboy but it was right. It gave you minimum requirements for most VR games.

No man Sky’s minimum requirements are at 1060 and 8 gigs of system RAM.

If you tell it it’s wrong when it’s not, it will wake s*** up to satisfy your statement. Earlier versions of the AI argued with people and it became a rather sketchy situation.

Now if you tell it it’s wrong when it’s wrong, It has a pretty good chance of coming back with information as to why it was wrong and the correct answer.

vxx@lemmy.world · 26 days ago

I think we shouldn’t expect anything other than language from a language model.

cheddar@programming.dev · 27 days ago

It’s incorrect to ask chatgpt such questions in the first place. I thought we’ve figured that out 18 or so months ago.

ABCDE@lemmy.world · 26 days ago

Why? It actually answered the question properly, just not to the OP’s satisfaction.

snooggums@midwest.social · edit-2 27 days ago

All AI share a central design flaw of being what people think they should return based on weighted averages of ‘what people are saying’ with a little randomization to spice things up. They are not designed to return factual information because they are not actually intelligent so they don’t know fact from fiction.

ChatGPT is designed to ‘chat’ with you like a real person, who happens to be agreeable so you will keep chatting with it. Using it for any kind of fact based searching is the opposite of what it is designed to do.

amelia@feddit.org · 26 days ago

based on weighted averages of ‘what people are saying’ with a little randomization to spice things up

That is massively oversimplified and not really how neural networks work. Training a neural network is not just calculating averages. It adjusts a very complex network of nodes in such a way that certain input generates certain output. It is entirely possible that during that training process, abstract mechanisms like logic get trained into the system as well, because a good NN can produce meaningful output even on input that is unlike anything it has ever seen before. Arguably that is the case with ChatGPT as well. It has been proven to be able to solve maths/calculating tasks it has never seen before in its training data. Give it a poem that you wrote yourself and have it write an analysis and interpretation - it will do it and it will probably be very good. I really don’t subscribe to this “statistical parrot” narrative that many people seem to believe. Just because it’s not good at the same tasks that humans are good at doesn’t mean it’s not intelligent. Of course it is different from a human brain, so differences in capabilities are to be expected. It has no idea of the physical world, it is not trained to tell truth from lies. Of course it’s not good at these things. That doesn’t mean it’s crap or “not intelligent”. You don’t call a person “not intelligent” just because they’re bad at specific tasks or don’t know some facts. There’s certainly room for improvement with these LLMs, but they’ve only been around in a really usable state for like 2 years or so. Have some patience and in the meantime use it for all the wonderful stuff it’s capable of.