Plenty of progress in models that can use tools and search. Would love to see how one of these tool/search-enabled models do at this kind of a task. In my experience, they don't fabricate things anymore, just sometimes occasionally misrepresent the content of citations (put a citation somewhere where it doesn't actually support what is written).
A few days ago I asked GPT 5 for links to news on the Charlotte murder before the story got reported by the mainstream media. It gave me five different links, including AP and Reuters. Every one, five out of five, was a hallucination.
I asked GPT-5 for updated literature survey for a paper I was writing with search enabled and explicit asked to use google scholar arxiv etc and yet most papers were non existent and in some cases even pointed to some GitHub repos which were private.
It hallucinated complete documentation to the tech we asked it about just 2 weeks ago. Completely made up documentation with only vague relationship.to how it really works.
403 for me - which makes me wonder how anyone else is commenting on the actual content of the link, rather than just recycling general comments. without knowing the details.
Plenty of progress in models that can use tools and search. Would love to see how one of these tool/search-enabled models do at this kind of a task. In my experience, they don't fabricate things anymore, just sometimes occasionally misrepresent the content of citations (put a citation somewhere where it doesn't actually support what is written).
A few days ago I asked GPT 5 for links to news on the Charlotte murder before the story got reported by the mainstream media. It gave me five different links, including AP and Reuters. Every one, five out of five, was a hallucination.
I'm wondering how did you ask for the links?
It supposed to search for actual documents and then process them (extract content, summarize, giving you the links, and so on).
I asked GPT-5 for updated literature survey for a paper I was writing with search enabled and explicit asked to use google scholar arxiv etc and yet most papers were non existent and in some cases even pointed to some GitHub repos which were private.
It hallucinated complete documentation to the tech we asked it about just 2 weeks ago. Completely made up documentation with only vague relationship.to how it really works.
403 for me - which makes me wonder how anyone else is commenting on the actual content of the link, rather than just recycling general comments. without knowing the details.
Me too. https://web.archive.org/web/20250914073627/https://www4.cour... works
The title needs some punctuation, but the link works fine for me.
Two of them were real? That's a state-of-the-art model, compared to what I've seen…
A PhD-level degree in fabrication.