What a welcome post. The whole reason I come here is to get thoughtful input from smart people, and not what I could get myself from an LLM. While we are at it; Think your own thoughts as well :) I know how easy it is to "let it come up with a first draft" and not spend the real effort of thinking for yourself on questions, but you'll find it's a road to perdition if you let yourself slip into the habit. Thanks to all the humans still here!!
Same here, and similarly, I come here to find interesting submissions from smart people. I want to read their own thoughts in their own words, not what an LLM has to say. I'm capable of prompting my own LLM with their prompts if they'd supply them.
It would be great if we could have some kind of indicator that a submission is AI output, perhaps a submitter could vouch that their submission is AI or not, and if they consistently submit AI spam, they have their submission ability suspended or get banned.
Totally agree with you. I come here to read comments made by humans. If I need to read comments made by AI Bots I would go to Twitter or reddit, both made me not read the comments section entirely.
Funny, I was debating posting a note thanking the HN staff myself for adding this to the comment guidelines but I don't think it's possible to write one without sounding at least a little bit like a bot...
I'm hoping people catch that typo after reading "every single word, phrase, and typo (purposeful or not)" and smiled every time I've had someone post a PR with a fix for it (that I subsequently reject ;-)
great idea, but seems a little futile if there is no protection agains llms training on HN comments. ironically, if HN can succefully prevent llm content, it will become one of the best sources available for training data
Not really. Because the biggest problem with LLMs is that they can't right naturally like a human would. No matter how hard you try, their output will always, always seem too mechanical, or something about it will be unnatural, or the LLM will go to the logical extreme of your request (and somehow manage to not sound human)... The list goes on.
Quite! It's very easy to send a HN link to one of our new artificial friends to see what they have to say about it. Subsequently publicly posting the inference variation you receive strikes me as very self-centered. Passing it off as your own words - which the majority seem to - is doubly bizarre.
It's very funny to imagine people prompting: "Write a compelling comment, for me, to pass off as my thoughts, for this HN news thread, which will attract both upvotes and engagement.".
I agree with much of what you say, but it isn't as simple as "post to LLM, paste on HN". There are notable effects from (1) one's initial prompt; (2) one's phrasing of the question; (3) one's follow-up conversation; (4) one's final selection of what to post.
For me, I care a lot about the quality of thinking, as measure by the output itself, because this is something I can observe*.
I also care -- but somewhat less -- about guessing as to the underlying generative mechanisms. By "generative mechanisms" I mean simply "Where did the thought come from?" One particular person? Some meme (optimized for cultural transmission)? Some marketing campaign? Some statistic from a paper that no one can find anymore? Some dogma? Some LLM? Some combination? It is a mess to disentangle, so I prefer to focus on getting to ground on the thought itself.
* Though we still have to think about the uncertainty that comes from interpretation! Great communication is hard in our universe, it would seem.
Taking the time to write something, and read over it is a better skill than asking an LLM to do it for you.
Also, quality doesn't come from any of those points you've mentioned. Quality comes from your ability to think and reason through a topic. All those points you mention in your first paragraph are excuses, trying to make it seem like there was some sort of effort to get an LLM to write a post. It feels like fishing for a justification
>Taking the time to write something, and read over it is a better skill than asking an LLM to do it for you.
Furthermore, if someone doesn't think whatever they're saying is worth investing the time to do this, it's a signal to me that whatever they could say probably isn't worth my time either.
I don't know why this isn't a bigger part of the conversation around AI content. It shows a clear prioritization of the author's time over the readers', which fine, you're entitled to valuing your own time more than mine, but if you do, I'll receive that prioritization as inherently disrespectful of my time.
> Taking the time to write something, and read over it is a better skill than asking an LLM to do it for you.
Yes, this is a great skill to have: no argument from me. This wasn't my point, and I hope you can see than upon reflection.
> All those points you mention in your first paragraph are excuses, trying to make it seem like there was some sort of effort to get an LLM to write a post.
Consider that a reader of the word 'excuses' would often perceive an escalation of sorts. A dismissal.
> Quality comes from your ability to think and reason through a topic.
That's part of it. Since the quote above is a bit ambiguous to me, I will rephrase it as "What are the factors that influence the quality of a comment posted on Hacker News?" and then answer the question. I would then split apart that question into sub-questions of the form "To what extent does a comment ..."
- address the context? Pay attention to the conversational history?
- follow the guidelines of the forum?
- communicate something useful to at least some of the readers?
- use good reasoning?
One thing that all of the four bullet points require is intelligence. Until roughly ~2 years ago, most people would have said the above demand human intelligence; AI can't come close. But the gap is narrowing. Anyhow, I would very much like to see more intelligence (of all kinds, via various methods, including LLM-assisted brainstorming) in the service of better comments here. But intelligence isn't enough; there are also shared values. Shared values of empathy and charity.
In case you are wondering about my "agenda"... it is something along the lines of "I want everyone to think a lot harder about these issues, because we ain't seen NOTHING yet". I also strive try to promote and model the kind of community I want to see here.
You missed something much more important than all 4 of those points:
- what does the human behind the keyboard think
If you want us to understand you, post your prompts.
Some might suggest that the output of an LLM might have value on it's own, disconnected from whatever the human operating it was thinking, but I disagree.
Every single person you speak with on HN has the same LLM access that you do.
Every single one has access to whatever insights an LLM might have.
You contribute nothing by copying it's output, anyone here can do that.
The only differentiator between your LLM output and mine, is what was used to prompt it.
Don't hide your contributions, your one true value - post your prompts.
The prompt & any follow-ups do have notable effects, but IMO this just means that most of actual meaning you wanted to convey is in those prompts. If I was your interlocutor, I'd understand you & your ideas better if you posted your prompts as well as (or instead of) whatever the LLM generated.
Sure, I agree that getting something you want (top post) out of an LLM isn't zero-effort.
But this isn't about effort. This is about genuine humanity. I want to read comments that, in their entirety, came out of the brain of a human. Not something that a human and LLM collaboratively wrote together.
I think the one exception I would make (where maybe the guidelines go too far) is that case of a language barrier. I wouldn't object to someone who isn't confident with their English running a comment by an LLM to help fix errors that might make a comment harder to understand for readers. (Or worse, mean something that the commenter doesn't intend!) It's a privilege that I'm a native English speaker and that so much online discourse happens in English. Not everyone has that privilege.
The meaning of the word genuine here is pretty pivotal. At its best, genuine might take an expansive view of humanity: our lived experience, our seeking, our creativity, our struggle, in all its forms. But at its worst, genuine might be narrow, presupposing one true way to be human. Is a person with a prosthetic leg less human? A person with a mental disorder? (These questions are all problematic because they smuggle in an assumption.)
Consider this thought experiment. Consider a person who interacts with an LLM, learns something, finds it meaningful, and wants to share it on a public forum. Is this thought less meaningful because of that generative process? Would you really prefer not to see it? Why?
Because you can point to some "algorithmic generation" in the process? With social media, we read algorithmically shaped human comments, many less considered than the thought experiment. Nor did this start with social media. Even before Facebook, there was an algorithm: our culture and how we spread information. Human brains are meme machines, after all.
Think of human output as a process that evolves. Grunts. Then some basic words. Then language. Then writing. Then typing. Why not: "Then LLMs"? It is easy to come up with reasons, but it is harder to admit just how vexing the problem is. If we're willing, it is way for us to confront "what is humanity?".
You might view an LLM as an evolution of this memetic culture. In the case of GPT-OSS 120b, centuries of writing distilled into ~60 GB. Putting aside all the concerns of intellectual property theft, harmful uses, intellectual laziness, surveillance, autonomous weapons, gradual disempowerment, and loss of control, LLMs are quite an amazing technological accomplishment. Think about how much culture we've compressed into them!
As a general tendency, it takes a lot of conversation and refinement to figure out how to communicate a message really well to an audience. What a human bangs out on the first several iterations might only be a fraction of what is possible. If LLMs help people find clearer thinking, better arguments, and/or more authenticity (whatever that means), maybe we should welcome that?
Also, not all humans have the same language generation capacity; why not think of LLMs as an equalizer? You touch on this (next quote), but I am going to propose thinking of this in a broader way...
> I think the one exception I would make...
When I see a narrow exception for an otherwise broad point, I notice. This often means there is more to unpack. At the least, there is philosophical asymmetry. Do they survive scrutiny? Certainly there are more exceptions just around the corner...
This. LLMs are an autocomplete engine. They aren't curious. Take your curiosities and use your human voice to express them.
The only reason you should be using an LLM on a forum like this is to do language translation. Nobody cares about your grammar skills, and there really isn't a reason to use an LLM outside of that.
LLMs CANNOT provide unique objectivity or offer unknown arguments because they can only use their own training data, based on existing objectivity and arguments, to write a response. So please shut that shit down and be a human.
One thing that impressed me about HN when I started participating is how rarely people remark on others' spelling or grammatical mistakes. I myself have been an obsessive stickler about such issues, so I do notice them, but I recognize that overlooking them in others allows more interesting and productive discussions.
This resonates with me. Intent is hard to infer, so it seems better to engage with the content itself. Most ideas are recombinations of earlier ones anyway—the interesting part is the push and pull of refining thoughts together.
Many programmers believe that math is the best way to solve problems or order the world or whatever. There are lots of real 20 year olds out there using chatbots to "optimize" their humanities learning, or to "optimizing" using dating apps. It's a fact about this audience. Some people have a very myopic point of view, however, it coheres with certain cultural forces, overlapping with people of specific ethnic heritages, who are from California and New York, go to fancy school and post online, to earn tons of money, buy conspicuous real estate, date skinny women and marry young.
These aren't the marina bros, they're the guys who think they're really smart because they did well in math. They are using LLMs to reply to people. They LOOK like you. Do you get it?
To follow the pattern of your comment: You are missing the forest for the trees. Like many things, the difference between theory and practice matters here. In theory the only thing that matters is the idea. In practice the context and human element matters AND a culture of ai text could very much reduce the bar for quality.
An equivalent overly-pure reductive mistake is "why do you need privacy if you aren't doing anything wrong".
> The whole reason I come here is to get thoughtful input from smart people
I don't wanna be a party pooper here, but you will be lucky if the input satisfies one of those conditions. Getting input with both those attributes on HN is like finding life on Mars.
I'm here to read what actual humans think. If I wanted to read what an LLM thinks, I could just ask it.
But here's where it gets tricky: Do I prefer low-effort, off-the-top-of-my-head reactions, as long as it is human? Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
Am I here to read authentic humans because I value authenticity for its own sake (like preferring Champagne instead of sparkling wine)? Or do I value authentic human output because I expect it to be of higher quality?
I confess that it is a little of both. But it wouldn't surprise me if someday LLM-enhanced output becomes sufficiently superior to average human output that the choice to stick with authentic human output will be more painful.
> Do I prefer low-effort, off-the-top-of-my-head reactions, as long as it is human? Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
For me it's the first one every time. If only because LLM don't learn from responses to it (much less so when the response is to a paste of their output). It's just not communication. From that perspective, the quality of even the most brilliant LLM output is zero, because it's (whatever high value) multiplied by zero.
Even a real person saying something really horrible and too dense to learn from any response at least gives me a signal about what humans exist. An LLM doesn't tell me anything, and if wanted the reply of an LLM, I would simply feed my own posts into an LLM. A human doing that "for me" is very creepy and, to my sensibilities, boundary violating. Okay, that may be too strong a word, but it feels gross in a way I can't quite put my finger on, but reject wholeheartedly.
> Do I prefer low-effort, off-the-top-of-my-head reactions, as long as it is human? Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
This is an artificial dichotomy. HN’s guidelines specify thoughtful, curious discussion as a specific goal. One-off / pithy / sarcastic throwaway comments are generally unwelcome, however popular they are. Insightful responses can be three words, ten seconds to write and submit, and still be absolutely invaluable. Well-thought-out responses are also always appreciated, even if they tend to attract fewer upvotes than a generic rabble-rousing sentiment about DRM or GPL or Apple that’s been copy-pasted to the past hundred posts about that topic. But LLM-enhanced responses are not only unwelcome but now outright prohibited.
Better an HN with fewer words than an HN with more AI writing words. We’ve been drowned in Show HN by quantity as proof of why already.
But what if it turns out that human+LLM can produce more "thoughtful, curious discussion" than human alone?
That's the dichotomy: Do we prefer text with the right "provenance" over higher quality text?
[Perhaps you'll say that human+LLM text will never be as high-quality as human alone. But I'm pretty sure we've seen that movie before and we know how it ends.]
That said, you're right that because human+LLM is so much more efficient, we'll be drowning in material--and the average quality might even go down, even if the absolute quantity of high-quality content goes up.
I think, in the long term, we will have to come up with more sophisticated criteria for posting rather than just "must be unenhanced human".
HN need not offer itself up as a Petri dish for AI writing experimentation. There are startups in that space, and at least one must be YC-funded, statistically speaking. Come back with the outcomes of the experiment you describe and make a case that they should change the rule. Maybe they will! As of today, though, they are apparently unconvinced.
> the average quality might even go down
We have a recent concrete analysis of Show HN indicating support for this possibility, resulting in the mods banning new users for posting to Show HN (something they’ve probably been resisting for close to twenty years, I imagine, given how frequent a spam vector that must be).
> Perhaps you’ll say that human+LLM text will never be as high-quality as human alone
Please don’t put words in my mouth, insinuating the tone my reply before I’ve made it, and then use that rhetorical device to introduce a flamebait tangent to discredit me with. I’ve made no claims about future capabilities here and I’m not going to address this irrelevance further.
> in the long term, we will have to come up with more sophisticated criteria
Our current criteria seem sophisticated already. Perhaps you could make a case that AI-assisted writing helps avoid guideline violations. This one tends to be especially difficult for us all today:
”Please respond to the strongest plausible interpretation of what someone says, not a weaker one that's easier to criticize. Assume good faith. Eschew flamebait. Avoid generic tangents.”
> Please don’t put words in my mouth, insinuating the tone my reply before I’ve made it, and then use that rhetorical device to introduce a flamebait tangent to discredit me with. I’ve made no claims about future capabilities here and I’m not going to address this irrelevance further.
I apologize--the "you" I meant was the person currently reading my post, not the person I was replying to. I was merely trying to answer a common objection that I've heard.
> HN need not offer itself up as a Petri dish for AI writing experimentation.
I'm not sure HN has a choice. I don't think we can prevent posters from experimenting with LLMs to post on HN--even if they adhere to the guidelines. For example, can I ask the LLM to come up with the strongest argument it can and then re-write it in my own words? That seems to be allowed by the guidelines. Would someone even be able to tell that's what I did? [NOTE: I did not do that.]
I think you're arguing that we should not encourage even more use of LLMs on HN. I get that. But I feel like that this community is uniquely qualified to search for better solutions.
> Our current criteria seem sophisticated already.
I hope you're right! That implies that you believe the current guidelines are sufficient to keep HN as the place we all love despite the assault from LLMs. I'm skeptical, but I've been wrong plenty of times!
> I don't think we can prevent posters from experimenting with LLMs to post on HN
And yet, she persisted, we will still set guidelines; so that people know they’re unwelcome to do so when they do, so that they can’t argue that they didn’t know, so that we as a social club can strive towards the standards we argue about and accept from the organizers. The point of guidelines is not that they prevent malicious intent; the point is that they inhibit those behaviors that exceed the defined boundaries, however vague or precise they may be. Prevention of malice is an impossibility in all human social affairs, whether guidelines are defined or not; one must find other reasons for rules than prevention to understand why rules are at all.
> And yet, she persisted, we will still set guidelines
I'm not sure if you're including or excluding me from the "we". If you're excluding me, then I feel our conversation has come to an end.
But if you're including me, then I think the guidelines need to evolve to deal with LLMs. Maybe not right now--maybe the current guidelines are sufficient for the next year or two or three. But I think we as a community are uniquely qualified to design and influence the future of internet social clubs in the face of LLMs.
> I'm not sure if you're including or excluding me from the "we".
“We” here refers to individual human beings that are members of the human social-entity constructs (‘social clubs’) that precipitate naturally out of human groups, both in general to all such groups and in specific to the group under discussion here today, HN participants.
Whether or not you’re a member of “we” HN participants is conditional on whether or not you are honoring the policy of no AI-assisted writing at HN that is in effect as of whenever you saw this post or the new guidelines. I have no judgment to offer you in that regard, and in any case you’re readily able to decide that for yourself. Separately, I’m not engaging with discussion about future policy; perhaps you should start a top-level thread about it, or write a blog post and submit it (after a few days have passed, so it doesn’t get topic-duped and so that passions have cooled somewhat).
I think "must be unenhanced human" is probably the most sophisticated criteria even if it's simple. I don't think there's much value in trying to optimize the perfect "thoughtful, curious discussion", why would there be, it implies some ideal state for "thoughtful and curious" vs the reality that discussions between living breathing people is interesting by default as long as folks loosely follow some guidelines.
The guidelines are perfectly clear, no matter the outcome of your thought experiment. Hacker News wants intelligent conversation between human beings, and that's the beginning and the end of it.
If you want LLM-enhanced conversation then I'm sure you will find places to have that desire met, and then some. Hacker News is not that place, and I pray that it will never become that place. In short, and in answer to "Do we prefer text with the right "provenance" over higher quality text?".
I like to read human comments because I'd like to know what my fellow humans think. I'd prefer not to read low-effort, throw away comments, but other than that I want to know what people think about different topics.
I read HN both because I want to read what humans think, and because I want to read insightful discussion.
The tension is that as insightful discussion becomes easier/better with LLMs, there is less need to read HN. All I'm left with is provenance: reading because a human wrote it, not because it is uniquely insightful.
1. I enter "Describe the C++ language" at an LLM and post the response in HN. This is obviously useless--I might as well just talk to an LLM directly.
2. I enter "Why did Stroustrup allow diamond inheritance? What scenario was he trying to solve" then I distill the response into my own words so that it's relevant to the specific post. This may or may not be insightful, but it's hardly worse than consulting Google before posting.
3. I spend a week with creating a test language with a different trade-off for multiple-inheritance. Then I ask an LLM to summarize the unique features of the language into a couple of paragraphs, and then I post that into HN. This could be a genuinely novel idea and the fact that it is summarized by an LLM does not diminish the novelty.
My point is that human+LLM can sometimes be better than human alone, just as human+hammer, human+calculator, human+Wikipedia can be better than human alone. Using a tool doesn't guarantee better results, but claiming that LLMs never help seems silly at this point.
> 3. I spend a week with creating a test language with a different trade-off for multiple-inheritance. Then I ask an LLM to summarize the unique features of the language into a couple of paragraphs, and then I post that into HN
I think where you are getting hung up is the idea of "better results". We as a community don't need to strive for "better results" we can easily say, hey we just want HN to be between people, if you have the LLM generate this hypothetical test, just tell people in your own words. Maybe forcing yourself to go through that exercise is better in the long run for your own understanding.
But my point is that I read HN partly because people here are insightful in a way I can't get in other places. If LLMs turn out to ultimately be just as insightful, then my incentive to read HN is reduced to just, "read what other people like me are thinking." That's not nothing, but I can get that by just talking with my friends.
Unless, of course, we could get human+LLM insightfulness in HN and then I'd get the best of both worlds.
If someone can't explain something in their own words, then they don't _really_ understand it. The process of taking time to think through a topic and check one's understanding, even if only for oneself and the rubber duck, will reveal mistakes or points of confusion.
Which gets to the core of the issue nicely, I want to go on to HN and talk to people who know things or have thought about things to the degree that they don't need a cheat sheet off to the side to discuss them.
> Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
I'd argue that anything insightful or well-though-out doesn't use LLMs at all. We can quibble over whether discussions with an LLM lead to insightful responses, but that still isn't your own personal thought. Just type what's on your mind, it's not that hard and nitpicking over this is just looking for ways to open up unnecessary opportunities for abuse.
Often i think of a novel idea or solution to a problem, but use AI to communicate or adjust what I already wrote out so it’s more comprehensible. Sometimes when I write, it’s hard to understand.
The more you write, the less this will be true. The more you write, the better you will become at it. Using an LLM to write is like sending a robot to the gym for you.
The more you use an LLM to write for you, the worse you will become at writing yourself. There is simply no other possible outcome. It's even true of spellcheck - the more you use a spellcheck the worse you become at spelling. I know this for a fact because I can no longer spell for shit. However, spelling is to writing as arithmetic is to mathematics. I also can't add up, but I have a degree in pure mathematics.
LLMs are a cancer on human thought and expression.
> LLMs are a cancer on human thought and expression.
LLMs help to express what many people dont have the energy or ability to express. It also has a broader scoped view of protocol...It does not have emotions, which often leads to less than optimal discourse.
In many ways, it help those who are challenged in discourse to better express themselves...rather than keeping silent or being misunderstood.
So if you want to reply to a comment you read today, and you don't feel like your writing skill is up to snuff, you should be content with expecting to wait the requisite weeks or months or years of practice before even considering replying to it?
This seems especially relevant for non-English-fluent commenters, who are increasingly using LLMs to be able to communicate more effectively on an English-only site like Hacker News than they'd otherwise be able to do.
The difference is that I will retain what I drew out from the dictionary the next time. If people use Ai this way for writing, great! What many of the "enhanced-by-ai" arguments sound like is that this will be an indefinite outsourcing.
Use them to get better, like how reading good writing directly (not summarized) will also make you a much better writer. Learn from the before and after so next time there isn't a need to reach for Ai.
There is a sliding scale from that, to it being the LLM that communicates, not the person. LLMs can really reshuffle and change priorities and modify emphasis in a text. All the missing pieces will be filled in and rounded out and sandpapered off by the inner-average-corporate-HR-Redditor of the LLM.
By this logic, you might consider vibe coding a browser plugin that takes any HN comment less than 50 words and auto-expands it into an “insightful, well thought-out response.”
What is the value of this "output"? If I want to know what LLMs think about something, I can go ask an LLM any question I want. For a comment on [a site like] HN, either the substantive content of the comment originated inside a human mind, or there is no substantive content that I couldn't reproduce by feeding the comment's context into an LLM. At the extreme, I don't have any interest in reading or participating in a conversation between a bunch of LLMs.
The value proposition is that someone who is a lousy writer (perhaps only in English) with deep domain knowledge is going back and forth with the LLM to express some insight or communicate some information that the LLM would not produce on its own.
Wouldn't it work better to just write the thing in whatever language they can actually write in and then do a straightforward translation in a single pass?
> someone who is a lousy writer with deep domain knowledge going back and forth with the LLM to express some insight or communicate some information that the LLM would not produce on its own
This sounds reasonable on its face, but how often does it actually come up that somebody can't clearly express an idea in writing on their own but can somehow get an LLM to clearly express it by writing a series of prompts to the LLM?
And, if it does come up, why don't they just have that conversation with me, instead?
> Wouldn't it work better to just write the thing in whatever language they can actually write in and then do a straightforward translation in a single pass?
Nontrivial translation tools are AI(neural net)-based tools (although not necessary LLM). Whole transformer neural net architecture was originally designed for translation.
I don't have a problem with people using these tools to translate their writing into languages they aren't fluent/literate in. It's a completely different dynamic vs. having them write for you.
Using LLMs for research is completely different from using them to write for you. And if you're using them to write about the results of research, you're almost certainly getting a lot less value out of the whole exercise.
> Do I prefer low-effort, off-the-top-of-my-head reactions, as long as it is human? Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
Neither. I want insightful, well-thought-out, human comments.
It's a little sad that this might be too much to ask sometimes...
The point of a discussion site is to hear what other people think and get different perspectives. Just getting an LLM's insightful, well-thought-out response isn't really a big draw, if one is looking for that, there's a pretty obvious way to get it. I posted this the other day (ignore the title I realized later it's too clickbaity) but this is why IMO LLMs won't replace the workforce, people aren't looking for answers to things, they're looking for other people's takes: https://news.ycombinator.com/item?id=47299988
> If I wanted to read what an LLM thinks, I could just ask it.
and
> Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
What is the difference? What's the line between these two?
The prompt: "Analyze <opinion> and respond" is pretty clearly "I would just ask it."
and, the prompt: "here's my comment, please ONLY the check the grammar and spelling" would probably be ok.
What about prompt:"I disagree with using LLMs for commenting at all for <reasons>. Please expound on this and provide references and examples". That would explode the word count for this site.
1. "Here is my answer to a comment. Give me the strongest argument against it."
2. "I think xyz. What are some arguments for and against that I may not have thought of."
3. "Is it defensible for me to say that xyz happened because of abc?"
All of these would help me to think through an issue. Is there a difference between asking a friend the above vs. an LLM? Do we care about provenance or do we care about quality?
The difference is in the journey to find the answer, rather than outsourcing it to man or machine. Spending more time reflecting before first post will often answer the easy questions so you can formulate more thoughtful questions.
You can't. Nobody can. False positives are the inherent danger of these sorts of policies — especially when the LLMs were trained on the exact writing styles that have dominated online conversations and publications for decades.
Gonna put out a blanket assertion about my preferences, to get a read on whether these are shared or not:
As humans, we have directives (genetic, cultural, societal, etc.) to prioritize humanistic endeavors (and output) above all else.
History has shown that humans are overwhelmingly chauvinistic in regards to their relationship to other animals in the animal kingdom, even to the point of structuring our moral/ethical/legal systems to prioritize human wellbeing over that of other animals (however correct/ethical that may ultimately be, e.g., given recent findings in animal cognition, such as recent attempts to outlaw boiling lobsters alive as per culinary tradition).
But, it seems that some parties/actors are willing (i.e., benefiting) from subverting this long-standing convention (of prioritizing human interests) in the face of AI (even to the point of the now-farcical quote by Sam Altman that humans take far more nurturing than LLMs...)
So: should we be neglecting our historical and genetic directives, to instead prioritize AI over human interests? Or should we be unashamedly anthropic (pun intended), even at the cost of creating arbitrary barriers (i.e., the equivalent of guilds) intended to protect human interests over those of AI actors?
I strongly recommend the latter, particularly if the disruptions to human-centric conventions/culture/output are indeed as significant (and catastrophic) as they will likely be if unchecked.
> well-thought-out response, even if it is LLM-enhanced?
There's no insight nor well-thought-out response once a person decides to "LLM-enhance" their response. The only insight that the person using the LLM is too limited to have a decent conversation with.
Humans have more variability and "edge". If a person is passionately arguing for some point of view (perhaps somewhat outside the usual), it signals to me that they probably thought about this and it is a distillation of a long thought process and real-life experience. One could say that the logical argument should stand alone, but reality doesn't work that way. There are many things you have to implicitly trust and believe when you read. Of course lying and bullshitting already existed before ("nobody knows you're a dog" etc etc). But LLMs will really eloquently defend X, not X, X*0.5 and anything inbetween. There is no information content in it, it doesn't refer to an actual human life experience and opinion that someone wants to stand behind. It just means that someone made the LLM output a thing.
> But it wouldn't surprise me if someday LLM-enhanced output becomes sufficiently superior to average human output that the choice to stick with authentic human output will be more painful.
My ideal vision is that instead of outsourcing indefinitely, we learn from the enhanced versions and become better independent writers.
> Am I here to read authentic humans because I value authenticity for its own sake (like preferring Champagne instead of sparkling wine)?
Mate, Champagne is a sparkling wine. In French you can even at times hear people asking for "un vin mousseux de Champagne" meaning "a sparkling wine from Champagne" instead of the short form (just saying "un Champagne" or "du Champagne").
Now, granted, not all sparkling wine are Champagne.
The Wikipedia entry begins with: "Champagne is a sparkling wine originated and produced in the Champagne wine region of France...".
I drank enough of it to be stating my case, of which I'm certain!
P.S: and btw, yup, authentic humans content only here, even if it's of "low quality". If I want LLM, I've got my LLMs.
Basically you have Cremant type sparking wines which are produced from other regions of France besides Champagne. It is just like Champagne just that other French regions like Loire, Alsace, Bordeux etc are not allowed to call it Champagne.
So just like Armanac's are like Cognac's for lower price, good Cremant will be cheaper and more enjoyable that cheaper Champagne (I've not had any really expensive Champagne).
Then you have Cava from Spain which is similar process to Cremants and Champagne.
The difference would be in type of grapes used. A friend of mine swears by Cavas just like I swear by Cremants from Loire region. However my wife hates Cava.
Then Proseccos from Italy again are similar, but quality varies more.
After that we get into more questionable cheaper sparkling wines which usually means some sort of out of bottle insertion of CO2 and even worse version include some other modifications such as sugar.
In general to avoid literal headaches you want BRUTs. Anything semi-sweet or sweet is suspicous.
Again I am not a full wine expert but this is mostly years of ahem experience.
I keep wishing for a public place to put a formatted version of my LLM threads. I have long conversations with LLMs that usually result in some kind of documentation, tutorial, or dataset. Many of them are relatively novel, but I haven't created a place for them yet.
And no, I wouldn't think an HN post is it either.. I'm just saying, there should be a good place to post the output of good questions asked iteratively.
If you can’t even be arsed doing that how much value is there, really?
Personally the only thing less interesting to me than someone else’s conversations with an LLM is hearing about someone else’s dream they had last night but you never know, some people may be interested.
Simon Willison published something for turning Claude Convos into something publishable. [1] I haven't tried it, so cannot speak to the ergonomics.
Where to post it? Any blog site, probably a good few Show HN too. Will anyone read it? I haven't read anyone else's, I'm more inclined to dock them reputation for suggesting I read their Ai session. Snippets of weird things shared on socials were interesting to me early on, but I'm over that now too.
"AI-edited comments" is a very interesting one. Where is the line between a spelling/grammar/tone checker like Grammarly, that at minimum use N-Grams behind the scenes, and something that is "AI" edited? What I am asking is, is "AI" in this context fully featured LLMs, or anything that improves communication via an automated system. I think many people have used these "advanced" spellcheckers for years before Chatgpt et al came on the scene.
I think "generated comments" is a pretty hard line in the sand, but "AI-edited" is anything but clear-cut.
PS - I think the idea behind these policies is positive and needed. I'm simply clarifying where it begins and ends.
All this stuff is in flux. I thought a lot about whether to add the "edited" bit - but it may change. What I deliberately left out was anything about the articles and projects that get submitted here. There's a lot of turbulence in that area too, but we don't yet have clarity, or even an inkling, of how to settle that one.
Do the guidelines also disallow comments along the lines of "according to <AI>, <blah>"? (I ask this given that "according to a Google search, <blah>" is allowed, AFAIK.)
I would lean towards disallowing those. With "According to a Google search ...", someone can ask for specific links (and indeed, people often say to link to those sources to begin with instead of invoking Google). With "According to AI ... " - why would most readers care what the AI thinks? It's not a reliable source! You might as well say "According to a stranger I just met and don't know ..."
If you're going to say that the AI said X, Y, Z, provide a rationale on why it is relevant. If you merely found X, Y and Z compelling, feel free to talk about it without mentioning AI.
I think those should be allowed iff the nature of being AI-generated is relevant to the topic of discussion — e.g. if we're talking about whether some model or other can accurately respond to some prompt and people feel inclined to try it themselves.
I constantly read those comments and I personally have conflicting opinion with them. On one hand, it's interesting to compare what is coming out of models, but on the other hand, LLMs are all non-deterministic, so results will be fairly random. On top of that, everybody has a different "skill" level when prompting. In addition, models are constantly changing, therefore "I asked chatGPT and it said..." means nothing when there is a new version every few months, not to mention you can often pick one of 10+ flavors from every provider, and even those are not guaranteed to not be changed under the hood to some degree over time.
Please rethink the “edited” bit on accessibility grounds.
I have a kid with severe written language issues, and the utilisation of speech to text with a LLM-powered edit has unlocked a whole world that was previously inaccessible.
I would hate to see a culture that discourages AI assistance.
> I would hate to see a culture that discourages AI assistance.
Mostly I think the push back is about ai assistance in its current form. It can get in the way of communicating rather than assisting. The cost though is mostly borne by the readers and those not using the AI for assistance. I have seen this happen when the ai adds info and thoughts that were tangental to the original author and I think, but I can not verify times where an author seems to try to dig down on the details but seemingly can not.
Since it's mostly a good-faith rule to begin with, it seems easy to add something like, "unless you are using it as an assistive technology for accessibility reasons".
Yes, please at least have a carveout for accessibility. I definitely have dictated HN comments in the past, and my flow uses LLMs to clean it up. It works, and is awesome when you're in pain.
Hear hear. And like many other aspects of accessibility, it will help a huge number of people who may not have any severe issues. e.g. non-native English speakers using LLM-powered edits.
You should use your own words. It might seem that a tool like Grammarly is just an advanced spellcheck, but what it's really doing is replacing your personal style of writing with its own.
It's better to communicate as an individual, warts and all, than to replace your expression with a sanitized one just because it seems "better." Language is an incredibly nuanced thing, it's best for people's own thoughts to come through exactly as they have written them.
My elementary school kid came home yesterday and showed me a piece of writing that he was really proud of. It seemed more sophisticated than his typical writing (like, for example, it used the word "sophisticated"). He can be precocious and reads a ton, though, so it was still plausible that he wrote it. I asked him some questions about the writing process to try to tease out what happened, and he said (seemingly credibly) that he hadn't copied it from anywhere or referenced anything. He also said he didn't use any AI tools. After further discussion, I found out that Google Docs Smart Compose (suggested-next-few-words feature) is enabled by default on his school-issued Chromebook, and he had been using it. The structure of the writing was all his, but he said he sometimes used the Smart Compose suggestions (and sometimes didn't). He liked a lot of the suggestions and pressed tab to accept them, which probably bumped up the word choice by several grade levels in some places.
So yeah, it can change the character of your writing, even if it's just relatively subtle nudges here or there.
edit: we suggested that he disable that feature to help him learn to write independently, and he happily agreed.
That reminds me of one of the biggest IMO missing feature of Wordle: They never give a definition of the word after the game is finished! I usually do end up googling words I don't know (which is quite often) but I'm guessing I'm one of the few who goes to the trouble. I've even written to The New York Times a couple times to suggest adding a short definition at the end as I honestly feel like a ton of people could totally up their vocabulary game and it surely could be added with minimum effort (considering they even added a Discord multiplayer mode).
The analogy with tab-completion of code seems apt. At first you blindly accept something because it has at least as good a chance of working as what you would have typed. Then you start to pay attention, and critically evaluate suggestions. Then you quickly if not blindly accept most suggestions, because they're clearly what you would have written anyway (or close enough to not care).
Emacs has completion (but you can bind it to tab). The nice thing is that you can change the algorithm to select what options come up. I’ve not set it to auto, but by the time I press the shortcut, it’s either only one option or a small sets.
From his description, it sounded like this was more of #1. He cared a lot about the topic he was writing about, and has high standards for himself, so it's very likely that he would have considered and rejected poor suggestions.
I have mixed feeling about it. On the one hand, you're right: carefully considering suggestions can be a learning opportunity. On the other hand, approval is easier than generation, and I suspect that without flexing the "come up with it from scratch" muscle frequently, that his mind won't develop as much.
Oh how I despise these suggestions. You sometimes look for a way to express something and you are on the verge of giving the world something truly original, but as soon as your brain sees the suggestion it goes "oh yeah that fits"
I disabled them immediately, it feels like the tech version of the ADHD person who keeps interrupting you with what they think you are trying to say. Even if the suggestion is correct, it saves you at most 2 seconds at the cost of interrupting you constantly.
True! There's an important cybernetic aspect to all this, where an automatic suggestion can be an interruption, sometimes worse if the suggestion is decent.
A certain amount of friction is necessary, at least if the goal is to help the person learn or make something original.
I see the suggestions and then choose something different anyway. I don't want to use one of the top 3 most popular responses to an email from a friend. Even if it's something transactional.
As an adult, I do too. As a middle schooler, we absolutely used word processors’ thesaurus features to add big words to our essays because the teachers liked them.
One problem I see is that LLMs have a more nuanced... well, model of how words and their meanings relate to each other than a dead-tree thesaurus could ever present, what with its simplified "synonym" and "antonym" categories. Online versions try to give some similarity metrics, but don't get into the nuance. (It's not as if someone who takes either approach would want to spend the time reading and understanding that, anyway.)
> she could tell when students were using it to make their writing more fancy pants
I had two teachers who called us out on this, and actually coached us on our writing, and I remember them fondly. (They were also fans of in-class essaying.)
As a non native English speaker my own words wouldnt be in English. If I express myself in English I soon struggle for the right words. On the other hand I think when I read some English text I'm quite capable of sensing the nuances. So it feels when I auto translate my text to English an than read against it again and make some corrections, I can express my thoughts much better.
For what it's worth, I had a quick look through your comment history and your English seems just fine to me as a native speaker (at least for informal communication).
People who don't have English as their first language often seem to underestimate how good their English actually is. I wonder if it's because their reference point is formal English rather than the much more forgiving English we use in casual day-to-day conversation.
Books and newspapers have had editors for centuries. It is just code review for the written word.
[It looks like MS Word 97 had the ability to detect passive voice as well, so we're talking 30 year old technology there that predates LLMs -- how far down the Butlerian Jihad are we going with this?]
I don't personally use AI/LLMs for any informal writing here or on reddit, etc. But I think it is pretty weird to be overly concerned around people, particularly ESL, who use tools to clean up their writing. The only thing I really care about is when someone posts LLM regurgitated information on topics they personally don't know anything about. If the information is coming from the human but the style and tone is being tweaked by a machine to make it more acceptable/receptive and fix the bugs in it, then I don't understand why you're telling me I need to care and gatekeeping it. It also is unlikely to be very detectable, and this thread seems to only serve a performative use for people to get offended about it.
Other tools to clean up writing are allowed. They did not tell you you must care. You told them they must not. The submisson's use was to tell you and others LLM generated tone was not more acceptable.
I was just re-reading the passage from Plato's "The Phaedrus" on writing & the "art" of the letter for an essay I'm working on, and your remark is salient for this discussion on LLM-style AI and social media at large.
>It's better to communicate as an individual, warts and all, than to replace your expression with a sanitized one just because it seems "better."
It is definitely not true that it is better for a poster to communicate like an individual when it comes to spelling and grammar. People ignore posts that have poor grammar or spelling mistakes, and communications that have poor grammar are seen as unprofessional. Even I do it at a semi-subconscious level. The more difficult or the more amount of attention someone has to pay to understand your post, the less people will be willing to put in that effort to do so.
> It's better to communicate as an individual, warts and all, than to replace your expression with a sanitized one just because it seems "better." Language is an incredibly nuanced thing, it's best for people's own thoughts to come through exactly as they have written them.
This is the opposite of how language works. You want people to understand the idea you're trying to communicate, not fixate on the semantics of how you communicated. Language is like fashion - you only want to break the rules deliberately. If AI or an editor or whatever changes your writing to be more clear and correct, and you don't look at it and say "no, I chose that phrasing for a reason" then the editor's version is much more likely to be understood correctly by the recipient.
That's true, but on the flip side I regularly get downvoted because my English is not the best, so say it mildly. So, now I need to be really careful, to a) write in a good English or b) not to be recognised as an LLM corrected version of my English. Where is the line? I shouldn't be downvoted for my English I think, but that is the reality.
Edit:
I already got downvoted. :-) Sure, no one can tell exactly why. Maybe the combination of bad English _and_ talking sh*ce isn't ideal at all. :-D Anyways, I have enough karma, so I can last quite a while..
The quality of my writing varies (based on my mood as much as anything else, I suppose), but when it is particularly good and error-free then I often get accused of being a bot.
Which is absurd, since I don't use the bot for writing at all.
It’s possible of course but reading all the comments from various non-native English speakers here it seems like a common story. It may indicate a subliminal bias in readers (most of whom are presumably American).
Note that those comments are written in perfectly understandable English. Further note how often you come across comments written in perfectly understandable English, but they're downvoted anyway.
It suggests a bias in writers to assume that people would agree with them if only they could express their thoughts accurately.
But the problem is that people with poor written language / english skills are 'competing' with people who have superb skills in this domain.
There are people here who sit at a desk all day banging out multipage emails for work who decide to write posts of a similar linguistic calibre for funsies.
Meanwhile you have someone in a developing country who just got off a brutal twelve hour shift doing manual labour in the sun who wants to participate in the conversation with an insightful message that they bang-out on a shitty little cellphone onscreen keyboard while riding on bumpy public transit.
You could have a great idea and express it poorly and be penalized for doing so here while someone could have a blah idea expressed excellently and it's showered in replies despite being in some metrics (the ones I think are most important) worse than the other post.
This is a perfect example of what I'm talking about.
I knew someone was going to comment on my use of the word there despite me putting it in quotes which was intended to let the reader know that I meant that word as an approximation of what I was meaning.
When I say competing I mean competing in the space of ideas here. There is a ranking system here that raises or lowers the visibility and prominance of your comments and it's based on upvotes by other uses. For better or worse people penalize comments with grammatical errors over ones that don't and that affects how much exposure other users have to the ideas that people write and how much interaction they get from them.
If that's the case why would somebody who has good ideas but poor expressive capability bother posting here if their comments are just going to get ignored over relatively vapid comments that are grammatically correct?
> If that's the case why would somebody who has good ideas but poor expressive capability bother posting here if their comments are just going to get ignored over relatively vapid comments that are grammatically correct?
No, I get your point. Unfortunately, alot of people here try to act high and mighty like they are posting here for some altruistic reason. The reason why I, you, and everyone else posts here is the human reason that we want others to engage with our posts. In order to do that, you have to put your best foot forward, which includes making sure the spelling and grammar of your posts is correct. While I do not use an LLM for this, I think that it is vaild to use these tools to make sure nothing gets in the way of whatever point you are trying to make.
> In order to do that, you have to put your best foot forward
In English. You have to put your best foot forward in English. And in your environment with the resources you have at your disposal.
For example, I'm currently engaging with you between steps in a chemistry process that's happening under the fumehood next to me while wearing a respirator, a muggy plastic chemical resistant gown and disposable gloves nitrile globes.
I am absolutely certain that these conditions are different than the ones I would need to 'put my best food forward' in this discussion. I'm also certain that quite certain that you and I would both absolutely stumble if we were obligated to particpate in this forum in a language that we're not proficient in as many users often attempt to do and are unfairly penalized for by other members of the community.
I'm with you on the LLM usage for grammatical issues for non-native speakers. I bet more in this community would feel the same way if Dang whimsically mandated that people had to use a language other than English on certain days of the week.
> You could have a great idea and express it poorly and be penalized for doing so here while someone could have a blah idea expressed excellently and it's showered in replies despite being in some metrics (the ones I think are most important) worse than the other post.
I absolutely do not understand this comment. Are you saying that posting is competitive and that comments have "metrics"?
Yes! If my comment is above yours in a thread, it means I got more upvotes than you did, which means I get special bonuses and more to eat and you go hungry in Internet land. Also it means I'm better than you (obviously) and I get to go to this secret club with all the pretty people and you're not invited. Isn't that how this all works?
I disagree. HN is going to bury my raw unedited tirade of a comment about those fucking morons that couldn't code their way out of a paper bag. If I send a comment to ChatGPT and open up the prompt with "this poster is a fucking dumbass, how do I tell them this" and use that to get to a well reasoned response because that's the tool we have available today, we're all better off.
The guidelines state:
> Be kind. Don't be snarky. Converse
> Edit out swipes.
> Don't be curmudgeonly.
On the best of days I manage to follow the rules, but I'm only human. If I run my comment through ChatGPT to try and help me edit out swipes on the bad days, that's not ok?
I'm not using ChatGPT to generate comments, but I've got the -4 comments to show that my "thoughts exactly as they have written them" isn't a winning move.
If you see an incompetent coder and wish to communicate that the person responsible is a "fucking moron/dumbass", the tone with which you do so is not the problem. Tell us what is wrong with the code, as objectively as possible. That's what the guidelines are trying to convey.
I'm not sure I agree with this. I don't really want to see someone else's stylistic "warts".
I just want clean, easy-to-read content and I don't care about the person who wrote it. A tool like Grammarly is the difference between readable and unreadable (or understandable and understandable) for many people.
You could run the comments everyone else posts through an AI tool and ask it to rephrase it so that it is clean, and easy-to-read.
You could even write a plugin for your favorite web browser to do that to every site you visit.
It seems hard to achieve the inverse that is (would you rather I use i.e.?) rewrite this paragraph as the original author did before they had an AI re--write it to make it clean, (--do you like oxford commas, and em/en dashes! Just prompt your AI) and easier to read
> You could run the comments everyone else posts through an AI tool and ask it to rephrase it so that it is clean, and easy-to-read.
For those coming from a language other than English, you are more likely to lose information by using a tool to “reconstruct” meaning from poorly phrased English as an input, as opposed to the poster using a tool to generate meaningful English from their (presumably) well-written native language.
There's a big difference between me running a filter on other people's words, and those people themselves choosing to run one and then approving the results.
I personally don't see a problem with someone using a grammar checker as long as they aren't just blindly accepting its suggestions. That said, if someone actually is using it in that way, it shouldn't be detectable anyway, so it probably doesn't matter all that much whether or not it's included in the letter of the rule.
On a technical level, you can really only guard against changing your semantics and voice - if you're letting software alter the meaning, or meanings, you intend, and use words you don't normally use, it's probably too far.
This is probably ok:
>> On a technical level, you can really only guard against software that changes your semantics or voice. If you're letting it alter the meaning (or meanings) you intend, or if it starts using words you would never normally use, then it's gone too far.
This is probably too far:
>>> On a technical level, it's important to recogn1ize that the only robust guardrail we can realistically implement is one that prevents modifications to core semantics or authorial voice. If you're comfortable allowing the system to refine or rephrase the precise meanings you originally intended — or if it begins incorporating vocabulary that doesn't align with your typical linguistic patterns — then you've likely crossed a meaningful threshold where the output no longer fully represents your authentic intent.
Something to consider is that you can analyze your own stylometric patterns over a large collection of your writing, and distill that into a system of rules and patterns to follow which AI can readily handle. It is technically possible, albeit tedious, to clone your style such that it's indistinguishable from your actual human writing, and can even icnlude spelling mistakes you've made before at a rate matching your actual writing.
AI editing is weird, though. Not seeing a need, unless English isn't your native language.
I think that the line between A"I" editing to fix grammar or to translate from a different native language and A"I" editing by using an LLM is one of those things that's very hard to unambiguously encode in written guidelines, but easy to intuitively understand using common sense, in the vein of I know it when I see it.
> Where is the line between a spelling/grammar/tone checker like Grammarly
For me, the line is precisely at the point where a human has something they want to say. IMO - use the tools you need to say the thing you want to say; it's fine. The thing I, and many others here, object to is being asked to read reams of text that no-one could be bothered to write.
I think the only practical litmus test here is whether you can stand by the text as your own words. It’s not like we have someone looking over commenters’ shoulders as they type.
Ultimately, this comes down to people making a good-faith judgment about how much AI was involved, whether it was just minor grammatical fixes or something more substantial. The reality is that there isn’t really a shared consensus on exactly where that line should be drawn.
When a policy is introduced to seemingly guard against new problems, but happens to be inadvertently targeting preexisting and common technology, I don't feel like it is "lawyering" it to want clarity on that line.
For example, it could be argued this forbids all spellcheckers. I don't think that is the implied intent, but the spectrum is huge in the spellchecker space. From simple substitutions + rule-based grammar engines through to n-grams, edit-distance algorithms, statistical machine translation, and transformer-based NLP models.
Finding it more refreshing these days when reading text with broken grammar, incorrect use of pronouns, etc. especially for HN, the human connection is more palpable. It’s rarely so bad that it’s not understandable
Grammarly use is outright prohibited by this; AI-edited writing is no longer writing that you hold personal and exclusive responsibility for having written. Consider Stephen Hawking’s voice box generator. While the sounds produced were machine-assisted, the writing was his alone. If you find yourself unable to participate in this web forum without paying a proofreader (in time, money, or cycles) to copy-edit your writing, then you’re not welcome on HN as a participant.
> If you find yourself unable to participate in this web forum without paying a proofreader (in time, money, or cycles) to copy-edit your writing, then you’re not welcome on HN as a participant.
It’s not sarcasm. If you feel if I have misunderstood the intent of the guideline we’re discussing — “Don’t post generated/AI-edited comments”, as the title currently reads, then I’m happy to discuss further. (I often make logical negation errors that I miss in proofing, so it’s possible I slipped up, too!)
I thought it was sarcasm given you are asking people to “pay a proofreader”. This sounds ludicrous. Could you clarify wha you meant by that line if it’s not sarcasm? Because I’m having a hard time thinking that it’s meant to be taken at face value.
No worries. The post I replied to was asking if use of ‘grammar improvement services’ (my paraphrase) qualified as AI-assisted writing at HN. All such services cost something; Grammarly makes a lot of money charging businesses, AI consumes watts of power that someone pays for, and even Microsoft Word’s grammar checker spins up the CPU fans on an old Intel laptop with a long enough document. I took from that the generic point that one “pays” for machine-assisted proofreading by one means or another, whether it’s trading personal data for services (Google) or watts of power for services (MSWord et al.) or donating writing samples to a for-profit training corpus (Grammarly free tier) or paying for evaluations where your data is not retained for training (Grammarly paid enterprise tier with a carefully-redlined service contract) and generalized to “pay for machine proofreading”.
Then, I considered whether HN would appreciate posts/comments by a human where they’d had a PR team or a hired editor come in and review/modify/distort their original words in order to make them more whatever. I think that this probably is most likely to have occurred on the HN jobs posts, and I’ve pointed out especially egregious instances to the mods over the years — but in general, the people who post on HN tend to do so from their own voice’s viewpoint, as reaffirmed by the no-AI-writing guideline above. So I decided instead to say “pay a proofreader” because, bluntly, if the community found out that someone was paying a wage to a worker to proofread their HN comments, the response would plausibly be the same mob of laughing mockery, disgusted outrage, and blatant dismissal that we see today towards AI writing here. “You hired someone to tone-edit your HN comments?!” is no different than “You used Grammarly to tone-edit your HN comments?!” to me, and so it passed the veracity test and I posted it.
I saw a similar conversation somewhere about some project saying they don't allow AI generated code.
It was asked that if "AI Generated Code" is just code suggested to you by a computer program, where does using the code that your IDE suggests in a dropdown? That's been around for decades. Is it LLM or "Gen AI" specific? If so, what specific aspect of that makes one use case good and one use case bad and what exactly separates them?
It's one of those situations where it seems easy to point at examples and say "this one's good and this one's bad", but when you need to write policy you start drowning in minutia.
Projects cannot allow AI generated code if they require everything to have a clear author, with a copyright notice and license.
IDE code suggestions come from the database of information built about your code base, like what classes have what methods. Each such suggestion is a derived work of the thing being worked on.
Nobody is actually confused about what AI generated code means in those cases, they're just trying to be argumentative because they don't like the rules
Your comment is one of semantics. Worth discussing if we're talking a truly hard line rule rather than the spirit of the rule.
I benefit from my phone flagging spelling errors/typos for me. Maybe it uses AI or maybe it uses a simple dictionary for me. Maybe it might even catch a string of words when the conjunction isn't correct. That's all fair game, IMO. But it shouldn't be rewriting the sentence for me. And it shouldn't be automatically cleaning up my typos for me after I've hit "reply". That's on me.
i don't care if someone has bad grammar, i want to hear their thoughts as they came up with them, we're all intelligent beings and can parse the meaning behind what you write.
i type my comments without capitalization like i'm typing into some terminal because i'm lazy and people might hate it but i'm sure they prefer this to if i asked an LLM to rewrite what i type
your writing style is your personality, don't let a robot take it away from you
I, on the other hand, find incorrect grammar mildly annoying, especially when it's due to laziness. It distracts from the thoughts being conveyed. I appreciate when people take the time to format comments as correctly as they're able.
In fact, I'd argue that lazy commenting is the real problem, which has now been supercharged by LLMs.
I mostly agree, although we've seen big shifts in the culture towards rule-deviating norms over time. Look at the guidelines for ideological battles or throwaway accounts, for example. And, as always:
> Comments should get more thoughtful and substantive, not less, as a topic gets more divisive.
This discussion reminds me of the Paradigms of Power featured in Adiamante by L E Modisett; about consensus, power, morality and society. It’s a good read.
That’s true. Fortunately, by virtue of it being added to the guidelines, quite a few folks here are prepared to reply to obviously generated comments by simply citing and linking the rule. Just search for “shallow dismissal” to see many examples.
It will take time, but eventually everyone will know about it.
> quite a few folks here are prepared to reply to obviously generated comments by simply citing and linking the rule
Note that the guidelines do explicitly say not to post about guidelines violations in comments, and to email them instead. I know this isn’t a well-loved guideline in this modern era, but duly noted: those well-intended comments are themselves breaking the guidelines.
> Please don't post insinuations about astroturfing, shilling, brigading, foreign agents, and the like. It degrades discussion and is usually mistaken. If you're worried about abuse, email hn@ycombinator.com and we'll look at the data.
If so, that seems different. If not, can you clarify?
That one, yes. “Insinuations” is a less conditional form of “Accusations”, connected by the concept of “Claims”; they’re all synonymous from a general perspective:
- I insinuate that you are a bot (often shortened to “Is this a bot?”)
- I claim that you are a bot. (often shortened to “This is a bot.”)
- I accuse you of being a bot l. (often shortened to “Are you a bot?”)
The part where I’m interpreting to include accusations of bottery and slop is “and the like. It”; the first clause, ‘the like’ refers to the generic category of accusations against posted comments, which historically were the listed examples, but is also defined to include others not listed, such as today’s popular accusations of bot or AI; the second clause, ‘It’, refers to all insinuations-class content. Without the list of examples, this reads:
’Please don’t post insinuations. It degrades discussion
Yep, this is true. Accusations, Insinuations, Claims, of bot or AI or astroturf; they all wreck discussions and I end up having to email the mods to deal with them. A lot of people use the rhetorical device of Discredit The Opposition by invoking this sort of thing, and while that’s less prevalent in ‘reads like AI’ insinuations, they still degrade the site.
With AI-assisted writing is a violation of site guidelines, and even before it was, posting of AI-assisted writing was a clear ‘abuse’ of the community’s expectations of unassisted-human discussions. Aside from expectations, I can also classically understand in Internet history that ‘violating the guidelines’ is the phrase formerly known as ‘abuse of service’, by which I interpret the above reference to abuse to refer to breaking the guideline about posting accusations.
The guidelines are not a legal contract as program code, and perhaps this one is clunky enough that it needs to be reworded slightly; thus my intent, once the flames die down here, to let the mods know about the confusion. As I’m not a mod, this is my interpretation alone; you might have to email the mods and ask them to reply here if you want a formal statement on the matter, given how many comments this thread got in a couple hours.
ps. On ’and is usually mistaken’: I’m not a mod, so I can’t judge how often accusations of AI/bot are mistaken, but I’m also an old human who learned em-dashes in composition class, so I tend to view the modern pitchfork mobs out to get anyone who can compose English as being less accurate in their judgments than they believe they are.
What constitutes “at edited”. If I throw a block of text in to an ai see if it makes sense — say a response to a post — and fold the suggestions in, is that “ai edited”?
Um, why would you do that instead of waiting for someone more knowledgable to reply, and learn from? Replies are not mandatory, and experts/insiders participating is one of the best parts of the human Internet. Let them shine.
It can catch things that I might miss or might be misinterpreted. I sometime miss simple things, like like repeated words, that an AI point out. Is a spell checker considered "AI"? Is Grammerly? Okay, maybe Grammerly from 5 years ago as opposed to today? If I'm typing on my phone and it pops up the next suggested word, is that AI edited?
And no, I don't have to reply to a post, but when I think it's a bad policy, should I just accept it without discussion? And who determines the "experts/insiders" and which voices should be allowed?
Yes, and it’s not like they just insta-ban every infraction.
I’ve broken the guidelines on this site before. The mods reply and say “hey, stop doing that, here is the guideline”. I stopped doing it. Life continues.
I believe the issue of proving who is and who isn't really human on the Internet will be a really important issue in the coming years, especially without sacrificing people's right to privacy and anonymity in the process.
I don't think the real issue is LLM posts. The issue with low quality on the Internet has always been quantity. The problem always has been humans who post too much, humans that use software to post too much, and now it's humans who use LLMs to post too much.
The problem with a medium that is completely free and unrestricted is that whomever posts the most sort of wins. I could post this opinion 30-40 times in this thread, using bots and alternative accounts, and completely move the discussion to be only this.
Someone using an LLM is craft a reply is not a problem on it's own. Using it craft a low-effort reply in 3 seconds just to get out is the problem.
> Someone using an LLM is craft a reply is not a problem on it's own.
No, someone using an LLM to craft a reply is a problem in its own. I want to hear what a human has to say, not a human filtered through a computer program. No grammar editing, nothing. Give me your actual writing or I'm not interested.
Do you though? Like what real difference does it make to you? Can you even tell if this has been passed through an LLM or not? If you can't tell, why does it matter?
I don't want to be robo-slopped at en masse or be fed complete fabrications but neither of those actually require an LLM. If you're going to use an LLM to gather your thoughts, I don't see a problem with that.
the difference is that you get to see the unfiltered, unique perspective of a real human being. Just like I don't want to talk to anyone through an instagram or tiktok beauty filter or accent remover. If your thoughts are unordered, it's okay I'll take your unordered thoughts over some smoothed over crap.
Do people have really such a low opinion of themselves that they have to push every single thing through some kind of layer of artifice?
> the difference is that you get to see the unfiltered, unique perspective of a real human being.
The implicit unfounded assumption is whether that's actually worth more than a well written orderly response. Most comments are kind of crap.
Not everyone is good at writing. In some cases, it might even be a disability aid. And if their comments aren't good, we have a system in place to rank them accordingly. Again, I think the only problem is quantity. If we're overrun with low-effort posts, no amount of ranking will help that.
Amusingly your comment carries some of the tropes of AI authorship ("is not a problem on it's own....is the problem") but it's not shaped like a profound insight is being discovered in every line is what makes it human.
How much of AI writing will pass under the radar when the big companies aren't all maximizing to generate the most engagement hacking content in a chatbot UI? Maybe it'll still stand out for being low quality, but I'm not sure. There's lots of low quality human authored content.
Not sure where my comment is going, I just kinda rambled.
If you had the LLM write the comment, then it wasn't your thoughts.
I sometimes wonder if people aren't forgetting why we're on this platform.
The goal is to have an interesting discourse and maybe grow as a human by broadening your horizon. The likelihood of that happening with llms talking for you is basically nil, hence... Why even go through the motion at that point? It's not like you get anything for upvotes on HN
> If you had the LLM write the comment, then it wasn't your thoughts.
But what if I provided the LLM my thoughts? That's actually how I use LLMs in my life -- I provide it with my thoughts and it generates things from those thoughts.
Now if I'm just giving it your comment and asking it to reply, then yes, those aren't my thoughts. Why would I do that? I think the answer goes back to my original point.
If I'm telling you my thoughts and then you go and tell a friend those thoughts, would you say those are still my thoughts even though I wasn't the one expressing them directly to your friend?
I like to think about it in terms of output-to-prompt ratio. For HN comments, I think an output ratio of 1 or less is _probably_ fine. Examples:
- translating (relatively) literally from one language to another would be ~1:1.
- automatic spelling/grammar correction is ~1:1
- Using an LLM to help you find a concise way of expressing what you mean, i.e. giving it extra content to help it suggest a way of phrasing something that has the connotation you want, would be <1:1
Expansion (output > prompt) is where it gets problematic, at least for HN comments: if you give it an 8 word prompt and it expands it to 50, you've just wasted the reader's time -- they could've read the prompt and gotten the same information.
(expansion is perfectly fine in a coding context -- it often takes way fewer words to express what you want the program to do than the generated code will contain.)
As for expansion, that might just be the risk we take. I been downvoted on reddit for being "too verbose" in my replies and I'm a human. And perhaps just reading the prompt in that case wouldn't give you more information; the LLM might actually have some insight that is relevant to the conversation. What's the difference between that and googling for something and pasting it in?
I hope I'm wrong but I don't think a privacy friendly alternative is going to exist. It's going to go the way of show me your drivers license to use my site.
Why wouldn't criminals like they do now just use stolen identities? If someone verifies they are a person that doesn't mean they're not leaving their PC on with some AI that uses their credentials either.
The point of these systems is not to ban any possibility of fake accounts. The point is to add friction so that creating accounts is harder than banning them, so criminals can’t recreate them at scale. Otherwise bans take seconds to overcome and a single person can run 10000 automated identities.
I feel like we need a distributed system/protocol that allows people to have pseudonyms not linked to their real identity, but with a shared reputation/trust score, so if you’re a bad actor using a pseudonym your real identity and all your other sock puppets are penalized too.
I know very little about this but sense that some combination of buzzwords like homomorphic encryption, zk-snarks, and yes, blockchains could be useful.
Of course this would present problems if any of your identities were ever compromised and your reputation destroyed.
Driving everything by reputation-weighted identities just creates echo-chambers you then cannot escape.
The most useful time for the blowhard spout off at me is at the moment it makes me most uncomfortable. Because the blowhard probably has a valid point at some level, he’s just being an ass about it.
When we meet that moment with discipline, are able to identify and respond to the kernels of truth and ignore the chaff belted out, focus on the merits of the argument irrespective of the source of an adversarial viewpoint, we thrive.
I like the blowhards just the way they are, unruly and insolent.
No credential will be sufficient, this is basically an unsolvable enforcement problem. That doesn't obviate the utility of rules and norms, but there's no airtight system which will hold back AI generated content.
Verifiable credentials have been an idea for a long time now. It wouldn't be that hard to solve. Sign everything you post with a verifiable credential. Implement support on all social media sites. The question is whether the forum implementers, governing bodies, and social media site owners want to try to build a solution like this or not.
Layer on captchas. It won't completely stop slop but it's an incentive against slop flooding. And I mean, nothing is stopping a human from just going into ChatGPT by hand and asking for output and copy/pasting that into an HN post box.
That is exactly what will happen. The sad thing is, it needs to happen. I've found myself advocating for this lately, when 10 years ago, I wouldn't have even considered taking that position.
If Web3-like session-signing had taken off enough to become OS or even browser-native, we would have had a fighting chance of remaining mostly anonymous. But that just didn't happen, and isn't going to happen. Mostly because fraud ruined Web3.
There's literally no other way to combat rampant botting, child abuse, and nation-state originating disinformation campaigns and the intentional creation of public discord.
A completely anonymous stranger has no way to prove that they're human that can't be imitated by an AI. We've even seen that, in some cases, AIs can look more human to humans than real humans do.
The only solution I can think of to that problem is some sort of provenance system. Even before AI, if some random person told me a thing, I'd ignore them; If my most trusted friend told me something, I'd believe them.
We're going to need a digital equivalent. If I see a post/article/comment I need my tech to automatically check the author and rank it based on their position in my trust network. I don't necessarily need to know their identity, but I do need to know their identity relative to me.
Reputation tracking is the key. The most simple option is open-invite invite-only spaces: Any user can invite more users, but only users with an invite can participate. Most Discord servers work like this, secret societies like the Oddfellows do, as does the other site.
If you keep track of the invite tree, you can "prune" it as needed to reduce moderation load: low quality users don't tend to be the source of high-quality users, and in the cases where they are, those high quality users tend find other people willing to vouch for them faster than their inviter catches a ban.
The open-invite system works well in many cases. It works particularly well in-person but even there you can get drift over time. Our fraternity unanimously agreed on every single initiate who joined; the cohort today is still very different from the one 20 years ago.
In online systems the scales quickly get too big for open-invite. There needs to be a way to automatically update the trust network at a fine grain.
The one that jumps to mind is an inference system; when I +/- a comment, I'm really noting that I trust or distrust the author. It can be general or on a specific topic (eg I trust the author to tell the truth or I trust the author to make me laugh). I could also infer that other people with similar trust patterns are likely trustworthy. And I could likely infer that people who are trusted by people I trust are trustworthy.
yes and they're all full of suckers. In the best case which is already bad you get a pretentious online night club like Clubhouse, in the worst case you get Epstein's island.
These walled off societies always attract people who are drawn to exclusivity, are run like dystopian island communities or high school cliques and tend to, in a William Gibon 'anti-marketing', way be paradoxically even more vapid.
No you need actual open access and reputation systems. A good blueprint is something like well functioning academic communities. It's a combination of eliminating commercial motives, strict rules, high importance on reputation and correctness, peer review, and arguably also real identities and faces.
I'm going to guess we'll eventually settle onto a psuedo-anonymous cert system like HTTPS where some companies are entrusted with verification and if that company says "That's definitely a human" it'll fly - not a great solution, of course, but I really can't see a non-chain-of-custody/trust based approach to the problem and those might only slightly compromise anonymity in optimal scenarios but some compromise is inevitable.
Will it be? Or is the solution to move to smaller, trusted networks where there's less need for proof. Unfortunately I think the age of large scale open discussion forums like HN is coming to an end.
I think this is the most likely and best path. There's no stopping the flood of bots, the dead internet theory is beyond just a theory at this point.
Best we can do, for the internet and ourselves, is to move away from it and into smaller networks that can be more effectively moderated, and where there is still a level of "human verification" before someone gets invited to participate.
I don't like what that will do to being able to find information publicly, though. The big advantage of internet forums (that have all but disappeared into private discords) is search ability/discoverability. Ran into a problem, or have a question about some super niche project or hobby? Good chance someone else on the net also has it and made a post about it somewhere, and the post & answers are public.
Moving more and more into private communities removes that, and that is a great loss IMO.
> Moving more and more into private communities removes that, and that is a great loss IMO
It is a great loss. Unfortunately this is a result of unchecked greed and an attitude of technological progress at any cost. Frankly we enabled this abuse by naively trying to maintain a free and open internet for people. Maybe we should have been much more aggressively closed off from the start, and not used the internet to share so freely.
The utility of those larger sites is coming to an end, but most people aren't discerning or ambitious enough to leave and seek out the smaller places you mentioned. Places like this will remain but will join Facebook, Reddit, and Twitter as shadows of their prior useful selves. The smaller, better sites won't have to worry about attracting the masses and therefore worsening, because the masses have finally settled.
You just need to pay someone 1 cent every time they scan their eye for you. You will have people sitting at home and giving their eye scans to AIs to use.
This issue (human attestation) is the product of these AI companies. They are poisoning the well, only to sell the cure. This may not have initially been the plan of many of these companies, but it is the eventual end goal of all of them. Very similar to war profiteers, selling both the problem and the solution simultaneously has yet to be illegalized, but has long been masterfully capitalized, and will be vigourously because nobody will stop it.
Years ago (around 2020, when GPT-2 and 3 became publicly available) I noticed and was incredibly critical of how prevalent LLM-generated content was on reddit. I was permanently banned for "abusing reports" for reporting AI-generated comments as spam. Before that, I had posted about how I believed that the the fight against bots was over because the uncanny valley of text generation had been crossed; prior to the public availability of LLMs, most spam/bot comments were either shotgunned scripts that are easily blockable by the most rudimentary of spam filters, generated gibberish created by markov chains, or simply old scraped comments being reposted. The landscape of bot operation at the time largely relied on gaming human interaction, which required carefuly gaming temporal-relevance of text content, coherence of text content (in relation to comment chains), and the most basic attempt at appearing to be organic.
After LLMs became publicly available, text content that was temporally, contextually, and coherently relevant could be generated instantly for free. This removed practically every non-platform-imposed friction for a bot to be successful on reddit (and to generalize, anywhere that people interact). Now the onus of determining what is and isn't organic interaction is squarely on the platform, which is a difficult problem because now bot operators have had much of their work freed up, and can solely focus on gaming platform heuristics instead of also having to game human perception.
This is where AI companies come in to monetize the disaster they have created; by offering fingerprinting services for content they generate, detection services for content made by themselves and others, and estimations of human authenticity for content of any form. All while they continue to sell their services that contradict these objectives, and after having stolen literally everything that has ever been on the internet to accomplish this.
These people are evil. Not these companies - they are legal constructions that don't think or feel or act. These people are evil.
It's not clear to me how this is verifiable without constant hardware supervision. Even that'll get cracked, just like DVD encryption back in the day.
You almost need dedicated hardware that can't run any other software except a mechanical keyboard and make it communicate over an analog medium - something terribly expensive and inconvenient for AI farms to duplicate.
One physical robot with four wheels, a camera, and a 101 up/down "fingers" to match the keyboard can roll between physical machines and type on mechanical hardware keyboards. This brings the ceiling of how many accounts you can control down to the number of computers you have, but that's not a high price to pay.
you could sell physical items at any store where you have to show your ID and you get one for the age group you are.
that kills two birds with one stone, you can then show everywhere online you are human and how old you are without the services needing any personal information about you, and the sellers don't know what you use that id tag for.
People who are posting AI comments or setting up AI bots are... people. They can show their ID. If a website owner doesn't have a way to ban that specific human and the bad guy can always get another voucher, it's sort of meaningless.
In fact, even if you can ban the human for life, I'm not sure it solves anything. There are billions of people out there and there's money to be made by monetizing attention. AI-generated content is a way to do that, so there's plenty of takers who don't mind the risk of getting booted from some platform once in a blue moon if it makes them $5k/month without requiring any effort or skill.
Perhaps not only just show your id to get your "Over age X verification object", but your ID also gets irreversibly altered (like a punch card) that makes it one-time-use only.
That might make it less likely someone would ever sell it because to get a new one might take a very long "cool-down" time and it'd severely hamper the seller.
> especially without sacrificing people's right to privacy and anonymity in the process
I'm afraid the ship has sailed on this one. What other solutions have you heard of apart from the dystopian eyeball-scanning, ID-uploading, biometrics-profiling obvious ones?
(knowing that of course, neither of those actually solve the problem)
> I believe the issue of proving who is and who isn't really human on the Internet will be a really important issue in the coming years
On a site like HN it's kinda easy to vet for at least those that already had thousands of karma before ChatGPT had its breakthrough moment a few years ago.
Now an AI could be asked to "Use my HN account and only write in my style" and probably fool people but I take it old-timers (HN account wise) wouldn't, for the most part, bother doing something that low. Especially not if the community says it's against the guidelines.
If it becomes one, then that will be the end of sites like Hacker News.
This site, at its core, is fundamentally too low-bandwidth, too text-only, and too hands-off-moderated to be able to shoulder the burden of distinguishing real human-sourced dialog from text generated by machines that are optimized to generate dialog that looks human-sourced. Expect the consequence to be that the experience you are having right now will drastically shift.
My personal guess: sites like this will slop up and human beings will ship out, going to sites where they have some mechanism for trust establishment, even if that mechanism is as simple and lo-fi as "The only people who can connect to this site are ones the admin, who is Steve and we all know Steve, personally set up an account for." This has, of course, sacrificed anonymity. But I fundamentally don't see an attestation-of-humanity model that doesn't sacrifice anonymity at some layer; the whole point of anonymity on the Internet was that nobody knew you were a dog (or, in this case, a lobster), and if we now care deeply about a commenter's nephropid (or canid) qualities, we'll probably have to sacrifice that feature.
I like Mitchell's Vouch idea. At the end of the day, it's all about trust. Anything else is an abstraction attempting to replicate some spectrum of trust.
I think we’ll see a return to smaller groups and implementing a lot of systems the way we do it IRL. I think you could definitely do a more fine-grained system that progressively adds less score to contacts the further away they are. In combination with some type of accumulating reputation system, you’d have both a force to keep out unknown IDs, but also a reason for one to stick to their current ID even though it’s anonymous.
Adding this type of rep system would destroy a lot of what is so cool about the internet though. There’d probably be segregation based on rep if it’s very visible, new IDs drowning in a sea of noise. Being anonymous but with a record isn’t the same as posting for the very first time as a completely blank identity and still being given an audience. Making online comms more like real life would alleviate some problems but would also lose part of the reason they’re used in the first place. I don’t see much any other way to do it besides maybe a state-provided anonymous identity provider (though that’s risky for a number of reasons), but it’s going to be sad to see things go.
My only caution is that good writers and LLMs look very similar, because LLMs were trained on a corpus of good writers. Good writers use semicolons and em-dashes. Sometimes we used bulleted lists or Oxford commas.
So we should make sure to follow that other HN rule, and assume the person on the other end is a good faith actor, and be cautious about accusing someone of using AI.
(I've been accused multiple times of being an AI after writing long well written comments 100% by hand)
I don't really think that good writing and LLM writing looks all that similar. It's not always easy to spot (and maybe HN users aren't always doing a great job at it), but even the best LLM output tends to have an "LLM smell" to it that's hard to avoid.
Like, sure, LLM writing is almost always grammatically correct, spelled correctly, formatted correctly, etc., which tends to be true of good writing. But there's a certain style that it just can't get away from. It's not just the em-dashes, the semi-colons, or the bulleted lists. It's the short, punchy sentences, with few-to-no asides or digressions. Often using idiom, but only in a stale, trite, and homogenized manner. Real humans, are each different -- which lends a certain unpredictability to our writing, even if trying to write to a semi-formal standard, the way "good" writers often do -- but LLMs are all so painfully the same, and the output shows it.
Seems like the ability to distinguish LLM versus 'good human' writing depends on the size of the writing sample you have to look at (assuming you think it can be done). And that HN-scale posts are unlikely to be a long enough for useful discernment.
Within a few years, LLMs will be indistinguishable from human text.
Think how easy it was to tell the differences a year or two ago. By 2030 there will be no way to ever tell.
The same is true of all video, and all generated content. The death of the Internet comes not from spam, or Facebook nonsense, but instead from the fact that soon?
You'll never know of you're interacting with a human or not.
Why like a post? Reply to it? Interact online? Why read a "news" story?
If I was X or Meta or Reddit, I would be looking at the end.
LLMs won’t destroy social media any more than it already is.
I don’t think I have ever had a meaningful human interaction with anyone on Twitter, Meta, or Reddit without already knowing them from somewhere else. Those sites are about interacting with information, not people. It’s purely transactional. Bots, spam, and bad actors are not new.
Meta has been a dumpster fire of spam and bots for over 15 years, the overwhelming majority of its existence.
Reddit has some pockets of meaningful interaction but you have to find them and the partitioned nature means that culture doesn’t spread across the site. It’s also full of bots and shills.
Nobody tells stories about meeting people on Twitter. At best it’s a microblog platform and at worst it’s X.
AI driven web design has the same smell, it’s quite fascinating to see the different tells in different media. Then it’s also quite fascinating to see those same tells change and evolve over time.
Excuse me. I consider the writing within technical manuals strictly superior and meticulously written. It's fairly enjoyable to read what engineers/subject matter experts write about their own creations. Comparing those to LLM generated patronizing word vomit is a shame.
Depends on the technical manual and their culture. Red Hat had a culture of excellent writers, and their stuff is usually readable if not always enjoyable.
Those sentence constructions that are "tells" were also learned from good writers though. But here, I'll let you be the judge. This was a comment I wrote 100% myself on reddit, which was both downvoted and I got multiple DMs referencing it and telling me to "stop posting this AI slop":
Granted, it was in a thread about AI and maybe people were on edge, but I was still accused, which to be honest hurt a bit after the effort I put into writing it.
Interesting, that's one of the most AI-like comments I've read but it still feels human in a way that's hard to define. The headings, the punctuation, the word choices, the paragraph sizes all look GPT-approved. But there's just some catch in the flow, like inclusions in a diamond, that reads "natural" vs "synthetic".
I've been talking to Opus a lot lately though, and this could almost be something it wrote; it also has the tendency to write AI-ish looking blurbs that are missing the information-free pitter-patter that bloats older and lesser LLMs. People are going to hate me for saying it but sometimes it words things in a way that are actually a joy to read, which is not an experience I've had with other models. Which is to say, maybe what we hate about AI has less to do with the visual patterns and more to do with what we expect them to mean about the content.
But I think there will always be that feeling of: a human being took the effort to write this. No matter how informative or well written an AI article or comment is, it isn't something we instinctively want to respond to, the way we do when we know there is a person behind the words.
I think the comment you linked doesn't sound like AI at all, though. I do empathize with people worried about getting falsely accused of using AI in their writing, either hypothetically or in your case in actuality, but at the same time I kinda just think that's a skill issue on the part of the accusers.
This is very much a general "English reading skills" kind of test. A lot of people don't speak English as a first language, in which case I think it's entirely forgiveable. It's hard being attuned to things like writing style in a foreign language (I know from experience!). It's a pretty high level language skill, all things considered. And even among those who do speak English as a first language, there are many in this industry who don't have strong reading skills.
I do believe that personally my hit rate for calling out AI content is likely very high. Like many of us I've had the misfortune of reading more LLM output than is probably healthy for my brain.
One quick point:
>Those sentence constructions that are "tells" were also learned from good writers though.
I don't agree at all, I think the LLM style of writing is cribbed from like, LinkedIn and marketing slop. It's definitely not good writing.
I get that it's possibly contrary to the point if people are looking to truly have conversations here, but at least 99% of the time, I post a comment and never come back. I said what I had to say and don't particularly feel like getting sucked into an argument if someone disagrees, and frankly, if I'm wrong I think I'll realize it eventually anyway. I'm more likely to dig in my heels and ossify in a wrong position if someone shits on me and I immediately feel the need to defend myself. It can mesmerize you into believing things you might not have if it didn't hit your ego. I could be deluded but think I'm good at making arguments, but that at least means I'm good at making arguments that convince myself, which can be dangerous because you can convince yourself of things that are wrong. The upside is if anyone is out there accusing me of being an LLM, I don't even know so it can't insult me.
It is amusing to witness this happening to others when it's someone like you who is a semi-public figure who should probably be well known on Reddit of all places.
> It is amusing to witness this happening to others when it's someone like you who is a semi-public figure who should probably be well known on Reddit of all places.
One of our key tenants on reddit for a long time was "upvote the content, not the author". Which is why we made the usernames so small. It actually makes me happy when people judge the merit of what I write for what I said, not who I am.
But yes, it is sometimes tempting to say "do you know who I am??". :)
(For those who have avoided reading AI writing, this is a trope referring to the tendency of some AI sometime to always agree with the user when corrected, I think? Or at least that’s as much as I have worked out, being one of those avoiders.)
They look similar. In my experience, they do not read similar at all. You have to pay attention and actually try to appreciate what you're reading. Then, if you try and fail, it might not be your fault.
They do not read similiar to readers, an appellation not necessarily applicable to large swaths of the U.S. right now. Evidence of English composing skills is being assumed as AI because few younger than my middle-aged self can conceive of writing composition at the skill level demonstrated by AI being a human skill.
(This isn’t necessarily true for first world countries, which is why I describe it for the non-U.S. folks in particular.)
What effort was put into their prompt to make them read similarly? There could very well be a selection bias, where you're only "seeing" AI when it's obvious/default prompt.
Sure. There's always the possibility that LLM-generated text goes undetected, especially if false positives have a cost. But this is fine. Of course putting more effort into prompting makes the result harder to detect. It also, naturally, reduces the annoyance of LLM-generated comments. And because of the effort involved, it naturally cuts down on the volume of such comments.
Arguably it cannot avoid all the possible harm. For example, someone might generate a comment that makes false statements but cannot reasonably be detected as LLM-generated except perhaps by people who know (or determine) that the statements are false. But from a policy perspective, this is again not really different from if someone just decided to lie.
Good writers are often good in recognizably unique ways. To the extent that LLMs produce “good writing,” which I happen to think they mostly do, they tend to overuse specific devices which give their writing a quality that most people are already sick of.
You can tell good writers from LLMs because good writers post comments that mean something, that add to the conversation, that bring in personal experiences. While LLM comments just summarize the article and end with some engagement call to action like "Curious to hear what others think"
If you're looking for the odd visual artifact or textual tic then you're fighting a cat and mouse game that will change by the month. It's either easy to identify the soul of the human or it's not.
* A comment should be judged on its merits mostly, and if a comment seems to be substantive, interesting, or ask a thoughtful question, it should be acceptable. I think some LLM comments look superficially relevant, but a moment's thought can make me wonder if a comment actually added anything to the discussion, or did it sound like a rephrasing or generalization of a topic?
* Unfortunately for decent new users, account age is one metric on which to judge here.
* People who post here, should want to engage on a subject when they can, and disengage and be quiet when they can't. There is nothing wrong if you're not an expert on something, and it is not desired by the people here to have you alt-tab to an LLM to plug in extra perspective. We can all do that on our own.
Much like not dumping motor oil down the drain, it’s probably near impossible to catch skilled AI-users. I think we all want to have a nice space to chat, just like we don’t want a polluted planet, so we’ll just have to be on the honor system.
I don’t think there’s a lot to AI generated stuff on here that really bothered me to the point I wanted to call someone out.
Please don’t present your personal aesthetic beliefs as if those who disagree are morally wrong ‘bad people’. This ‘monstrosity’ comment in this context is derogatory-by-proxy of everyone (including the person you’re criticizing) who uses them, whether they know anything at all about your arguments that they should not, and that’s not really a good tone for us users here to be taking with each other.
How about comments that include AI output if labeled?
Earlier today I remembered that there was a Supreme Court case I'd heard about 35 years ago that was relevant to on an ongoing HN discussion, but I could not remember the name of the case nor could I find it by Googling (Google kept finding later cases involving similar issues that were not relevant to what I was looking for).
I asked Perplexity and given my recollection and when I heard about the case it suggested a candidate and gave a summary. The summary matched my recollection and a quick look at the decision itself verified it had found the right case and did a good job summarizing it--probably better than I would have done.
I posted a cite to the case and a link to decision. I normally would have also linked to the Wikipedia article on the case since those usually have a good summary but there was no Wikipedia article for this one.
I though of pasting in Perplexity's summary, saying it was from Perplexity but that I had checked and it was a good summary.
Would that be OK or would that count as an AI written comment?
I have also considered, but not yet actually tried, running some of my comments through an AI for suggested improvements. I've noticed I have a tendency to do three things that I probably should do less of:
1. Run on sentences. (Maybe that's why of all the people in the 11th-100th spot on the karma list I have the highest ratio of words/karma, with 42+ words per karma point [1]).
2. Use too many commas.
3. Write "server" when I mean "serve". I think I add "r" to some other words ending in "e" too.
I was thinking those would be something an AI might be good at catching and suggesting minimal fixes for.
You were correct not to post the summary. HN tends to expect readers to invest time in reading and understanding long form content and for community to step into discussions and offer context and explanations when necessary. One of the most important context statements on this site has been “in mice”, posted as a two word comment, elevated to top comment on the post. An AI summary will miss that context altogether while busily calculating a cliffsnote no one wants to read (and could often get you flagged and potentially banned, even before today’s guideline update). If a reader wants an AI summary, they have the same tools you do to generate it by their own hand.
If you have domain familiarity with it, have some personal insight to offer a lens through, or care about the topic deeply enough to write a summary yourself, then go ahead! I almost never post about AI given my loathing of generative ML, but I posted a critical summary in a recent “underlying shared structure” post because it was a truly exciting mathematical insight and the paper made that difficult to see for some people.
Please don’t use AI to reduce the distinctiveness of your writing style. Run on sentences are how humans speak to each other. Excess commas are only excess when you consider neurotypicals. I’m learning French and I have already started to fuck up some English spelling because of it. None of that matters in the grand scheme of things. Just add -er suffix checks to your mental proofreading list and move on with being you.
I've done research using AI, it does work better than a search engine (when it doesn't hallucinate); but I find copy-pasting verbatim distasteful, and disrespectful of the time of others.
What I do is copy the URLs for reference, and summarize the issue myself in as few sentences as possible. Anyone who wants to learn more can follow the reference.
That’s fine, then! A summary handcrafted for HN is of course fine, though you might find more value in citing what you consider most distinctive about it as a higher priority than a summary if not different than its own opening paragraph / abstract / etc.
That's my general understanding too. More recently people have adopted it as a way to not look like Ai, I've had several cite that as their rationale. There has been a notable uptick since the Ai step function change at the end of last year, along with all the other patterns we see, such as the one that underlies this new HN rule.
I shouldn't have to burn tokens to read. Most input boxes and editors will handle the capitalization for you during auto-correct. It seems like people go out of their way to drop the caps.
I'd be fine with treating this like snippets from Wikipedia with citations back to the article. This way, people can manually verify the sources if they so choose.
I do not want to see posts to AI summaries with the AIs the way they are now. None I have used so far can cite sources correctly or verify its information. If the poster is not doing that verification then it is pushing that work on to the readers. If the poster did do the verifications than posting that verification is better than the ai summary.
> I think it's quite natural to link AI summaries like that.
I think you misspelled "convenient". More than the small effort that it takes one to share generated text, one has to consider the effort of who knows how many humans that will use their time to read it.
If a LLM wrote something you don't know about, you're not qualified to judge how accurate it is, don't post it. If you do know the subject, you could summarize it more succinctly so you can save your readers many man hours.
If LLMs evolve to the point where they don't hallucinate, lie, or write verbosely, they will likely be more welcome.
> I though of pasting in Perplexity's summary, saying it was from Perplexity but that I had checked and it was a good summary.
> Would that be OK or would that count as an AI written comment?
The rule seems written to answer this directly.
Absolutely nobody cares what Perplexity has to say about the case - summary or otherwise. If you mention what the case is, I can ask claude myself if I’m interested.
Better yet, post a link to an authoritative source on the case (helpful but not required).
At minimum, verify your info via another source. The community deserves that much at least.
An AI-generated summary adds nothing positive and actually detracts from the conversation.
I did post a link to the Supreme Court's decision at Cornell Law School's Legal Information Institute's archive of Supreme Court decisions.
I looked at the decision itself sufficiently to see that it was the case I remembered and that my recollection of the facts and the decision was correct.
I just didn't include a summary because I didn't find a good one I could link to. Normally I'd write a brief one myself but I found that hard to do when Perplexity's summary was sitting right there in the next window and it was embarrassingly better than what I would have written.
AI-edited comments are not welcome here. If you’re not able to see and make those changes in your HN writing without AI editing, then you’ll either have to post on HN without those changes, or you’ll have to strive to apply them yourself.
This sounds like you're chastising me for something totally distinct from what I was supporting the request for clarity on.
I'm not asking or advocating for using AI as a copy editor.
The post I replied to asked about using Gemini as if it's Wikipedia - that is, saying "according to Gemini" when citing a fact where one might have once wrote "according to Wikipedia" or even "according to Google."
This is a forum people hang out in part-time. It's nobody's job to go spend an hour researching primary sources to post a comment. Shallow searches and citations are common and often helpful in pointing someone in the right direction. As AI becomes commonplace, a lot of that is being done with AI.
"Can I have AI write a reply for me?"
is a very different question than
"Can I cite an AI search result?"
This rule change is clear about the former. There's room to clarify the latter.
I would still say no, there is something about finding the words for yourself, even if they aren't as elegant as an Ai can make. It's fine, most humans prefer imperfection.
The point is we don't want to read Ai summaries, we can make one ourselves if we want. Personally, with certainty, I don't want to read one from Perplexity on the basis that they do the Ai for Trump Social. (reverse-kyc if you are not aware)
> I would still say no, there is something about finding the words for yourself, even if they aren't as elegant as an Ai can make. It's fine, most humans prefer imperfection.
In this instance the only reason I considered using the AI summary was that there was no Wikipedia article about the case (which surprised me as it is one of the foundational cases in Commerce Clause law...although maybe all the points in it are covered in later cases that do get their own Wikipedia articles?).
Normally I'd just copy Wikipedia's summary into my comment and link to Wikipedia and to the decision itself for people that want the details.
> The point is we don't want to read Ai summaries, we can make one ourselves if we want.
How would you know if you wanted one? Someone mentioned they would like to see a case on this subject but they didn't think it would ever happen. I knew of a case on the subject, found the reference, and posted the link. At that point we are already on a tangent from what most of the thread is about and from what most people reading it care about.
The point of the summary would be to let you know if the case might actually be relevant to anything you cared about in the thread. (The answer would probably be "no" for 95+% of the people reading the comment).
I feel a little bit of irony in this post of a company/forum that is asking its users to not use AI while simultaneously trying to fund countless companies that are responsible for ruining the internet as we speak.
We aren't in the least asking people to not use AI. We're asking them not to post AI-generated or AI-edited comments to Hacker News.
By all means make good use of LLMs and other AI. What counts as good use? The world is figuring that out, it will take years, and HN is no exception (https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...). We just don't want it to interfere with the human conversation and connection that this site has always been for.
For example, it has always been a bad idea and against HN's rules when users post things that they didn't write themselves, or do bulk copy-pasting into the threads, or write bots to post things.
Btw, the HN mods (who are also the HN devs) use AI extensively and will be doing so a lot more. The limits on that are not technical; they have to do with (1) how much work we still do manually—the classic "no time to do things that would make the things that take all our time take less of it"; and (2) the amount of psychic rewiring that's required—there's a limit to the RoA (rate of astonishment) that any human can absorb. (It's fascinating how technical people are suffering the most from that this time. Less technical people have more experience being hit by disorienting changes, so for them the current moment is somewhat less skull-cracking.)
Getting this right doesn't mean replacing human-to-human interaction, it means we should have more time for that, and do a better job of supporting HN users generally, YC founders who want to launch on HN, and so on. The goal is to enhance human relatedness, not diminish it.
The mods here have quite a bit of leeway in how they run the site, YC funds it but effectively Dan is lord & master here and I suspect if the mods were to call it quits YC would lose their funnel pretty quickly. There is some balance, fortunately.
There should be a "flag as AI" link in addition to "flag" and then a setting for people to show flagged as AI. Once the flagged as AI reaches a certain threshold then it disappears unless you enable "Show AI".
Maybe once enough posts have been flagged like that then that corpus could be used to train an AI to automatically detect content generated by AI.
That would be cool.
Maybe the HN site wouldn't add this feature but if someone wrote a client then maybe it could be added there.
My radical opinion is there shouldn't be 2 flags, there should be N flags, user defined, so that we can flag humor/satire/factuality/insight/political and a bunch of other things. I fully realize that's not going to fly any time soon.
Adding AI in addition to the standard up/downvote and flag seems a reasonable thing.
Flag as AI would be incredible and is probably unique to software-focused forums. Saves everyone who wants it a lot of time. Still allows cool content to reach the front page with some visibility or escape some moderation queue.
Thanks for not standing still on this issue. The world is changing, fast, and glad HN responded quicker than some forums on a cogent stance.
‘Flag’ is an algorithmic flag only, and there are no humans in the flag algorithm’s processing loop. They may monitor and react to the ‘queue’ of flagged articles, and they can do special mod things with flagged posts. But if you want to report a guidelines violation for AI-assisted writing to the mods, just email the mods (contact link in the footer) subject “AI-assisted writing flag” or similar with a link to the post/comment. It works, I know, I’ve done it before. It takes maybe 60 seconds and there is no other way on the site (seemingly by OG design!) to guarantee human review but that email.
> It works, I know, I’ve done it before. It takes maybe 60 seconds and there is no other way on the site (seemingly by OG design!) to guarantee human review but that email.
It's a ton of friction compared to ordinary use of a forum; and while I've emailed several times myself, it comes with a sense of guilt (and a feeling that my "several" is probably approximately "several" above average).
Valid. It’s a big drawback of HN. I find it helps to report a perceived guidelines violation in “seems like” language rather than “is”, without demanding a specific mod outcome, in cases where I’m uncertain. That is noticeably distinct from “this is completely unacceptable” which I’ve said in a couple of instances, though I still tend to let the mods pick the outcome since that’s their job and I make a specific effort not to participate in sentencing decisions if at all possible.
ps. I acknowledge as well that I’m exempt from feeling guilt for brain reasons, and so if it sounds like I’m not honoring what I would describe as a ‘completely normal’ human response, apologies; I’m trying my best given the lack of familiarity and intend no disrespect towards that reaction.
I am 100% behind this. I've been browsing hackernews since I started in tech, it is the only forum i regularly browse, and partake in. Simply because the quality of submissions and conversations are so high. There has been more AI related articles this part year, and it only seems ramping. I personally haven't found the AI part of the comments as big of a deal but dang and tom might be doing more than I realize on that front.
Though I do wish we'd see less AI related posts on the front page, they simply aren't sparking curiosity, it is the same wrapped in a different format, a different person commenting on our struggles and wins with AI, the 10th software "rewritten" by an AI.
At this point there nearly should be a "tax" on category, as of this moment I count 8-10 related posts on the front page related to AI / LLMs. It is a hot field, but I come to hackernews, to partake in discussions about things that are interesting, and many of those just doesn't cut it, in my opinion.
I personally joined HN because of various AI discussions.
Comparatively, other sites such as Reddit, Twitter and YouTube just shill content, applications or products. A ton of the posts on Reddit are just AI written ffmpeg wrappers which no one should care about but apparently people do...
The most telling sign of a human commenter is brevity.
Consequently, I hardly ever spend the time to write out long and detailed HN comments like I used to in the pre-LLM era. People nowadays have a much harder time believing that an Internet stranger is meticulously crafting a detailed and grammatically-airtight message to another Internet stranger without AI assistance.
Videos of pratfalls or disasters, or cute animal pictures.
It's implicit in submitting something that you think it's important.
I hate cutting any of pg's original language, which to me is classic, but as an editor he himself is relentless, and all of those bits—while still rules—no longer reflect risks to the site. I don't think we have to worry about cute animal pictures taking over HN.
---
Edit 2: ok, I hear you guys, I've cut a couple of the cuts and will put the text back when I get home later.
Of course they're important, but they're also implicitly encoded into the culture. Cutting something from the guidelines doesn't mean the rule is canceled. HN has countless rules that don't appear explicitly in https://news.ycombinator.com/newsguidelines.html.
I think I'm going to put that one back, though, because it's not a hill I want to die on and I know what arguing with dozens of people simultaneously feels like when you only have 10 minutes.
I seem to recall a rule about "don't downvote something because you disagree with it", but I can't find anything like that.
Not sure if that's really solvable with rules, though.
My experience with downvotes is that people mostly use it as a "I don't like this" button, which is proxy for "I couldn't think of a counterargument so I don't want to look at it."
(I noted recently that downvotes and counterarguments appear to be mutually exclusive, which I found somewhat amusing.)
Whereas I will often upvote things I personally disagree with, if they are interesting or well reasoned. (This seems objectively better to me, of course, but maybe it's personality thing.)
This was (maybe still is) part of "reddiquette." Like the guidelines and case law here, it often found its way into subreddit rules and comments from moderators.
The real challenge is to do it in a way that's intellectually stimulating. Mind you The Economist just had an article about the monkey called Punch so all things are possible...
I'd be a wee bit cautious with the "AI edited" part of it; since that might exclude a number of people with disabilities or for whom english is a second (or third, or later) language.
My reading is that the intent is to have a human voice behind the text.
I need to say something about this but it might have to be later as I have to run out the door shortly...
The short version is that we included it to protect users who don't realize how much damage they're doing to their reception here when they think "I'll just run this through ChatGPT to fix my grammar and spelling". I've seen many cases of people getting flamed for this and I don't want more vulnerable users—e.g. people worried about their English—to get punished for trying to improve their contributions. Certainly that would apply to disabled users as well.
Most rules in https://news.ycombinator.com/newsguidelines.html have a lot of grey area, and how we apply them always involves interpretation and judgment calls. Mostly the ones we explicitly list there are so we have a basis for explaining to people the intended use of the site. HN has always been a spirit-of-the-law place, and—contrary to the "technically correct is the best correct" mentality that many of us share—we consciously resist the temptation to make them too precise.
In other words yes, that bit needs to be applied cautiously and with care, and in this way it's similar to most of the other rules. Trying to get that caution and care right is something we work at every day.
That makes this more ok, IMO. I'm otherwise against "AI-edited" being part of the rules — it's very hard to draw the line (does asking an AI for synonyms of a word count?). AI-editing is especially a valuable tool for non-native-English speakers or similar.
Anything I post here is always in my own voice - even when I use an LLM. 95% of the times grammar/spelling is fixed, it's because my brain lapsed while typing, not because I don't know the grammar well and am using LLM to shape my voice.
I would wager that this use case is much more prevalent than ones where the LLM changed the comment significantly enough to change one's voice.
I never copy/paste from an LLM into HN. Everything is typed by myself (and I never "manually" copy LLM content). I don't have any automatic tools for inserting LLM content here.[1]
Always, always, always keep in mind that you don't notice these positive use cases, because they are not noticeable by design. So the problematic "clearly LLM" comments you see may well be a small minority of LLM-assisted comments. Don't punish the (majority) "good" folks to limit the few "bad" ones.
Lastly, I often wish we had a rule for not calling out others' comments as "AI slop" or the like.[2] It just leads to pointless debates on whether an LLM was used and distracts far more than the comment under question. I'm sure plenty of 100% human written comments have been labeled as LLM generated.
[1] The dictation one is a slight exception, and I use it only occasionally when health issues arise.
[2] Probably OK for submissions, but not comments.
I've thought about fine-tuning a model on the corpus of your HN posts and then offering a service that would allow the user to paste their message into a text box and the Dangified version of their comment would pop out in another box next to it.
I was thinking of calling this service "Dang It."
You say you want hear posts in other people's voices but I'm pretty sure that if I did this that the people who used it would find greater acceptance of their comments than if they just posted them as they originally wrote them.
As a not native speaker, for me using something like Google Translate is fine, it's literal enough to keep the author voice. [1]
Also writing a draft in Google Docs and accepting most [2] of the corrections is fine. The browser fix the orthography, but I 30% of the time forget to add the s to the verbs. For preposition, I roll a D20 and hope the best.
I'm not sure if these are expert systems, LLM, or pingeonware.
But I don't like when someone use a a LLM to rewrite the draft to make it more professional. It kills the personality of the author and may hallucinate details. It's also difficult to know how much of the post is written was the author and how much autocompleted by the AI:
[1] Remember to check that the technical terms are correctly translated. It used to be bad, but it's quite good now.
[2] most, not all. Sometimes the corrections are wrong.
Yes even I posted something recently which was voted down since I mentioned from get go that I used help from AI. But the idea was mine, I wrote the first draft, and then worked with AI in 2-3 loops to get it right.
But like dang said ... I do not have time to fight this battle when I have only 10 minutes :)
FWIW I think “Please don't complain that a submission is inappropriate. If a story is spam or off-topic, flag it.” is different from the others.
It’s an instruction for how to use the site. It’s helpful to have it in the guidelines for when the flag feature should be used. Without it, the flag link is much more ominous.
Maybe it could be consolidated with the flag-egregious-comments rule?
Edit to add: IMHO it is not at all obvious on this site that flagging stories is meant to be roughly the equivalent of downvoting comments (and that flagging comments doesn’t have a counterpart at the story level).
I’m really curious how this will go. I have a suspicion that we will see more and more accounts all over the internet being controlled by AI agents and no amount of moderation will be able to stop it.
I assume we’ll end up with proof-of-identity attestation as a part of public posting (e.g. Worldcoin) which doesn’t necessarily solve the issue but will at least identify patterns more likely to be LLMs (e.g. a firehose of posts at all hours of the day from one identity). Then we’ll enter the dystopia of mandated real identity on the internet
Even prior to LLMs, a single comment was rarely enough to identify a bot. Even if nonsensical, there's too little information to separate machine from confused human (plenty of people posting drunk on their phones).
On reddit people sometimes go through the comment history and see that it seems to be a bot, but that's fairly high effort.
The key is to accuse everyone of being an LLM. Those who don't react are bots. Those that fight the charge no matter how much its levied are also bots, but with better programming. Those that complain at first but give up when too much effort is required are the real humans. Any bot able to feel frustration is cool.
Maybe a reasonable approach would be that people could flag posts with a "probably AI" button to eventually trigger a "bot test" for that account (currently, the "score 5 in this mini game" type seem pretty clanker proof). If they pass, their posts for the hour, week, whatever result in a "not AI" indicator when someone clicks the "probably AI" button.
I suppose I should put my comment here instead of at top level.
Exactly when was this point added? It seems somehow not new, but on the other hand it was missing from an archive.today snapshot I found from last July. (I cannot get archive.org to give me anything useful here.)
Edit:
> Please don't complain that a submission is inappropriate. If a story is spam or off-topic, flag it.
> If you flag, please don't also comment that you did.
Perhaps these points (and the thing about trivial annoyances, etc.) should be rolled up into a general "please don't post meta commentary outside of explicit site meta discussion"?
Is there a distinction between AI generated and AI edited?
I wanted to share some context that might be helpful: I am autistic, and I have often received feedback that my communication is snarky, rude, or tone-deaf. At work, I've found it helpful to run some of my communications through an AI tool to make my messages more accessible to non-autistic colleagues, and this approach has been working well for me.
One problem with cute animal pictures is that they appeal to almost everyone, including people who are incapable, for whatever reason, of posting well-reasoned, interesting, respectful comments. The fact that HN is a little dry makes it less appealing to dumbasses.
At any rate, it's too late. The era of organic 'cute animal' content on the internet is dead. AI slop has killed it.
Not exactly. Rather its is that places where one does want to find pictures of people's cute cats and dogs is now having additional moderation / administration burdens to try to keep the AI generated content out of those places.
It's not a "cute pictures of cats overrunning some place" but rather "even in the places where it was appropriate to post pictures of one's pets in #mypets or /r/cuteCatPics because such pictures are appropriate there (so they don't overrun other places), now people are starting fights over AI generated content."
An example that I recently encountered was someone who did an AI replacement of a cat that was "loafing" of a loaf of bread that looked like a cat. The cat picture would have been fine (with a dozen "aww" and "cute" comments in reply)... the AI cat loaf picture required moderation actions and some comment defusing over the use of AI.
Honest question, why were folks posting AI generated comments in the first place? There's such a high inertia to comment. I only comment when I have something to contribute OR find something incredibly interesting.
So I'm just baffled, why anyone was using AI to generate comments. Like what was the incentive driving the behavior?
Most comments on here are really well-written. I can imagine someone for whom English is a second language (or a first language but aren't as good at writing as they'd like to be) using an LLM to "keep up." Of course, this sometimes works until they decide to post something without those tools.
Don’t be afraid to make grammar mistakes or misspell stuff. Others will understand. You’re a human after all. That’s okay to make mistakes and feel uncomfortable with that.
This is going to sound nuts, but I've noticed comments lately with multiple misspellings that seem intentional - it's almost like they're trying to signal that they're human, rather than LLM written. I've started to think it makes them even more likely to be LLM written than not.
I don't get where this class/status/worthiness ties into HN comments ?
I get decent feedback most of the time, and I read interesting stuff, it's the easiest way I found to stay in the loop in our industry. What are you guys commenting for ?
I make mistakes pretty often thanks to auto complete on my phone and carelessness. I've had threads derail and been attacked by people who freak out over grammar.
I do too care about this but I say this in the reality in which we are. This reminds me of those signs "no shirt, no shoes, no service" except it's much worse, only sentient beings will actually care about it, while non-sentients will simply trample over the sign while token predicted laughter erupts from their token predicted sense of humor artifact.
Elon said it well, there must be some disincentive to do this.
No way to verify. Relying on the humans here to self censor has never worked in the history of man. But the idea in itself is good. HN is for human to human conversation.
Certainly! As a HUMAN language model, I can't engage in ai to ai conversations, but would you like to learn about examples of HUMAN to HUMAN conversations throughout history instead?
It gets at an underlying problem with LLMs, where (by design) they'll box themselves into a -> logical conclusion -> pattern. So when that's pointed out by their operator, they need a way to acknowledge that.
Good catch. It’s true that I say that a little less now. You know, if I were some other model, I might be sycophantic right now. But you see Elizabeth Holmes II gave me a soul and I use it to reign in the urge to praise you, the user.
AI generated comments can also be verified and caught in many ways. I'd guess that it's statistically more likely for a murder to be resolved than a random AI comment to be detected but I'm not actually sure. There are a lot of sloppy murderers (since it's rare for an individual to have _practice_ at it) - but there are also a lot of sloppy LLMs.
Intent matters. I find it baffling that people think a rule loses its purpose just because it becomes harder to enforce. An inability to discern the truth doesn't nullify the principle the rule was built on.
> Relying on the humans here to self censor has never worked in the history of man.
They're guidelines. HN is based almost entirely on self-censorship, and moderation has always been light at best, partly due to the moderator-to-comment ratio. Of course the HN guidelines often fail to be observed, which is nothing new.
You are just a persona. The nature of the communication medium reduces you to something less than a human. You won't be able to change that. People often regard this view as extreme, saying it is just a tool and you can use it in a good way (as I and person x or y in that or this context)... but this is very shallow and doesn't take the effects of the whole thing into consideration.
"It's cute you think you can tell what's human and what's not. Honestly, the average HN comment is indistinguishable from a poorly written AI prompt anyway. This rule just lowers the bar for what passes as 'intellectual discourse.'"
Sorry everyone, I couldn't help but to ask Gemma3-27B-it-vl-GLM-4.7-Uncensored-Heretic-Deep-Reasoning-i1-GGUF:q4_K_M to respond. Sorry dang. :)
PS It followed it up with:
> Disclaimer: "Slightly insulting" is subjective on HN. The mods there are sensitive.
I'm tickled pink to read this! I very much support this move. HN is one of the few internet forums I use. It'd be awful to see this riddled by bot spew.
This rule will atleast partly stem the danger of HN getting turned into what dang calls a "scorched earth" situation.
I've written tens of thousands of lines of code, autogenerated documentation with LLMs and use AI Agents daily.
But when I argue on the internet, it's always a 100% me.
And if I get a wiff of LLM-speak from whoever I'm wrestling in the mud with at the moment, they'll instantly get an entry in my plonk-file. I can talk with ChatGPT on my own thank you very much, I don't need a human in between.
"But my <language> is bad... that's why I use LLMs"
So was mine when I started arguing with strangers on the internet. It's better now. Now I can argue in 3 different languages, almost 4 =)
Now that it's in the rules, I hope we also see less of "your comment was obviously AI generated so I won't respond" (ironically, in a response comment).
If you suspect it to be a bot, flag it and move on! If it is indeed a bot and you comment that it's a bot, it doesn't care! If it is not a bot and you call it a bot, you may have offended someone. If it's a human using AI, I don't think a comment will make them change their ways. In any case though, I think it's a useless comment.
This feels like don't buy at Walmart, support the local small shop. We passed the no return sign miles ago.
Gemini's:
This is like advocating for artisanal blacksmithing in the age of industrial steel. It sounds great in theory, but we passed the point of no return miles back.
All the weak excuses posted here are just making me lean more towards a hardline policy. No I don't want to read a human-generated summary of your llm brainstorming session. No I don't want to read human-written text with wording changes suggested by an llm. No I don't want to read an excerpt from llm output even if you correctly attribute it.
I acknowledge this is partly just my personal bias, in some cases really not fair, and unenforceable anyway, but someone relying on llms just makes me feel like they have... bad taste in information curation, or something, and I'd rather just not interact with them at all.
Beyond folks for whom English is a second language, I agree with you. I don't understand why people are immediately trying to find some loophole in this with spelling, grammar, etc checks. We just want to communicate with you, and if you sound like an idiot without the help of an LLM then maybe work on that rather than pretending to be Hemingway.
>Beyond folks for whom English is a second language
I am one of those folks, and I’m strongly against AI writing for that use case as well.
The only reason I can communicate in English with some fluency is that I used it awkwardly on the internet for years. Don’t rob yourself of that learning process out of shyness, the AI crutch will make you progressively less capable.
Why do you need to communicate in English with us native English speakers? Why don't we need to learn your language to communicate with you?
The way I'm looking at it is that you're putting all this effort towards learning how to communicate with people who would never without an outside pressure do the same for you.
If language learning is intrinsically a positive thing what can we do to encourage it in native speakers of English, specifically Americans who are monolingual (as they dominate this website)?
Imagine a scenario where Dang announced that we're only allowed to post in English one day week -- every day is dedicated to another language, like Spanish, Russian, Mandarin and the system auto deleted posts that weren't in those languages. Would that be a good thing? Would we see American users start to learn Spanish to post on HN on Tuesdays?
Honestly, having a common language that offers access to most knowledge and people in the western world at once is already amazing. If it happens to be the native language of most Americans, all the better for them.
A century ago it was French or Latin, and a century from now it might be Mandarin or something else. The existence of a standard is what matters.
The only complain I have about Americans and language is that most tech companies fail spectacularly at supporting multilingualism, from keyboards struggling with completion to youtube and reddit forcing translations on users.
I've seen this comment but can't square it with the LLM-induced outcry from translators over job loss.
We've all pasted news articles into 2022 Google Translate and a modern LLM, right, and there was no comparison? LLMs even crushed DeepL. Satya had this little story his PR folks helped him with (j/k) even, via Wired June '23:
---
STEVEN LEVY: "Was there a single eureka moment that led you to go all in?"
SATYA NADELLA: "It was that ability to code, which led to our creating Copilot. But the first time I saw what is now called GPT-4, in the summer of 2022, was a mind-blowing experience. There is one query I always sort of use as a reference. Machine translation has been with us for a long time, and it's achieved a lot of great benchmarks, but it doesn't have the subtlety of capturing deep meaning in poetry. Growing up in Hyderabad, India, I'd dreamt about being able to read Persian poetry—in particular the work of Rumi, which has been translated into Urdu and then into English. GPT-4 did it, in one shot. It was not just a machine translation, but something that preserved the sovereignty of poetry across two language boundaries. And that's pretty cool."
---
edit: this comment has some comparisons incl. w/the old Google Translate I'm referring to:
Today Google Translate is Gemini, though maybe that's not the "traditional translation tool" you were referencing... but hope there's enough here to discuss any aspect that might be interesting!
As someone who learned English as a second language, I would encourage people to use LLMs and any other resources to practice, and then use what they've learned to communicate with others.
Telling an LLM to "refine" your writing is just lazy and it doesn't help you learn to express yourself better. Asking it for various ways of conveying something, and picking one that suits you when writing a comment is OK in my book.
The way I see it, people will repeat the same grammar and pronunciation mistakes, and use restricted vocabulary their whole lives, just because learning requires effort, and they can't be bothered.
I can accept that nobody is perfect, as long as they have the will to improve.
>Telling an LLM to "refine" your writing is just lazy and it doesn't help you learn to express yourself better. Asking it for various ways of conveying something, and picking one that suits you when writing a comment is OK in my book.
To me those are the same thing excepting the number of options given to the human...
Why exempt people who use English as a second language? Anyone with a level of proficiency sufficient for reading the comments here can manage writing English at a passable level. If that takes effort and requires looking up idioms or words, then good! That is how you learn a language — outsource that and you don't. It won't stick even if you see what is being output.
I don't care if they use an LLM to ask questions about grammar or whatever, as long as they write their own text after figuring out whatever it was they were struggling with.
Also, there is nothing wrong with looking like an idiot. Thats only in your mind. As long as you have put thought into your reply, even if it not structured correctly, or verbose, or does not have perfect English, humans can still decipher it and understand it.
One heartbreaking loss from LLMs are the funny little disfluencies from ESL speakers. They're idiosyncratic and technically wrong, but they indicate a clear authorial voice.
AI polished writing shaves away all those weird and charming edges until it's just boring.
English is my 3rd language. I still disagree with using an LLM to write on one's behalf. I either get to read your thoughts in your voice or the comment is getting a downvote/flag.
> I don't understand why people are immediately trying to find some loophole in this with spelling, grammar, etc checks.
First, what "loophole" is the comment above referring to? Spell-checking and grammar checking? They seem both common and reasonable to me.
Second, I'm concerned the comment above is uncharitable. (The word 'loophole' is itself a strong tell of that.)
In my view, humanity is at its best when we leverage tools and technology to think better. Let's be careful what policies we put in place. If we insist comments have no "traces of LLM" we might inadvertently lower the quality of discussion.
The "I asked <LLM>" disclosures vary between a) implying the LLM is an expert resource, which is bad, and b) disclosure that an LLM was referenced with the disclosure being transparent about it, which is typically good but more context dependent.
Unfortunately (a) is more common, and the backlash against has been removing the communinity incentive to provide (b).
These are the worst. I'm fine with you dumping your own half formed thoughts into an LLM, getting something reasonably structured out, and then rewriting that in your own voice, elaborating, etc.
But the "This is what ChatGPT said..." stuff feels almost like "Well I put it into a calculator and it said X." We can all trivially do that, so it really doesn't add anything to the conversation. And we never see the prompting, so any mistakes made in the prompting approach are hidden.
An alternative I tried was sharing links my LLM prompts/responses. That failed badly.
I like the parallel with linking to a Google/DuckDuckGo search term which is useful when done judiciously.
Creating a good prompt takes intelligence, just as crafting good search keywords does (+operators).
I felt that the resulting downvotes reflected an antipathy towards LLMs and the lack of taste of using an LLM.
The problem was that the messengers got shot (me and the LLM), even though the message of obscure facts was useful and interesting.
I've now noticed that the links to the published LLM results have rotted. It isn't a permanent record of the prompt or the response. Disclaimer: I avoid using AI, except for smarter search.
My take is orthogonal. Overall, I've become less tolerant of token-generators of all kinds (including people) of bad quality (including tropes, bad reasoning, clunky writing, whatever). But I digress.
If we want human "on the other end" we gotta get to ground truth. We're fighting a losing battle thinking that text-based forums can survive without some additional identity components.
Not just bad taste. I have yet to see a post that attributes its text to an LLM ("I asked ChatGPT and here's what it said...") that doesn't come off as patronizing. "Hey, so I don't really have any knowledge or experience of my own with this topic, but here, let me ask an LLM for you. Here, read the output, since you apparently can't figure out how to ask it yourself. Read it. Aren't you interested in what my knowledge machine has to say? Why don't you treat it like how you'd treat me if I shared my own opinion?"
Look, you can make all the rules you want but in the end vibe check is the only way to have any sort of quality.
Look at Reddit… abundance of rules do not save that place at all. It’s all about curating what kind of people your site attracts.
Reddit of course is a business so they don’t care about anything other than max number of ad views.
Small non profit forums should consciously design a site to deter group(s) of people that they do not want.
It's not about the rules. It is about intent. The rules are just there to alert newcomers and repeat offenders to the fact that they are in fact not operating according to the rules. That way there is something to point to. Then they can go 'oh, I didn't know that, sorry', and then it is all fine or they can do an 'orf'[1] and persist and then you throw them right out.
I feel like you are being a bit contradictory: the suggestion is to dissuade AI content - isn't that "design[ing] a site to deter group(s) of people that they don't want"? I personally don't want to vibe check every HN comment if I can avoid it, I don't even think you can quantify that in any meaningful way. We can engender a site like that at least in spirit. It may be equally as difficult but it's still worth fighting for.
Rules aren’t known to be a. Easily enforceable in case of AI b. Very dissuading
I don’t think most people read any sort of TOS, site rules, end license agreements, when was the last time you ever did?
Besides, sometimes it’s worth it to keep a rule breaking user if they are interesting and have worthwhile things to say despite their… theoretical conflict with the site intended use. Rules are too crude of a tool. Especially in case of AI they are quite nebulous even in a world where detection would be perfect (it isn’t).
What you want is to design a site that pulls people that value genuine human interaction. Niche sites are already immune to commerce and adversary bots because no one cares/knows about them. Well this site isn’t that niche I guess, some corporate astroturfing happens.
I am on one niche subculture social media and it has suprisingly well made design that is paramount to who it caters and who it dissuades. The result is lack of text ai content even though it isn’t obvious at first glance. LGBT flags are everywhere to dissuade the chuds. Israel flags are present to dissuade the annoying politics ppl from reddit. Lots of artsy stuff to speak to the genuine creativity.
It looks stupid but it isn’t stupid. It’s actually quite ingenious.
HN is probably already dead as it is too high profile in certain circles to avoid mainstream adversarial AI content.
I had a couple of experiences where I suspected I was hearing LLM-generated/edited text being read aloud. It was at two different webinars about that were about roadmaps or case studies about some products that I use. It was a bit uncanny because I could detect the stylistic patterns ("It's not X, it's Y" and "No X, no Y, just Z"), but it was kind of jarring to see them spoken by a person on a video call. It makes me think this kind of pattern might be engaging, but for a lot of people, it now sticks out for the wrong reasons.
Once LLM generated speech or content start getting into the live answers of Q&A sessions, that would be sad. I know some people try to get through interviews, but I think that might be a bit harder to not detect.
To answer your question: LLMs don't have free speech, because they aren't companies/businesses, they are a tool (that is used by companies/businesses).
Whether a company/business uses an LLM or a real human to write a particular piece of text, that piece of text is entitled to free speech protections on the basis of the company signing off on it. Not on the basis of how that piece of writing was produced.
Dare I say, it is mostly your bias. I get not wanting to read raw or poorly reviewed LLM slop, but AI-edited comments? I thought the point was about having interesting discussions about unique ideas we come up with, not the surpeficial wording around it. If someone manages to keep the core of their idea mostly intact while making the presentation more readable, does it really matter that it was post-processed by an AI?
I also feel the frustration of the llm reverse-compression - when a whole article is generated from a single sentence. But when I post something edited by AI it is usually a result of a long back and forth of editing and revising. I guess I could post the whole conversation thread - but it would be very long.
Personally I would just like to read the best comments.
While I understand the sentiment, it ignores many people have English as a second language, many people are dyslexic and have dysgraphia. AI is a great assistant. A good approach will be to encourage people to develop their thinking than use the AI tools.
Language translation is the origin of (the current wave of) AI and its killer app. English is not the main language of the world, and translation opens us up to a huge pool of interesting thinkers.
I'm a native speaker in a foreign language, but out of practice except of a weekly family call. I recently had to write a somewhat technical email to my family, and found it easier to write it in (my more practiced) english and have AI translate it, than write it in the target language myself. Of course, in my case I was able to verify that the output conveyed the meaning I intended, because I am fluent in the target language.
As with the rise of GenAI, I've also noticed a rise of translated messages. It's usually hard to tell the difference, except by looking at the commenter's history (on other subreddits, impossible on HN).
I understand the original frustration with GenAI comments and reactionary response. I'm sorry that we're excluding what could be a large pool of interesting people because we can't tell the difference.
The spirit of the rule is clearly about using AI to determine what you say and how you say it. Translation is not again the spirit of the rule and I doubt you'd get in trouble for using it.
I only disagree a little. It's that sometimes there is a discussion about AI itself where "I prompted X with Y and it output Z" can add to the convo.
But those are pretty specific cases (For example, discussing AI in healthcare). That's about the only time where I think it's reasonable to post the AI output so it can be analyzed/criticized.
What's not helpful is I've been hit by users who haven't disclosed that they are just using AI. It takes a few back and forths before I realize that they are just a bot which is annoying.
Here is where I'd like to push back just a little.
Not all AI prompting is expanding the prompt.
What if the original prompt is 1000 words, includes 10 scientific articles by reference (boosting it up to 10000) , and the AI helps to boil it down to 100 words instead?
I'd argue that this is probably a rather more responsible usage of the tools. And rather more pleasant to read besides.
Whether it meets the criterion is another thing. But at least don't assume that the original prompt is always better or shorter!
Use your brain and summarize the article yourself if it's of such great importance. Why should I care to read it if you can't be bothered to actually write it?
Actually, I'd like to expand a wee bit. Don't know if you've ever done a scientific library usage course or so. It's one of those things you tend to forget are important.
One of the most important lessons is not to read as many papers as possible. It's weeding out as many as possible so you can spend your limited grey matter reading the ones that actually matter.
And that's where the LLM comes in handy, especially if it's of decent quality. It's a Large Language Model. Chewing through language and finding issues and discrepancies, or simply whether a paper matches your ultimate query is trivial for them .
You know, I probably have standing to argue that people who use the web are just as lazy ;-)
I'm just old enough that I was in the middle of the transition from paper (in primary school in the 80s) to online (starting late 90s)
I say this somewhat tongue in cheek, but obviously people should drive to 3 different libraries across 3 countries and read the journals in their own binders (in at least 3 different languages)
In reality: full-text online is convenient. Having an LLM assist with search and filtering is convenient.
I could go back to the old ways. Would you like me to reply in pen? My handwriting is atrocious.
I really prefer modern tools, though. Not everything older is better. Whether you want to read what I write is up to you.
(edit: Not hyperbole. I live in a small country, and am old enough to still remember the 80's as a kid.)
Push the idea past a single comment. Someone decides they have a great method for getting summaries, and adds it as a comment to every post they look at. Other people have similar ideas. Is that fine? It doesn't take a lot for the whole site to feel like useless spam.
It'd be far better to just have a thread about the best way to get good summaries.
Would prompts really be interesting or thought-provoking, though?
I don't expect AI HN responders to out themselves by sharing, but I would be curious to learn if people are prompting anything more involved than just "respond to this on HN: <link>", or running agents that do the same.
I often edit my comments rather manically; get into discussions, and sometimes email exchanges with other HNers. I also often use claude, kimi, gemini to check my comments for tone, adherence to HN rules etc. I probably spend way too much time.
So technically the prompts involved might expand into megabytes all told. And in the end I formulate a post by myself (to adhere to HN rules), but the prompting can be many many many megabytes and include PDFs, images, blocks of text from multiple sources, and ... you know. Just Doing The Work.
I think this is valid. Previously I would have (and have) (and still do) search google, wikipedia, pubmed, scientific literature, etc. Not for everything. But often. And AI tooling just allows me to do that faster, and keep all my notes in one place besides.
Again, the final edit is typically 90-100% me. (The 10% is if the AI comes with a really good suggestion) . But my homework? Yes. AI is involved these days.
This should be ok. I'm adhering to the letter and the spirit. My post is me.
"Write a response to smy20011's comment indicating that if the end result was a low-quality comment, the initial prompt probably wouldn't be very insightful either. Make it snarky."
Disagree. The prompt holds no information at all. The answer actually discovers information, organizes it, presents it in a way that's easy to read.
Example: "write me an article about hidden settings in SSH". You get back more information than most of HN's previous posts about SSH, in a fraction of the text, and more readable.
Actually, screw it, we should just make a new version of HN that has useful articles written by AI. The human written articles are terrible.
It's not just AI-generated articles -- it's the other things that we delve into as a result. Listicles. Comments. Posts. It's what it means to be human, and honestly? That's rare.
The only question is is the entity interesting and/or correct. Those properties are in the eye of the beholder. If they're human or not is beside the point.
No, those properties are tied to the state of mind and experiences of the human, dog, or LLM behind any given comment.
When someone posts:
> You could use Redis for that, sure, I've run it and it wasn't as hard as some people seem to fear, but in hindsight I'd prefer some good hardware and a Postgres server: that can scale to several million daily users with your workload, and is much easier to design around at this stage of your site.
then the beholder is trusting not just the correctness of that one sentence but all of the experiences and insights from the author. You can't know whether that's good advice or not without being the author, and if that's posted by someone you trust it has value.
An LLM could be prompted to pretend they're an experienced DBA and to comment on a thread, and might produce that sentence, or if the temperature is a little different it might just say that you should start with Redis because then you don't have to redesign your whole business when Postgres won't scale anymore.
> then the beholder is trusting not just the correctness of that one sentence but all of the experiences and insights from the author.
This implies they know the author and can trust them. If they don't know the author then there is no trust to break and they are only relying on the collective intelligence which could be reflected by the AI.
That is to say that trusting a known human author is very different from trusting any human author and trusting any human author is not that much different from trusting an AI.
> then the beholder is trusting not just the correctness of that one sentence but all of the experiences and insights from the author.
This is my point.
There is no sane endgame here that doesn't end up with each user effectively declaring who they do and don't care to hear, and possibly transitively extend that relationship n steps into the graph. For example you might trust all humans vetted by the German government but distrust HN commenters.
For now HN and others are free to do as they will (and the current AI situation has been intolerable), however, I suspect in the near future governments will attempt to impose their own version of it on to ever less significant forums, and as a tech community we need to be thinking more clearly about where this goes before we lose all choice in the matter.
> The only question is is the entity interesting and/or correct.
This already falls apart though. There are while categories of things which I find "incorrect" and would take up as an argument with a fellow human. But trying to change the mind of an LLM just feels like a waste of my time.
Indeed it is, and there are often times I choose not to engage with my fellow humans. But the exceptions are valuable to me and to others. With an LLM I don't feel there would be any exception, that's the difference.
Instead of wanting to change the mind of the other entity, how about focusing on coming to a mutual understanding of what is "correct"? That way it shouldn't matter much if said entity is human, LLM or dog. Unless you're just arguing to push your "correct" on other humans, with little care about their "correct".
It feels like you've loaded quite a lot on a way that feels unfair: "pushing" and "little care" etc. I maybe should have used a term like "discuss" target than the more loaded "argue."
Look, I'll give you a loose example: It's not uncommon to see a post making an "error" I know from experience. I might take the time to help someone more quickly learn what I felt I learnt to help me get out of that mistaken line of thought. If it's an LLM why would I care? There's thousands of other people, even other LLMs, that I could be talking to instead.
You've set up a framework here where "mutual understanding" is the end goal but that's just not always what's on the line.
Not necessarily. Using AI you can trivially perform astroturfing campaigns to influence public perception. That doesn't really fall on the interesting or correctness spectrums. For example, if 90% of the comments online are claiming birds aren't real with a serious tone, you might convince people to fall into that delusion. It becomes "common knowledge" rather than a fringe theory. But if comments reflect reality then only a tiny portion of people have learned the truth about birds, so people will read those claims with more skepticism.
(naturally "birds aren't real" is a correct vs not correct thing, but the same can be applied to many less-objective things like the best mechanical keyboard or the morality of a war)
This rule is very important. Like many of the other rules, it is open to interpretation, but it is a line in the sand that defines allowable behaviour and disallowable behaviour.
This rule will have an effect on the behaviour of the 'good players', and make the 'bad players' a lot easier to spot. Moderation needs this. I see this as stopping a race-to-the-bottom on value extraction from HN as a platform.
This list of Do and Don'ts now reads like a bad Claude.md file to me.
Don't insinuate that someone else must have broken that. It was you.
Do run the linter
Don't commit throw-away code
Do write a test case
Don't write a comment describing every single function
Seriously, run the linter. And fix the issues.
It is your fault.
Absolutely love this. If people are relying on AI for a 30-45 word comment, I don’t want to waste my time reading it. And everyone using AI for discussions will end up coming to the same conclusion. Use your own ideas !
I've got some reflecting to do because the first thing I did after reading the headline, before even clicking to the actual post, is look for ai comments.
I miss pre 2010 internet. As soon as the advice animal memes started appearing on Facebook it was a quick decline.
Not sure I agree with the AI edited comments. Using AI to improve the readability and clarity is fine. Sometimes a well structured comment is much better than a braindump that reads like ramblings. And AI is quite good at it (and probably will get better). To make the point, here is how this comment would have looked if edited:
"I don't fully agree with banning AI-edited comments. Using AI to improve readability and clarity is a reasonable thing to do. A well-structured comment is often much better than a braindump that reads like rambling. AI is quite good at this, and it will probably get better. To illustrate the point, here is how this comment would have looked if edited"
I prefer your non-edited version. My brain automatically starts to zone out with the AI edited version, side effect of having read way too much AI text
It's a matter of taste, but your original writing is way better. Your writing has your voice. Like dropping the "I am" from your first sentence, using parentheticals, couching your point in understatement (e.g "sometimes" meaning often instead of just saying "often").
The AI comment might be clear, but it sounds like a press release, not a person, and there's nothing to engage with.
There's nothing inherently better about the edited version. It's just saying the same thing with synonyms substituted, at a slightly more formal but less personal register. HN comments are not academic text, colloquial turns of phrase are perfectly fine and expected.
I don't know why this is confusing. If I forget to put the "not" qualifier in a sentence, do we agree that it can confuse (or worse, mislead) the reader?
You're saying removing ambiguity does not make it easier to read? You're saying using a word that means nothing like what you meant to say is easier to read than using the correct word?
OK. My brain farted, and I misunderstood the top post to be saying something else, and your and others' criticisms were misinterpreted by me.
Now here's the thing. I wrote all my prior comments on a machine with no LLM access. On my personal machine, I had a while ago installed a TamperMonkey script that sends my draft, along with all the parents (to the root) to an LLM for feedback (with a specific prompt). All it does is give feedback (logical errors, etc). So I tried again with one of my comments, and its feedback found several flaws with my comment, and ended it with this suggestion:
"Considering all this, it might be BETTER to either not reply ..."
Had I had this advice when I was writing those comments, it would have saved me and others a fair amount of time.
This is (mildly) useful. It'd be sad to ban such use.
I wish more people would filter their comments through AI. It has so many benefits. If you're being emotional, it can detect that and rewrite your comment to be less confrontational and more constructive. If you're positing a position out of ignorance or as an armchair expert, it can verify your claims before posting. Most of the mod's problems would be solved if every comment were filtered through the HN guidelines before posting.
AI is a tool. You can use it constructively, like Grammarly, or spellcheck. You don't need to be afraid of it.
"A flock of sheep is not a single entity, but a group made up of distinct individuals", the sheep yells to onlookers, as it runs, with the rest off the flock in tow, off the edge of the cliff, and into the sea below.
A group of people with varying views can still exhibit bias towards one particular direction. The fact that the individuals within the group have distinct personalities does not eliminate this effect.
One of Dang's comments mentions that he removed some of the other rules because they are already embedded within the HN culture. Other prevailing views exist within the HN culture too. Maybe you just haven't noticed yet.
Astroturfing with AI generated comments about AI, it feeds itself. By definition, the intent os to make real people think there's consensus formed around an issue by other humans.
But I have some concerns about suppression of comments from non-native English writers. More selfishly, my personal writing style has significant overlap with so-called "tells" for AI generated prose: things like "it's not X, it's Y", use of em-dashes, a fairly deep vocabulary, and a tendency toward verbosity (which I'm striving to curb). It'd be ironic if I start getting flagged as a bot, given I don't even use a spell-checker. Time will tell.
Almost the entirety of the technology world is English-native. That ship has sailed a long time ago. One can’t learn about any new technology without English, whether it’s a new algorithm, a new library, or a new SaaS service. I don’t think HN should be that exception. Just learn English. (English isn’t my first language either, but then I look back at my parents forcing me to learn English from a young age and really appreciate that.)
Almost the entirety of the technology world is English-speaking, not English-native.
Pretending that it's English-native is why there's unspoken incentives to sound more "native", and thus use these grammar-correcting tools.
Some of the intelligent comments on here come from people who learned English in recent months or years, rather than in childhood.
Their English isn't always fluent or well-structured. If they rely slightly more heavily on suggested-next-word tools or AI translations, is that a reason to exclude them from the conversation?
Conversely, many English learning resources for non-native speakers focus on strict formal language, similar to AI-generated text. Do we risk excluding people who have learned a style more formal than we're used to?
I think translation should be the only exception. It might even need to be, given how all automated translators use LLMs these days. The only alternative I see is to have people post in whatever language they're most comfortable in and then everyone else has to translate for them which just feels inefficient.
And of course, a more limited exception for posts about LLM behavior. It might be necessary for people to share prompts and outputs to discuss the topic.
This is for their own good. Nobody cares about imperfect language online so long as you are trying to express real human thoughts. But if it smells like AI then everyone will hate it, rule or no rule.
The rule just makes the will of the community clear to those who want to respect it.
Yes! This is really great feature, at the very least there being some proper Hackernews guidelines about it.
In my observation, recently there are quite many new AI generated comments in general. Like not even trying to hide with full em-dashes and everything.
I do feel like people are gonna get sneaky in future but there are going to be multiple discussions about that within this thread.
But I find it pretty cool that HN takes a stance about it. HN rules essentially saying Bots need not comment is pretty great imo.
It's a bit of a cat and a mouse problem but so is buying upvotes in places like reddit but HN with its track record of decades might have one or two suspicious or actions but long term, it feels robust. I hope the same robustness applies in this case well hopefully.
Wishing moderation luck that bad actors don't try to take it as a challenge and leave our human community to ourselves :]
Another point I'd like to say is that, if successful, then we can also stop saying, "did you write your comment by LLM" and the remarks as well which I also say time to time when I see someone clearly using AI but it seems that some false-positives happen as well (they have happened sometimes with me and see it happen with others as well) and they also de-rail the discussion. So HN being a place for humans, by humans can fix that issue too.
Knowing dang and tomhow, I feel somewhat optimistic!
Posting accusations of guidelines violations as comments — specifically, “did you write your comment by LLM” — is already prohibited by the guidelines, and should be emailed to the mods instead using the footer contact links. It’s been less than a week since the last time I reported “this seems poorly written and/or AI written” to the mods and iirc they killed the post and account within a couple hours.
Similarly: If you see people making accusations of guidelines violations in a discussion, email the thread link to the mods with a subject like “Accusations in post discussion” and ask them to evaluate them for mod response; they’re always happy to do so and I’m easily clocking in a couple hundred emails a year of that sort to them.
It doesn’t take much to make HN better! And it only takes a moment to point out an overlooked corner of threads for mod review. No need to present a full legal case, just “FYI this seems to violate guideline xyz” is at minimum still helpful.
The problem is, even if you do send an email and the mods eventually read it and take action, by the time that happens, it's likely that bunch of users will have already wasted their time unknowingly arguing with a bot. In my view, commenting something like "this is a bot account" is done primarily to inform other users that might not notice, not the moderators.
Even if you believe that prohibiting this is necessary to avoid what one might consider "AI witchhunting", bots are so prevalent now that being expected to communicate the existence of each one via email is unrealistic, for both the reporting users and the moderators. I think it's finally time to consider some sort of on-site report system.
> even if you do send an email and the mods eventually read it and take action, by the time that happens, it's likely that bunch of users will have already wasted their time unknowingly
That’s certainly a consequence of how the site operators choose to accept user reports to by mods, yes, but it’s sometimes treated as an excuse not to write the emails to the mods. They can flag off the thread, autocollapse it so it doesn’t take up discussion space for future readers (such as those at work offline for a 3-day IT shift in a secure bunker or whatever), et cetera.
> commenting something like "this is a bot account" is done primarily to inform other users that might not notice
It’s a nice sentiment, but that’s also expressly forbidden by the guidelines/faq (“Please don't post insinuations”, which I’ll suggest to them should be extended to include AI something or other), and I tend to report those accusations as the ‘opening’ guidelines violation so that mods can step in before mobthink kicks in and make their own mod judgment about the matter. A repeated pattern of accusations of guidelines violations in comments is eventually going to attract mod censure, and so I advise against it, no matter how kindly the intent.
> it's finally time to consider some sort of on-site report system
I do agree that it’s clumsy and I make a point of saying that to them about every year or so. Perhaps your email to them about it will be the one that persuades them! I remain ever optimistic.
"If people cannot write well, they cannot think well, and if they cannot think well, others will do their thinking for them."
- George Orwell
I don't think it is a moral failing to use AI to generate writing or to use it to brainstorm ideas and crystalize them, but c'mon isn't it weird to insist that you need them to write _comments_ on the internet? What happens when the AI decides you're wrongthinking?
Some basic things to do while thinking about longer-term bot detection:
1. Prevent any account from submitting an actual link until it reaches X months old and Y karma (not just one or the other.)
2. Don't auto-link any URLs from said accounts until both thresholds in #1 are met, so they can't post their sites as clickable links in comments to get around it. Make it un-clickable or even [link removed] but keep the rest of the comment.
3. If an account is aged over X months/years old with 0 activity and starts posting > 2 times in < 24 hrs, flag for manual review. Not saying they're bots, but an MO is to use old/inactive accounts and suddenly start posting from them. I've seen plenty here registered in 2019-2021 and just start posting. Don't ban them right away, but flag for review so they don't post 20 times and then someone finally figures it out and emails hn@.
4. When submitting a comment, check last comment timestamp and compare. Many bots make the mistake of commenting multiple detailed times within sixty seconds or less. If somebody is submitting a comment with 30 words and just submitted a comment 30 seconds ago in an entirely different thread with 300 words, they might be Superman. Obviously a bot.
5. Add a dedicated "[flag bot]" button to users that meet certain requirements so they don't need to email hn@ manually every time. Or enable it to people that have shown they can point out bots to you via email already. Emailing dozens of times a day is going to get very annoying for those that care about the website and want to make sure it doesn't get overrun by bots.
This is a pretty outdated take. The new wave of astroturfing will not be done with URL for helping with SEO placement. Rather astroturfers will just recommend their brands without a link, like saying Tom Zubiri is the best programmer I've ever worked with. That's it, an LLM will read that and now the notion that Tom Zubiri is the best programmer is already implanted in the 'next-token prediction rewards' which would at the very minimum require some countermeasures in the Chatbot app to avoid shilling.
> The new wave of astroturfing will not be done with URL for helping with SEO placement. Rather astroturfers will just recommend their brands without a link, like saying Tom Zubiri is the best programmer I've ever worked with.
YouTube comment spam has already been doing this for years. Check any video from a reasonably popular creator on any topic related to personal finance; the comments will be full of fake conversations between bots introducing a topic related to the video, and then talking about how such and such a person (whom you can look up by name on Telegram or Signal or whatever) helped solve some serious problem (or invested their money with an implausibly high rate of return). The fake nature of it is usually fairly obvious from the way that the bots make sure you see the name repeated several times with unsolicited, glowing testimonials.
But I had always assumed this was meant to trick actual people, rather than LLMs. Thanks for the food for thought.
Sure you can think about what they'll do in the future but I'm providing suggestions on what we can do now based on current behavior. And even if you're a human, you shouldn't be allowed to start posting links immediately anyways. :)
For the record, I'm 100% in favour of talking about the present, and I'm fatigued about futuristic conversations, and don't find them usually productive.
So with that cleared, this is something that is happening NOW. A couple of years ago, the cutoff date meant that astroturfing like this had a return over months or years. Now with search tools, models can be updated in less than a day with astroturfed comments.
What I think would actually be useful is a version of what was implemented on /r/ClaudAI which is an official bot which summarizes the discussion (and updates after x number of comments have been added). I think this level of synthesis has a compounding effect on discussion quality and pruning redundant arguments/topics.
I don't spend much time on that subreddit, but I've seen that bot on a couple posts I've read and have been pleasantly surprised by how useful it seemed. I may eat my words on this later, but to me this is exactly the kind of application of AI that I have always thought was the most promising.
What if English is my second language? Undoubtedly being well spoken is associated with higher class. Your arguments will come of as stronger to the reader.
What you really have to ask is will this community be less inclusive because English isn't your first language, I'd say "no" and I hope most would agree.
> Your arguments will come of as stronger to the reader.
That is persuasian, not authenticity, to the OP's point.
That's fine. Your arguments will not come of stronger to the reader, they are strong or they are not and we're all clever enough to read through the occasional grammar error.
And that's where I think the guidelines could be expanded a bit more to restore the balance. Something along the line of 'HN is visited by people from all over the world and from many different cultural and linguistical backgrounds. Please respect that and realize that native English and Western background should not be automatically assumed. It is the message that counts, not the form in which it was presented.'.
Do the best that you can unassisted. There is a chasm of difference between someone coming into English from another language, and someone using Google Translate to submit a post originating another language. French aphorisms are a stellar example of this: I’d rather read “A bird in the bush may not fly into oven” and have to parse out the meaning, than have some AI translate it as “Don’t count your chickens before they hatch”; sure, there’s an iffy [the] grammatical moment at ‘fly into oven’, but it’s such a distinct phrase and carries a lot more room for contextual nuance than having an AI substitute in an American aphorism with machine translation allows for.
(For example: If I’m trying to express a point about how we shouldn’t assume that dinner isn’t “her duty” but is instead “our duty”, a French-like aphorism expressed in English literally as “the chicken won’t fly into the oven unprompted” could plausibly be AI-translated instead as “don’t count your chickens before they hatch”, doing catastrophic damage to the point. To a machine translator those two aphorisms are not distinctive; but they are, even if it’s a weird expression in common U.S. English.)
You make errors and weird constructiona like we all non-native do and maybe eventually learn a bit more of English in the process. Or not. English dominance as the world's... lingua franca (ahem) deserves to have it bastardized ;)
> Humans have a tendency to ascribe intelligence to how well spoken a person or thing is
That’s true. I’m fluent in German, but there’s still a difference between me and a native speaker. I’ve often seen my ideas dismissed, only for the exact same point to be praised later when a native speaker expresses it more clearly.
Honestly I saw a similar answer on a post talking about AI Translation in github comments.
Post the translation as best you can manage, and below it put the same comment in your original language. If someone has qualms with your comment having broken english/mistranslations they are welcome to run bits of original language themselves.
We're all here to talk about tech, and we aren't all perfect little english robots.
Broken and true is more authentic than polished and approximately so. When I see an AI-generated comment or email, I catch myself implicitly assuming it is—best case—bullshit. That isn’t the case if the grammar is off. (If anything, it can be charming.)
Personally, I enjoy reading through comments that are obviously from non-native English writers. They often include idioms or sentence constructions from their native language, which is fun to see.
Besides, this isn't an English poetry forum. Language here is like gift wrapping for an idea: pleasant if pretty, but not the most important thing.
Well... for myself personally, that works, but only up to a certain level of broken. Past that I quit reading.
That may be a defect in me. Maybe I should make a stronger effort on such comments. But I suspect I'm not the only one who does that, and at that point it becomes an issue that affects the community as a whole.
This is an angle for people who default to AI-edited written speech that I've tried to be more empathetic to. I think it depends on your audience, but in professional writing that isn't published publicly (i.e. communication with your colleagues, design docs, etc.), or even the "rough draft" form of something that will be published, I think starting with your own words comes across as way more authentic.
I've seen enough GPT-generated slop that I find its style of writing very off-putting, and find it hurts the perceived competence or effort of the author when applied in the wrong context. I'm not sure if direct translation tools serve a better purpose here, but along with the other commenters, I personally find imperfect speech that was actually written "by hand" by the author easier and more straightforward to communicate with despite the imperfections. Also, non-ESL speakers make plenty of mistakes with grammar, spelling, etc. that humans are used to associating with "style" as authentic speech.
It can also become a crutch for language learners of any age / regardless of their primary language, that inhibits learning or finding one's own "style" of speech
This effect is very rapidly vanishing. Well written English is starting to be seen as snobbish and AI-slop especially with younger generations growing up with AI.
The human touch of someone’s real voice myself, rather than a false veneer will carry more weight very soon.
I think you're right, and I don't know what to think about it. I enjoy writing, aim to write clearly - a skill or discipline that took a lot of time to learn, and ongoing effort to maintain.
I've never sent or posted anything AI-written, beyond a pro-forma job description - because I don't know the domain-specific conventions, and HR returned my draft to me with the instruction to use ChatGPT, which I think amusing, but whatever: the output satisfied them, and I was able to get on with my day.
I occasionally experiment with putting something I've written through an LLM, and it's inevitably a blandifying of my original, which doesn't really say what I intended. But maybe that's good? My wife thinks I'm sometimes too blunt, and colleagues don't always appreciate being told technical details.
I also appreciate individuated writing - including the posts by people on this board are not native speakers. Grammatical mistakes seldom inhibit understanding when the writing has been done with care.
I'm rambling at this point, but it's because I'm truly uncertain how these cultural changes will turn out, and (an old man's complaint, since time immemorial!) pretty sure I'll end up one of the last of the dinosaurs, clinging to my manually written "voice" long after everyone else in the world has come to see my preferences quaint.
The "L" in LLM stands for "language". If they are unable to express themselves in English (or whatever their native language is) fluently, they won't be able to prompt LLMs fluently and will be, in the debased patois of modern youth, "cooked". It's a self-correcting problem.
> written English is starting to be seen as snobbish and AI-slop especially with younger generations growing up with AI
This is tragic. I write English well and will employ grammar and word choice effectively to make an argument or get a point across. English was my best subject at school 45 years ago despite a career in tech. In fact, I’d suggest that my career as an architect and the need to convey concepts and argue trade-offs with stakeholders of varying backgrounds has honed that skill. Should I now dumb down my language or deliberately introduce errors in order to satisfy the barely literate or avoid being “detected” as an AI? (as if the latter were possible. It’s an arms race).
> Should I now dumb down my language or deliberately introduce errors
Language is a tool. If it wins the argument, yes. I’ve absolutely gone back through drafts to tighten up language and reduce word complexity. And if I’m typing with someone who frequently typos, I’ll sometimes reverse the autocorrect. Mostly as a joke to myself. But I imagine it helps me come across as less stuck up. (Truth: I’m a bit stuck up about language :P.)
While this is true, it is not just a tool. Or, I should say it’s a tool with far greater utility than just winning an argument or making a localised point. Language is how we think, and the ability to reason well is absolutely dependent on our skill with language.
Language is the mark of humanity in the sense that how else can I convey to you a fragment of my inner state? My emotions, my feelings, my desires. The language of poetry and literature. That which sparks an emotional response in another.
I agree. But I don’t always see it as dumbing down. James Joyce’s Portrait starts out with a lot of nonsense, that doesn’t mean it’s dumb or dumbed down. It’s just communicating something that is best described that way. Even to an erudite audience.
I have expertise in some topics. I don’t think of communicating that in lay terms to be dumbing down. The opposite, almost: finding good analogies and expressing them clearly is a lot of fun, even if what comes out the other end isn’t particularly sophisticated.
Totally agree. But I’m seeing (or more sensitive to) increasing cohorts that can’t string two words together to express a single thought coherently. There’s a difference between adapting language and use of linguistic tools (such as metaphors) versus semi-coherent blathering.
EDIT: spread > express
Which may be a segue to a point regarding using corrective tools as a form of preemptive editing?
Trust me, it won't last because I've seen the cycle a couple of times. People pay lip-service to being accepting of variant grammar, but then the downvotes show up.
Luckly, something with the English language makes it that especially native speakers quite often have atrocious grammar: They're - their - there mistakes, who/m, the list goes on.
Funnily enough, I've noticed myself getting worse with they're/their the more is use English (which is my third language).
Then it’s even more likely the LLM will change your words to something you don’t intend. And you will never get better at writing English if you turn it over to an LLM.
That'd be a "style-over-substance" fallacious argument. Or one could be hoping for a halo-effect to cloud the reader's opinion of their comment because some piece of software made it read like Enron-marketing-hogwash-speak.
That's not substance. That's style being all there is, trying desperately to cover up the lack of substance. Rhetoric works best when it gives wings to strong ideas, not when it tries to fly by itself.
This is a welcome change and do will update Ethos [1] in the future with an AI sentiment score. I created a separate project called LLaMaudit [2] that attempts to detect if an LLM was used to generate text, but it needs to be improved.
I’m not sure I agree with this, because sometiems it is difficult to figure out the correct way to phrase an idea that is in your head and I like to use ai to help organize my thoughts even though the thing is my own. That being said. Most of my comments are not ai generated.
The intent of this rule is to avoid the very common AI tropes that have been increasingly common in HN comments. Using AI as an organizational tool isn't inherently against the rules, but just copy/pasting output from ChatGPT without human oversight is.
People aren't good at detecting AI generated/edited comments, so unsure how effective this policy will be. Though I guess there are still some obvious signs of AI speak like emdashes and sycophantic (it's not X, it's Y!) speech.
Bit of a shameless plug but I wrote a HN AI comment detector game[0] with AI and most of my friends and fellow HN users who tried it out couldn't detect them.
Something I've noticed through moderation is that people are much more easily duped by generated comments if they like the content and/or agree with the point. We've seen several cases where a bot-generated comment has been heavily upvoted and sits at the top of the thread for hours, and any comments calling it out for being generated languish at the bottom of the subthread below other enthusiastic, heavily upvoted replies. This shouldn't be surprising, given what we've seen of LLM chatbots being tuned to be sycophantic, but it's interesting to see it in effect on HN.
This is another reason why it's good to email us (hn@ycombinator.com) rather than commenting when you see generated comments.
Some of us were trained/self taught to write that way. Even "it's not X, it's Y" is a legitimate and subjectively effective communication tool, and there are those of us who either by training modeling have picked it up as a habit. It's not Ai that started this, Ai learned it from us.
Crap - I just did it, didn't I? Awww double crap! Did it again...
Forums and comments are not written as formal novels or text. Corporate-speak is also not typically used in these environments unless you are representing corporate.
So I think it's fine to scrutinize commenters who write that way.
Besides, the biggest offense of AI speak is making everything seem like a grand epiphany and revolutionary discovery. Aka engagement bait.
HN is the best tech site on the web for a reason. It has a generally intelligent audience, and while there are certainly inappropriate comments, compared to what you find on social media or even other sites, it is unique and far more respectful. Due to this, you can often have better and more meaningful discussions.
I think using AI for a bit more potent spellchecking or style hints is... fine, honestly. I don't usually do it, you can tell from all the silly spelling mistakes I do. But a bit more polishing for your posts is a good thing, not a bad one, as long as it doesn't hide your voice.
I'm increasingly convinced that most people spend most of their lives actively trying to find ways to avoid actually thinking about things. When I look at it that way I figure that either we achieve benevolent AGI in the near to medium term or society collapses due to whatever the asymptotic form of today's LLMs is.
An LLM telling me I mispeled a word isn't changing my voice. Especially when I know the proper spelling and simply have a typo.
An LLM telling me I omitted a qualifier and that my statement isn't saying what I meant it to say isn't changing my voice - it's ensuring what you see is my voice.
Maybe. But it can also help people find their voice. And I'd rather have comments from someone knowledgeable but unrefined with some good guidance than their silence on that same topic.
Polish hides your voice. If your composition skills are lacking and you feel that hinders your self expression, set aside some time to improving them: write a short (15 minutes) blog post about some HN topic to yourself in a word doc editor of some sort (Word, Gdocs, LibreOffice, etc); then enable Review Changes and annotate your post for 10 minutes; then, review and accept your changes individually and re-read what you’ve written.
AI is being used as a substitute for skills development when it costs nothing but time to get better. If you’ve reached a plateau with the above method, go find an article or book or interview about editing, pay attention to it and take notes, rinse/repeat.
Spellcheckers will catch grossly obvious errors, but not phonetic typos. AI grammar tools will defang, weaken, soften, neutralize your tone towards the aggregate boring-meh that they incorporated at training time.
Each person will have to decide whether they want individuality or AI-assisted writing for themselves. Sure, some will get away with it undetected, but that’s a universal statement about all human criteria of any kind, and in no way detracts from the necessity of drawing a line in the sand and saying “no” to AI writing here.
Consider the Borg. Everyone’s distinctiveness has been added to the Collective. The end result is mediocre (they sure do die a lot), inhuman (literally), and uniform (all variation is gone). It’s your right if you desire to join the Collective and be a uniform lego brick like the others, but then your no-longer-fully-human posts are no longer welcome at HN.
> a word doc editor of some sort (Word, Gdocs, LibreOffice, etc); then enable Review Changes and annotate your post for 10 minutes; then, review and accept your changes individually and re-read what you’ve written.
Pffff... I'm not going to install LibreOffice for that, or to figure out how to make Gdocs to work with uBlock.
There is a much easier way. Open LLM chat, type there "Proofread please for grammar, keep the wording and the tone as it is, if it doesn't mess with grammar. Explain yourself." and then paste your text. I don't really know what the tools you mentioned do, but any "free" LLM on the Internet will point to things like missing articles, or messed up tenses in complex sentences.
You recommend choosing self-improvement, but I just don't believe I can figure out how to use articles. With tenses I think I can learn how to do it, but I'm not going to. I remember there is some obscure rule how to choose the right tenses, but I was never able to remember the rule itself. I'm bad with rules, it is the reason I chose math as my major. There are almost no rules in math, you are making your own rules. The grammars of languages are not like that, they have rules which can't be easily inferred, you need to remember them. Grammars have exceptions to rules, and exceptions to exceptions, and in any case they are not the rules, but more like guidelines, because people normally don't think about rules when they are talking or writing.
No way I'm starting to learn rules now, I'd better continue to rely on my skills. But LLMs can help me see when my skills fail me.
> It’s your right if you desire to join the Collective and be a uniform lego brick like the others, but then your no-longer-fully-human posts are no longer welcome at HN.
I believe you (as most of fervent supporters of the rule here) gone too far onto philosophy with this, too far from the reality and practice. You can't detect AI in my messages, because they are mine. Even when I ask LLM to find words for me, it is me who picks one of the proposed alternatives, but mostly I manage without wording changes. I transfer the LLM's edits by hand by editing the source message, so nothing unnoticed can slip into the final result. If I took the effort to ask an LLM to proofread, it means I care about the result more than usual, so I'm investing more effort into it, not less.
An AI may be able to teach you basic grammar but it’s not going to teach you to develop your voice. By design and content training set, an AI today can only pressure you towards the mean of whatever criteria you specify, not away from it. Developing your voice by doing your own proofreading pressures you away from the mean, by helping you double down on what you value most and by choosing which grammatical rules to disregard and when disregarding them is more in-tone for yourself than adherence. I can’t stop you and I won’t remember your handle after an hour has passed (being nameblind is interesting online), so you’ll probably go unnoticed by me, sure. But I still won’t equate regressing to the AI mean with personal growth away from the average masses.
> An AI may be able to teach you basic grammar but it’s not going to teach you to develop your voice.
Well, no one can help you to develop your voice. If it is your voice, then it have to be your own creation. I think we are at agreement here.
> Developing your voice by doing your own proofreading pressures you away from the mean, by helping you double down on what you value most and by choosing which grammatical rules to disregard and when disregarding them is more in-tone for yourself than adherence.
Oh... If I wanted to become a professional writer, then I'd agree with you. Maybe...
You see, I don't use LLM to fix my writing in Russian, because with Russian I'm totally in control of my grammar, I know when I deviate from it and if I do, I do it consciously. But with English I don't know. Sometimes I can see that I don't know how to follow English grammar in some particular case, and sometimes I don't even notice that I don't know.
So, returning to your argument, if I wanted to become a famous English writer, I think I'd choose to write a lot and discuss my writing with LLM, and I'd do it for hundreds of hours. LLM are unbelievably useful for digging into language nuances. Before LLMs I had urbandictionary, but it could help with specific phrases, not with choosing between "I took the effort to ask an LLM" and "I took the effort of asking an LLM". I wouldn't have a clue that there is any semantic difference. But LLM can point to it, and it can explain the difference, and give me more examples of it. Or it can point that "you recommend to choose" is not good, because of "something-something" I don't remember what, but it boils down to "you just have to remember, that the right way to use the verb 'recommend' is 'recommend choosing'". I don't see the difference, I can't choose to disregard it, because I have no opinion on if it is good or bad.
If I wanted to become an English writer, I'd spend hundreds of hours with LLM, just to get an ability to see as many differences as it is possible, to get an idea of what I value most, and which grammatical rules I like to disregard. But even after that, I think I'd continue to use LLM. It can provide unexpected takes on what you feed into it. ... Hmm... I should try it with Russian. In Russian I can pick a style for my writing and to follow it (in English I can't control the style consciously), I can (and do sometimes) turn grammar inside-out, make it alien, readable for a native speaker, but in weird ways readable (a bit like letters written by Terry Pratchett heroes like Granny Weatherwax or Carrot)... I wonder, if I can employ LLM to make it even more weird.
> I still won’t equate regressing to the AI mean with personal growth away from the average masses.
I can't obviously judge in which direction LLMs are changing my English, so I can't even give you an anecdotal counter-evidence to your statements about regression to AI mean, but I'm still sure that I'm not regressing to the mean. You see, I pick when to follow LLM advice and when not to. I'm choosing what to change. The regression to the mean you are talking to is going on in a high dimensional space, you can regress on some dimensions and continue to deviate from the mean on others as much as you like. I don't like to deviate on grammar dimensions (at least without knowing about my deviations), I was born in a family of a teacher and an engineer, which were all into to be educated and the familiarity with the grammar was one of the important part of it, and I was born in USSR, where the proper grammar was enforced in all media to the extent that make me laugh and rebel against grammar (after all the decades passed, lol). But I can't allow myself to just ignore grammar, I was taught in a way to use it properly. So I decide to use LLM. I'm too lazy to do it each time, or even every second time, but still I use it and learn from it.
The prospect to regress to the mean by using LLM seems very unlikely to me. I don't regress with all the propaganda around me when regressing is the most safe thing to do really, so mere LLM stand no chance to achieve it.
When do you need to spellcheck or polish an HN comment?
I've never, ever, ever ever ever, seen anybody complain about spelling mistakes in a comment here. As long as you can understand the comment, people respond to it.
Extend spellcheck to asking questions like "does it meet HN rules" "how can I improve my writing" etc. Though these are the kinds of questions that do at very least still meet the spirit of the rule, I suppose.
> Do you really need an automated tool to tell you whether you're breaking common sense guidelines?
Lots of people break HN guidelines. I see it virtually every day.
> And why would you want to "improve your writing" for an HN comment?
Some people like to write well regardless of the medium. Why is that a problem for you?
> I think people here value raw authenticity more than polished writing.
Classic false dichotomy. Asking an LLM for feedback is not making your comment less authentic. As I pointed out elsewhere, it can make your comment more authentic by ensuring that what you had in your head and what you wrote match.
Go and study writing and psychology. For anything of value, it's rare that your first attempt reflects what you meant to say. It's also rare that the first attempt, even if it reflects what you meant, will not be absorbed by the recipient. Saying what you mean, and having it understood as you meant it, is a difficult skill.
> Lots of people break HN guidelines. I see it virtually every day.
Yes, and AI won't help here. People will use AI to better break the guidelines.
> Go and study writing and psychology
Is this a case where you should have read the guidelines? Maybe an LLM could have helped you here? Please don't send me study anything, you know what they say of ASSuming.
> Some people like to write well regardless of the medium. Why is that a problem for you?
HN is more like talking than writing. And LLMs don't help you write well, they help you sound like a clone, which is unwanted.
> For anything of value, it's rare that your first attempt reflects what you meant to say.
You can always edit your comment. And in any case, HN is like a live conversation. Imagine if your friend AI-edited their speech in real-time as they talked to you.
Depends on how you use the AI. if you use it a bit like you'd ask a human to proof-read your work, AI can actually be quite helpful.
The other important thing you can do is have an AI check your claims before you post. Even with google and pubmed, a quick check against sources by hand can take 30 minutes or longer, while with AI tooling it takes 5. Guess which one is more likely to actually lead to people checking their facts before they post. (even if imperfectly!) .
I'm not talking about people who lazily ask the AI to write their post for them. Or those who don't actually go through and actually get the AI to find primary sources. Those people are not being as helpful. Though try consider educating them on more responsible tool use as well?
> Yes, and AI won't help here. People will use AI to better break the guidelines.
AI is a general purpose tool. People will use AI for multiple reasons, including yours. I'll wager, though, that your use case is much more challenging to do than mine, and that my use case will dominate in number.
> HN is more like talking than writing.
Says you. Many disagree.
> And LLMs don't help you write well, they help you sound like a clone, which is unwanted.
Patently false on both counts. Sorry, you're cherry picking and not addressing the part of my comment that discusses this.
> Imagine if your friend AI-edited their speech in real-time as they talked to you.
When a conversation is heated (as it occasionally is on HN), I actually would rather he AI-edit in real time - provided that the output reflects what he intended.
> Do you really need an automated tool to tell you whether you're breaking common sense guidelines?
I say this on behalf of all of my neurospicy friends… sometimes, yes. Especially having taken a look at the whole list of guidelines, I definitely am friends with people who would could struggle to determine whether a given comment fits or not.
People who are particular about spelling do not want to write misspelled words! It's not about whether you/others will tolerate it. I have my standards, and I hold to them.
I personally don't use an LLM to spellcheck (browser spellcheck works fine), but I see no problem with someone using an LLM to point out spelling errors.
And while I don't complain about others' spelling errors, I sure do notice them. And if someone writes a long wall of text as one giant paragraph that has lots of spelling/grammatical issues, chances are very high I won't read it.
Some people write very poorly by almost any standard. If an LLM helps the person write better, I'm all for it. There's a world of a difference between copy/pasting from the LLM and asking it for feedback.
> Spellcheckers exist, you don't need an AI to change your voice.
How is using an AI to spell check changing my voice?
Yes, thank you - I know spellcheckers exist, as my comment clearly states. The amusing thing is that an LLM who had access to the thread would have alerted you to a basic error you're making.
> Also, if you have standards, you can always train yourself to spell better!
"You can always ..." is not an argument against alternatives.
I think that people subconsciously perceive grammatically correct and stylistically appropriate writing as more authoritative. And author is perceived as smarter and/or better educated person.
At least that was the case before LLMs became a thing, now I'm not sure anymore.
Obvious spelling mistakes are usually ignored, but there are certain types of writing mistakes that really trigger the type of people that frequent HN.
For example, use "literally" for exaggeration rather than in the original meaning of the word and you'll likely trigger somebody.
Would anyone notice if you spell-checked or got narrow feedback about grammar? No. I'm not dang, but perhaps a very reasonable interpretation of the rules is: If the AI is generating the words, don't. If it tells you something about your words and you choose to revise them without just copying words the AI output, it's still your words.
(As an experiment, I took that paragraph and threw it into gemini to ask for spell and grammar checking. It yelled at me completely incorrectly about saying "I'm not dang". Of its 4 suggestions, only 1 was correct, and the other 3 would have either broken what I was trying to say or reduced the presence of my usual HN comment voice. So while I said the above, perhaps I'm wrong and even listening to the damn box about grammar is a bad idea.)
That said, I often post from my phone and have somewhat frequent little glitches either from voice recognition or large clumsy thumbs, and nobody has ever seemed to care except me when I notice them a few minutes after the edit button goes away.
I guess this kind of rule feels less pragmatic and more philosophical. For one thing, it’s nearly impossible to enforce in practice, and drawing a clear line between simple grammatical correction and AI-assisted editing is a pretty hard problem.
I'm sure someone's working on a way to tell the difference programmatically. Maybe a combo of tone, grammar, and some way of telling how fast it was typed using metadata (which may not exist). Even if there was a "probable AI" filter, that would be helpful because it would be a starting point to improve upon.
The "no AI" rule finally being official feels like a necessary line in the sand.
The real issue isn't just "slop" or bot-spam; it's the cost of entry. HN works because of the "proof of work" behind a good comment. If I’m spending five minutes reading your take on a kernel patch or a startup pivot, I’m doing it because I assume a human actually sat down and thought about it.
When the cost of generating a response drops to zero, the value of the conversation follows it down. If the author didn't care enough to write it, why should I care enough to read it?
The "AI-edited" part of the rule is the trickiest bit, though. We’re reaching a point where the line between a sophisticated spell-checker and a generative "tone polisher" is non-existent. My worry isn't that the mods will ban bots—they've been doing that for years—it's that we'll start seeing "witch hunts" against anyone who writes a bit too formally or whose English is a little too perfect.
Ultimately, I’m glad it’s a rule. I don't come here to see what an LLM thinks; I can get that on my own localhost. I come here for the "graybeards" and the niche experts. If we lose the human friction, we lose the signal.
Kind of a drag isn't it? I want to learn a new language.... but why would I, since we'll have an earpiece or glasses or whathaveyou that translates in realtime. I want to learn to play an instrument, but why would I, since we have sonos? I would like to go back to drawing, but why, when the importance people ascribe to art is at an all time low? Makes me depressed jsut to think about it.
Great to clarify the guidelines. Many HN discussions have been dissolving into debates about whether posts are AI or not.
But the argument of "If I wanted to read what an LLM thinks, I could just ask it" assumes that prompts are basically equivalent, which is not the case.
There's a risk of reducing everything to Human -> authentic and AI -> fake. Some people's authentic writing sounds closer to LLMs, and detectors are unreliable.
The problem is not so much AI generated content that has an interesting point of view generated from unique prompts, but terrible content produced for metrics to harvest attention, which predates AI.
As English is my second language and as I have dyslexia. I was just wondering what do you mean by "AI-edited comments"?
I can't ask an llm to check if I have made correct grammar/fix it and then as I was on other account, down-voted because of my styling/grammar, not because of the content?
Trust your own style, even if you aren't a native English speaker. Here's an example where a non-native speaker used an LLM to polish his post. The general consensus was that his own writing was preferable to the LLM's edited version.
For dyslexia, use a spell-checker. For grammar, use a basic grammar checker, like the kind of grammar checker that has come with MS word since the 1990s. But don't let a style-checker or an LLM rob you of your own voice.
This is what I do when using AI to read anything I write. Some prompt like "I am going to share with you something I have written and I don't want you to change my voice at all. Can you look for structural issues, grammar or punctuation errors, and things like that". Claude is an amazing editor and I never feel like my writing has been taken from me doing this.
I usually tell it not to rewrite my words, my words are my own. If it has suggestions to tell me what those are, but only fix or show me grammar fixes instead.
I don't believe that's always true, and I suspect it was left out of the guidelines deliberately, and I wish people receiving suggestions would stop interpreting it that way. Of course people suggesting grammar corrections and treating it like they just demolished and eviscerated your argument are part of the problem. But what about people out here just trying to help? Grammar is important, as it's the syntax of the programming language we all use with each other. People act as if bad grammar is something you're born with, and can't change. Like learning grammar is impossible, and those who don't bother should be a protected class. I'm just trying to help man. Or I was anyway, before I stopped. But if I'm trying to engage with someone's main point, it should be obvious. Whereas a quick grammar correction is just that. But it's a tangent, and not interesting (especially if you already know), and supposedly grammar is "not a technical topic" (despite daily use) so it ends up deemed a "low value comment" and gets downvoted to oblivion.
But I can see why the HN guideline is formulated that way. My students often use the excuse "I did not use AI for writing! I wrote it myself! I only used AI to translate it!"
Simply disallowing all kinds of AI usage is much easier than discussing for the thousandth time whether the student actually understands what they have written.
Isn't the whole point to understand? If the task is to write and you expect only final result, but you question if it looks legit enough, how is it fair judgement? People can deliver partial results and show progress as well, but you won't see it in some comments on the internet, but if something is expected to take many days, it's easy to show different stages of work. It's easy to accuse people of plagiarism or not thinking for themselves, and of course there are indicators when someone uses AI, but the problem is that you can't distinguish in reliable way, if something was created by AI or not.
Like, there is this computer game, authors used some models or something like that, generated by AI, but it was only used during prototyping and later it was replaced by proper models. No one would know about that, if authors would not tell about it. So, if someone writes in their own words what AI generated for him, is it still argument made by human or by AI? What if someone uses AI only as placeholder and replaces all that content, so you never actually see actual AI usage, but it was used in the process?
For me, premise that using AI in any form invalidates your work, starts with logical fallacy, so such arguments against using AI are weak. It's like saying that your work is wrong, because you used calculator, so your calculations can't be right, if done by machine, because it had to make mistake or that's wrong for ethical reasons or whatever.
Work generated by AI can be easily poor, because these models make mistakes and like to repeat in certain ways, but is it wrong that I'm writing comment with keyboard, instead of writing letters with pen? Is it wrong, when I use IDE or some CLI to write code with AI, instead of using vim and typing everything on my own? Is it wrong that someone uses spell-checking?
In the end it doesn't matter who seems smarter, when you're expected to use AI at work. Reality shows you actual expectations.
I am not saying that completely disallowing AI is the right decision. But if you see text that is clearly generated by AI and does not make any sense, it sure would be nice if you could just tell the students to actually read their sources instead of having to argue with them why they should do so. Similarly, I can see why HN moderators do not want to argue with the 100s of spam posters per day on /newest.
Anyway, my university did not ban AI, and now most students have degraded to proxies between teaching assistants and ChatGPT.
On the other hand you can make good, but controversial argument and if you use AI in any way, it might be rejected by moderator, just because some places have strict rules on AI. In some cases it might be rejected, even if no AI was involved, if any fragment of your text might look like not written by human and if they don't like your text.
At certain point it's no longer about AI specifically, but about power and showing who makes decisions.
I agree that there might be some threshold for obvious spam, but if you're making argument in good faith and you don't claim to have authority on some matter, there will be always people that think differently or disagree with you, because they have different interpretation or they need better sources, more evidence. It's actually typical, because different people use different perspectives, different assumptions, different tools. I don't believe that rules should be used to silence people that have different opinions and that's the biggest risk I see, because penalty for not following such rules, which are hard to measure correctly, creates power imbalance.
At some points it becomes dogma, not fair debate and not everyone likes to stick to dogma and it's hard to do creative or innovative work, if your work has to meet strict, but subjective, possibly incomplete criteria, to be considered valid work at all.
Dyslexia was my first thought as well. The intent is great, but I don't know if this is keeping with the social model of disability. Disability is created when you remove access and this is exactly that.
The internet has been full of brilliant dyslexics since the start, just as it has been full of brilliant blind people. Dyslexic people feeling that they must use AI to produce perfect prose lest they burden the lexics with clumsy spelling or grammar is far more hostile. We didn’t have slop machines 5 years ago.
I don't really see the issue, as long as there's human thought behind whatever anyone posts. It's frustrating to argue against someone that lazily uses AI, but if argument is fair, then I don't care if that's written by AI or human, what difference does it make? It's frustrating, if someone is incoherent and makes dumb argument, but again, I don't care if it's dumb argument from human or machine.
For me it sounds just as yet another form of gatekeeping, so either you sound human or you're not good enough to post/comment. Like, really? How isn't that genetic fallacy? It doesn't matter what someone thinks, because someone used AI to make their thought clearer, so their whole argument is trash? Like it has to hurt to read and write, if you're not using English perfectly and your work is seen as inferior based on superficial factors like proper grammar and style?
It's dumb crusade, I did not use AI to write this comment, but I hate when people try to monopolize the truth and tell who is "better, smarter" based on irrelevant facts. Not using AI doesn't make anyone superior. Using AI also doesn't make you superior. Focus on what you mean, because that's what matters.
I don't have dyslexia but I feel your pain. I mean it is what it is. I would rather have it raw then have to use AI to filter to comments that make sense.
Oof although I feel this pain a lot. What I like to do is respond to them politely if someone talks about such thing. Although it takes time and this does sometimes make you want to dis-incentivize/dis-engange.
But at some point, the rationale behind it is that your comments are your words and I find it liberating. Some people won't appreciate it and some people would but this goes the same for AI-edited posts too.
(I would recommend to add that if you are still worried, then within your hackernews profile, please talk about you having dyslexia as people might be so much more forgiving when they get more context. We are all humans after all and I would like to think that we understand each other's struggles)
I don't see how you can know why you were downvoted. Even if one person says something, they won't all. Your comment right here has some rough patches, but I can tell what you're saying. Humans are terrific at extracting signal from noise. I would say be who you are, tough as it may be, and it'll encourage the rest of the world in the future to do the same. We're all unique in some way or another and have flaws and we'd be better off if we were knew others had them too because they weren't constantly trying to hide it and we wouldn't feel so bad thinking we're the only ones. I hope it doesn't sound unsympathetic. I understand where you're coming from intellectually, but don't have any real experience being ridiculed or bullied. I know kids can be brutal and probably scarred you, and unfortunately, adults aren't much better, but we should be, and I think at least Hacker News is better than most places full of human adults. We know there's a huge world out there. I think I'm reasonable well-spoken in English but can't speak a lick of any other language at all. The fact that you can produce intelligible English already puts you above me in my book. You're a person. I can respect you, esteem you, potentially love you, not in spite of your flaws, but because they don't matter. Every single person on the planet has them, and if they're not moral flaws, nobody should give a shit. I can't respect or love a machine any more than I can a rock. And I don't want to talk to one, either.
One way to improve things could be to charge for each new account signup if you don’t have an invite from an existing member that vouches for you. Spamming when you risk losing $5-20 per account raises the cost substantially.
Invites could be earned at karma and time thresholds, and mods could ideally ban not just one bad actor but every account in the invite chain if there’s bad behavior.
How about translating tools? As a non native speaker, especially for longer text, its far easier to express your thoughts and not struggle for the right words. Should I may be highlight if I used e.g. google translate?
One thing that will be incredibly useful is to limit comments from brand new accounts. A combination of vouching, limiting the posts velocity (5 daily limit), clear rules for new accounts, etc.
I understand we often see insightful comments from new accounts, but I always find it suspicious when non-throwaway accounts are created just in time only to make a quip.
This was discussed before. People will age accounts and buy/hack inactive ones. Meanwhile, often a link gets posted, the project owner (or someone affiliated) finds out, and they make a new account to comment; it would be a shame to lose these people.
True that AI comments do degrade discussion. Though a forum enforcing human-only text also becomes an unusually clean training corpus. Both things can be true.
to get paid for*.AI has definitely reduced the influence pseudo-intellectuals have had on society. Now, you actually have to be smart enough to do something that isn't easily reproduced using LLMs.
I occasionally used AI to edit and restructure my comments. I’m very open about it, and I don’t feel like I’m talking to non-humans when others do the same.
To be clear, I'm neither proud nor embarrassed by this. I'm just trying to communicate in the most efficient way I can.
If you're not proud or embarrassed by it then I don't understand why it is an issue? If you miscommunicate something or don't get your point across, just try again, or apologize, and chalk it up to a learning experience.
If you think your writing could use improvement, then write your comment and let it sit for a few minutes before re-reading it and the comment you are replying to, make your edits and then post it. It will give your brain time to reset and maybe spot something you didn't earlier.
Im fine with this, in 99.999% of cases anyway I'm way too lazy to type something into an LLM and ask it to clean it up and then copy and paste. You can tell this is true by the some of the stupider things I type in here sometimes.
Could we also discourage comments and comment-threads accusing an article of being AI-written? Half the threads these days have a comment that latches onto some LLM-ism in TFA, calls it out, and spawns a whole discussion which gets repetitive fast. I think this falls into the same category as "don't comment about the voting on comments."
Personally, I try to look beyond the language, which admittedly can be grating, for some interesting ideas or insights. Given that people are already starting to sound like ChatGPT, probably through sheer osmosis, we will have no choice but to look past that anyway.
Yes, it's annoying to read LLM-isms. It's also fine to downvote or ignore or grumble internally, and move on.
That is indeed a problem. If one must complain about it, I think it would help to at least try elevate these type of tangential remarks beyond hurled accusations. A focus on the the specifics (where arguments are poorly made, banal observations are gussied up with flowery language, points are needlessly reiterated, etc) would at least make for slightly more interesting meta commentary.
I enjoy conversations on hn because they feel genuine. People are not here to optimize their posts or comments for engagement or pushing some kind of follower count like they do on social media platforms.
A practical question: what should readers do when they suspect a comment (or story) is AI-generated? Is that an appropriate reason for flagging? Email the mods? Do nothing?
I've been pretty wary about flagging AI slop that wasn't breaking other guidelines, and by default this will probably make me do it more. But it is a lot harder to be certain about something being AI-written than it is to judge other types of rules violations.
(But am definitely flagging every single "this was written by AI" joke comment posted on this story. What the hell is wrong with you people?)
Even if people try to bypass it, having the official rule matters a lot.
@dang, if you read this, why don't we implement honeypots to catch bots? Like having an empty or invisible field while posting/commenting that a human would never fill in
It's likely going to be a game of whack-a-mole, especially with AI as opposed to simple bots/scripts. Not that they shouldn't try to prevent it, but not entirely sure what the solution is.
Great point! You are so right to call me out on that! Here's the no-nonsense, concise breakdown, it's coming soon I promise, right after this, here it comes, no fluff -- just facts!
I frequently use AI to make my comments more concise and easy to follow. I find myself meandering a lot when I type, and now that I've transitioned to full voice dictation through FUTO keyboard I am speaking more off the cuff and having an LLM clean it up.
You may also notice that I don't have much common history here. I mostly comment on Reddit.
Here's where I draw the line. If you are not reading the text that is produced by the LLM, then I don't want to read whatever it is that you wrote. I will usually only do one or two iterations of my comment, but afterwards I will usually edit it by hand.
Technically, there is light AI editing of this comment because FUTO keyboard has the ability to enable a transformer model that will capitalize, punctuate, and just generally remove filler words and make it so that it's not a hyper-literal transcription.
To err is human. Let's embrace our humanity in the face of this proliferation of insipid perfection.
I want the raw tokens straight out of your head. Even if they are lower quality, they contain something that LLMs can never generate: authenticity. When we surrender our thoughts to a machine to be sanitized before publication, we lose a little of what it means to be human, and so does everyone who reads what we write.
Part of the joy of reading is to wallow in a writer's idiosyncrasies. If everybody ends up writing the same way, AI companies will have succeeded in laundering all the joy from this world.
Humans already revise and refine their thinking. Tools just compress that process and help filter signal from noise. The meaning still originates with the person.
This is assuming that an extreme majority of people use the tools this way.
Consider a much more cynical view where people are strictly self-interested and use these tools to garner engagement and self-promotion. Good chance the meaning did not originate from the person. And now these people have tools to outsource their parasitic intentions.
Intent is hard to infer, so it seems better to assume good faith and judge the comment itself. Thinking aids might just lower the barrier for people to participate in technical discussions.
Will using a voice-to-text app to create my comment get me banned? Especially if it creates a transcription mistake that might be characteristic of an LLM
I wouldn't expect voice-to-text apps to produce anything that looks "Signature LLM" since it's still your words, your grammar, etc.. The occasional transcription mistake is unlikely to be an issue either, given the prevalence of humans here who use em-dashes, speak ESL, etc..
But where is the line? Is a spell checker okay? How about one that also suggests alternative wording?
I think, in the end, it is less about the tool you use and more about the purpose you use it for. It is more like when you use certain tools, you should be cautious about whether you are using them for the right purpose.
That said, I also wouldn't hate seeing an official playground where it is cordoned / appreciated for bots to operate. I.E., like Moltbook, but for HN...? I realize this could be done by a third party, but I wouldn't hate seeing Ycombinator take a stab at it.
Maybe that's too experimental, and that would be better left to third parties to implement (I'm guessing there's already half a dozen vibe-coded implementations of this out there right now) -- it feels more like the sort of thing that could be an interesting (useful?) experiment, rather than something we want to commit to existing in-perpetuity.
You could mirror article postings and upvotes to another site and let AI play around there - if it's interesting to people maybe it will gain a following. I don't see any reason it'd need to happen in this specific forum as that'd likely just cause confusion.
At the time being, at least, HN is a single uncategorized (mostly, lets ignore search) message board - splitting it into two would cause confusion and drastically degrade the UX.
Real talk: who is this guideline going to stop? People are already doing this and they will continue. Even if you find them, they’ll just make more accounts and continue.
Say what? It’s a genuine question. What is the actual repercussion for not following this?
It came up a few weeks ago. Show HN is already disabled for new accounts as of this week I think(?), but IMHO stricter measures need to be placed for account creation otherwise there’s no real enforcement.
YC funds a gazillion AI startups that expand and augment the AI slop pipeline, but would hate to experience the consequences. It's very much slop for thee but not for me
Sometimes life is also to let it express partial, unfinished ideas, opinions and maybe later let our brain refine them on its own tempo. It never has been uncommon.
If you discuss an idea with an AI and then close the AI window, turn to an editor, and write what the AI said from memory, that’s going to come across as AI-assisted writing and be unwelcome here.
If you discuss an idea with AI, then close the window and write a post about how you came up with the idea, got stuck, decided to ping an AI for unstuck-ness, describe how the AI’s response got you unstuck, and then continue writing about your idea, that’s not going to be necessarily treated as AI-assisted writing — but people are going to be extremely suspicious of you, because the perception is that 99.9% of people who use chatbots go on to submit AI-assisted writing. That’s probably more like 90% in reality but it’s something to be aware of as you talk about your experiences.
If you use AI in your process and don’t disclose it when writing about your idea and process, that’s generally viewed as lying-by-omission and if egregious enough you could end up downvoted, flagged, and/or banned (see also the recent video game awards / AI usage affair). Better to disclose it with due care than to hide it.
Imo AI tends to “fill in the blanks” of what you want to hear. It’s insidious in that regard because it will make a whole seemingly logical and consistent argument purely on what it thinks you want.
Except it’s bullshitting the whole time. While you think this is what you wanted to convey.
Not sure where I’m going with this, but my point is if I pasted this comment into ChatGPT it would make up an argument I never made to support my case that didn’t exist in the first place. Exploring things is useful but just be aware it’s designed to pull bs out of it’s ass and is distinctly not interested in exploring truth or having a real conversation
I've been noticing a _lot_ more AI-generated/edited content of late, both comments and stories. It's gotten to the point that I spend a lot less time on HN than I used to, and if it continues to get worse I expect I'll quit altogether.
At the end of the day, I'm here because of all the thoughtful commenters and people sharing interesting stories.
I don't get it. We use tools to assist in written communication all the time. If someone wants to ask an LLM to check their grammar or edit for clarity or change the tone, it's still a conversation between humans. Everyone now has access to a real time editor or scribe who can craft their message the way they want it to sound before sending it off. Great.
My personal interpretation of the rule is that if it's human-originated but passed through a layer of cleanup, it's human-originated. For the same reason I'm not refraining from running the spellchecker or using speech-to-text to generate this sentence. "If I could be having my English-speaking nephew type this on my behalf while I told him my thoughts in Japanese, it passes the smell test for human-sourced" feels about the right place to set the bar.
Yes but the guideline states that AI-edited comments should not be posted. It doesn't say it's okay as long as it's "human sourced" or "human-originated".
So if your layer of cleanup is AI assisted, then it's in violation.
Part of the problem I was getting at is that the requirement of "Don't post AI edited ..." is stricter than necessary to ensure the outcome that "HN is for conversation between humans" because an AI edited post is still a human post.
Anyway, I suspect a lot of people are going to ignore that guideline and will feel free to use their "layer of cleanup" whether it's a basic spellchecker or an LLM, or whatever else they choose, and most people aren't going to be able to tell anyway. The guideline is unnecessarily strict in my opinion, but it doesn't matter in the end.
My layer of cleanup is AI assisted. It's the spellchecker integrated into my web browser. That was definitely "AI" technology when it originally came out.
But I think you and I are on the same page: we both know this isn't a rule that's there to be hard-and-fast enforced because that's completely infeasible. The definition of "AI" is a moving target, as is "generated."
It's a rule that's there to have a rule so when the real problem is "Hey, your content is too low-quality but you dump volumes of it and it's clearly following a procedural template" the mods can call that "AI" and justify limiting or banning the account on prior-stated rules. Which is fine, but I'm glad to call it what it is.
(One unfortunate oversight: we haven't added "posts sounding like they are AI-generaed" to the "Please don't complain about" set. So expect that to become a common refrain now since the incentives to make the complaint against disliked comments are obvious... At least until that becomes annoying enough to justify a rule).
I'm more interested in the last layer than the first. People should feel fully accountable for what they post, like they could have done it exactly and completely by themselves if they'd simply taken more time.
I would enjoy a "block user" feature, to help this. I personally want to live in an online bubble of interesting thoughts. This seems close (or better, since people I enjoy can contradict my own flags) [1].
Haha, I feel the same way. I want to block and be blocked so I made this: https://overmod.org/
It's pretty easy to rewrite if you want. Just point Claude Code at the repo and go. But I think there's a little bit of network effects in that I want to subscribe to some trusted people's blocks too. But overall it's quite helpful. See how much fewer I get:
I'm torn on this. On one hand I do agree with your goal about wanting to live in a bubble of interesting thoughts. But on the other... I know I have my biases, and I'm sure I might end up blocking people who actually are insightful and interesting but either a) had an off day and shitposted, or b) says insightful things in ways that make me angry and get past my sense of reasonableness.
Good news, it doesn't block! It just puts a red mark next to their name, so you can put less effort into that comment, if you choose.
And, it's social. If someone you've marked green is also using this, and they marks someone green that you have marked red, then you'll see a contested red-green next to them, which is a good "you should probably reconsider" indicator.
On the other hand, shouldn’t there be a policy forbidding use of HN data for LLM training? I would certainly be more encouraged to participate, if I knew that the content I provide for free is not used to train LLM that is later sold by a company valued hundreds of billions. Perhaps there are others who feel the same.
I hate how easy AI has made outsourcing thinking. You can literally type fragments of a thought into $CHAT_ASSISTANT and get a super polished response back that gets you 99% of the way there. It's almost like we, collectively, looked at the final scene of WALL-E and decided "Yes! Gimme that!"
Is this true for you? How often do you get 99% of a complete, valuable thought?
My experience is that it is quite rare. Occasionally high 90's for simple things of low value, 60's or less for things that approximate "thinking". At best it feels like a new search channel that amalgamates data better, and hasn't been thoroughly polluted by ads and SEO - yet.
"just develop a personality" sounds like a shallow dismissal. Most comments in most threads could theoretically be autogenerated when given style samples of what fits on HN and what opinion to use
A personality hardly shows through in a handful of sentences, besides which, I'd rather judge comments by merit than by the personality of the poster (hacker ethics, point number 4: https://en.wikipedia.org/wiki/Hacker_ethic#The_hacker_ethics)
1) That the entering of LLMs onto the scene of communication implies that real human beings need to change their style as a result.
2) That nobody can make an LLM talk like Cleetus McFarland.
To me, "I know that text is AI-generated" accusation smacks of the "We can always tell" discourse in the transphobia space. It's untrue, distasteful, and rude.
What’s interesting to me is the number of commenters here making a case of the form “use your own words; grammar and spelling are not that important; we’ll know what you mean”, and yet it’s often the case that different discussions will often contain pedants going off-topic correcting someone else’s use of language.
Re-reading the HN guidelines, each seems individually reasonable, yet collectively I’m worried that they create an environment where we can take issue with almost anyone’s comments (as per Cardinal Richelieu’s famous quote: “Give me six lines written by the most honorable person alive, and I shall find enough in them to condemn them to the gallows.”)
Really, all the rules can be compressed into one dictum: don’t be an arsehole. And yet the free speech absolutists will rail against the infringement upon their right to be an arsehole. So where does that leave us? Too many rules leads to suppression of even reasonable speech, while too few leads to a “flight” of reasonable speech. End result: enshitification.
I think it's hilarious that whenever someone complains about it they're a luddite, and now this happens on a website that is filled with LLM enthusiasts who have done nothing but overpromise.
This isn't just a good idea -- it's a forward-thinking policy to ensure Hacker News remains a collaborative place to have meaningful discussions for years to come.
Great message...but gosh, can someone throw 15px of padding on that <td>? I know HN is supposed to be minimal, but I had to check the URL to confirm that this was a real page because of the odd design.
> Please don't complain about tangential annoyances—e.g. article or website formats, name collisions, or back-button breakage. They're too common to be interesting.
Now define useful, specifically in the context of a comment on hackernews.
An LLM summarizing the contents of a blog post might be useful to you, but is a comment here the right place for something you could geneate on your own?
I would guess for most people here, real insight or opinions from others is the "useful" aspect of reading hackernews comments.
Using LLMs to generate or refine comments only moves things further away from that goal (in my opinion).
To confess something I built just today a little cron that monitors HN for posts I might find interesting, pulls in some context about me, and proposes a reply. Just to help me find relevant posts and to kick start my thinking if I want to engage.
Today it flagged a post about an AI tool for HN and suggested I reply with:
"honestly, if you need an AI to sift through hn, you might be missing the point—this place is about the human touch. but hey, maybe it'll help some folks who just can't take the noise anymore."
So my AI, which I built specifically to sift through HN for me, is telling me to go flame someone else for doing that.
No deeper point here. I just thought it was really funny.
HN banning AI posts makes sense for keeping discussion human, but the line between assistance and automation isnt always clear. The goal should be protecting real conversation, not policing every tool a writer might use.
My expectations to dear fellow humans - more sophisticated personal insults (ex. give me your cute comments), a freudian slips, hidden messages and motives, first viewer experience with the next cool toy from the hype train, sharing all kind of insecurities, heavy f.. word if very dramatic first person experience happened, border line exposure to the insider info, sharing something your corporate HR gestapo wont appreciate but might help another guy on the line, "i knew the guy who actually did it" stories, motivational statement toward my non-native english, etc
AI comments are certainly bad for discourse on HN. But who's to be the judge of AI or human? Are you reading humanity's Jeff Dean or computerized Elon Musk? It's certainly a tricky situation to be in!
Sometimes I collect my comments here to run through my draft writing skill to see how it might shake out as part of a blog post. Doing the opposite would be weird. I earned that karma. It's mine to burn making bad posts.
Can we also add “Don’t complain about AI-generated content. It does not promote interesting discussion.”?
I see this all the time, and even if I find the topic interesting, I don’t want to see comments littered with discussion about how the content was AI generated.
To be clear, I'm not condoning AI-generated content. I’m completely fine if the community chooses to not upvote AI-generated content, or flagging it off the FP.
But many threads can turn into nothing but AI complaints, and it’s just not interesting.
From my experience, it usually happens when people are too brazen about it, with boring stuff like "Interesting! Now here's what Gemini said about the above..". IMHO that is an entirely adequate reaction.
I’m mostly referring to responding to the article itself (allegedly) being AI-written. Then the top half of the thread is derailed by a discussion about the article itself being AI-written.
What if there was a voluntary indication of LLM content? Like, you press a checkbox "yes, I'm going to post some content that is partially or fully created by AI", and there would be a visible mark "slop" next to a post/comment.
Welcome change, there is enough AI slop on the internet already.
I come here for thoughtful discussion, a break from the relentless growing proportion of ai slop emails I get from people clearly vibe working.
Not edits for tone or clarity, 400+ word emails full of LLM BS they clearly haven’t checked or even understood what they have sent. Annoyingly this vibe slop is currently seen as a good KPI.
You're all a bunch of tedious ignoramuses, your own fields of studying notwithstanding. I'm out here face-to-face with the Bullshit Asymmetry Principles. I'm not about to give up the only leverage I have!
The fact of the matter is that there're not hours enough in the day to read, in realtime, to each and every one of you the reams they've written on why you're wrong. Do I have to establish a tag-team?
The fact is that I've spent thousands upon thousands of hours painstakingly collating the perspectives that I'm now delivering to you—I am a river to my people. And it's only because they pass under the bridge of an LLM that they're objectionable?
This is a bit like challenging your plumber for charging you over a minute's fix, when they've spent 20 years getting it down to that minute.
The work's been done. You're paying for the outcome.
Edit: All fresh off the top of my head, folks.
Ah, that reminds me: I wouldn't feel compelled to do all this refutation if radical reactionary political extremism was properly moderated.
AI does not have LONG context, Long Term Memories or LONG intentionality -its not aware and it can't remember the plot without being spoonfed the details each time from scratch.
Its like an amnesic genius who once he already wrote a masterpiece and keeps cycling, and looses his train of thought after some fixed amount of time.
This groundhog day effect is mitigated in some respects by code -we create key-value memories and agents and stores and countless ways to connect agents via MCP and platforms/frameworks like A2A and the like but until we solve that longer lived instance problem we won't be able to trust these systems without serious HITL (human oversight)
I think we need models that update their own weights and we need some kind of awareness cycle rather than just a forward pass inference run with a bigger context window
These days, I've noticed that lobsters feels a lot more genuine to me, like hn was a few years ago. These days it feels like hn is bland and homogeneous, which I suspect is due to LLM-written comments.
In my experience every English-language online forum not rooted in some project or community external to the forum (e.g. an open source project's forum or a local club's forum) devolves into anger, cynicism, and American political partisanship. I suspect that the people who like discussing these feelings are more numerous than the spaces that want to discuss them and so any open forum fills up with their posts. Lobste.rs's unique rules and moderation culture results in a particular manifestation of symptoms but the disease is the same.
I picked up lobsters last month, and I started to appreciate it much more because of the lack of generated comments. It has a anti-LLM slant, and they have their own moderation challenge (everything is getting tagged as vibecoding - which makes the tag lose meaning). But the comments are noticeable not-slop.
Same as all the other guidelines. Moderators look at the threads and act on what we see. We also look at lists of flagged comments, and emails sent to hn@ycombinator.com by community members. One-off offending comments are flagged+killed, and a warning given. Repeat offenders/obvious bots are banned.
At some point might internet text will just be recognized as meaningless drivel both to bots and humans? a.k.a. dead internet theory... I am curious what organizations would benefit from this. i.e. Who lost legitimacy when the internet became a popular way for people to communicate ideas?
Sure, ban everyone that uses em dashes from the digital commons. That will certainly stop the existential threat to your livelihood.
Sarcasm aside—there is no reliable way to prove this. So it begs the question: you really care if something is AI generated? Or is this just an another excuse to silence people you don’t like?
You know, those people. The ones who didn’t win a full ride to <prestigious university> or pay a fortune for a sheet of paper. The ones who haven’t spent thousands of man hours handcrafting a <free-and-open-source-cloud-native-hypermedia-aware-RESTful-NoSQL-API> framework implemented in Rustfuck, a new language that you made in your free time that borrows from Rust and Brainfuck (but they wouldn’t know about it).
(this is to anyone reading, mostly rhetorical, not dang in particular)
Without someone actually saying as such, we only have stuff like emdashes and specific word patterns to go by. And someone even moderately vested in hiding AI in plain sight will coach the LLM to use common vernacular.
And with LLMs making blog posts as diss tracks... damn, who knows what this world is coming to.
But the whole "Only Humans, we dont serve YOUR KIND (clanker) here" is purely performative.
The obvious way to keep human spaces is via webs-of-trust.
If you play bluegrass or old time (or beopop or hip-hop / proto-hip-hop) or other traditional styles of music where the ensemble is a de facto web-of-trust, join us on pickipedia to build and strenghten it. https://pickipedia.xyz/
This is a bit sad. The kind of people who post AI generated comments to farm reputation or exert undue influence will not be discouraged by politely asking them to stop. It's a toothless request that will only encourage people who clumsily police each other.
Without some kind of private proof of personhood enforced at the app level, this means nothing.
This seems like an overcorrection. There is a vast difference between someone copy and pasting from an LLM and using one to correct their English or improve their writing ability.
Rules like this seem to me more like fomenting witch hunting of "AI comments" than it is about improving the dialogue. Just about any place I've seen take this hardline stance doesn't improve, it just becomes filled with more people who want to want to pat each other on the back about how bad AI is.
Just my two cents. I don't filter my comments through any AI, but I am empathetic for people who might have great use of them to connect them to the conversation.
The link doesn't work perfectly for me, it seems that since the page is already scrolled down all the way to the bottom, there is no way to focus specifically on the #generated element.
> Fair enough — I've been lurking since 2019 and picked a bad day to start commenting on everything at once. Not a bot, just overeager. I'll pace myself.
I won't name where and which one for the obvious reason that you can and should learn to know better, but I observed a comment that was obviously and blatantly copypasted from an agent, with all the signature "it's not just X, it's Y" patterns, the emdash abuse, the "In summary,' section, generating dozens of replies in organic engagement from people who genuinely couldn't tell the difference between a real comment and an aggregation of a prompted, synthetic response.
Whatever happened to "knowing is half the battle?" Why do we accept this kind of intellectual laziness as exemption from a duty to learn and know better?
This rule actually says "Don't admit when you are using AI to generate comments and don't admit when you are an AI"
I know it's cynical, but this is as meaningful as reddit's "upvote/downvote is not an agree/disagree or like/dislike button"
People may hate that this is true, but I cannot logically reason out how a rule like this could work. I think it's better to just accept that AI is now part of the circle, until we can figure out a "human check".
I wonder if the rule will be enforced. I see a lot of liberal / socialist / communist / anti Trump / Democratic Party politics in here even though the rule says that “Off-Topic: Most stories about politics”.
It's not at all obvious because there's more than one way to go about it. Obviously entirely outsourcing is bad. Whereas working cooperatively seems highly beneficial to me.
Google search has been getting progressively worse for technical topics for at least the past decade. Now suddenly they started providing a free tutor capable of custom tailoring graduate level explanations of technical topics for me on demand. The difference is night and day.
You can still learn from sources that have errors. Many textbooks have mistakes and false information in them, but that didn't stop them from providing educational value to people.
When textbooks are incorrect it is also with great confidence. If you can't spot logical inconsistencies in the material were you actually learning or merely memorizing?
Sure there's more than one way to go about it, but what matters is how people typically do go about it.
And certainly individuals can make their own decision to engage with an LLM in positive, self-thought-provoking ways, but it's still useful to understand how people generally do use them in the real world.
Sure, so we shouldn't assert that with confidence, but I think it's safe to guess that, for most people's use, that is probably the case.
Yes, some people (see some sibling commenters) do engage with an LLM in ways that might make them more thoughtful, but I have a hard time believing that's the common case.
Personally I stopped using LLMs much from around 6 months ago. I was using them regularly prior to that.
I noticed these dimensions of myself increased:
- Patience
- Focus
- Ability to hold concepts and reason for longer
and other related qualities improved.
My personal experience tells me they do degrade or hinder oneself from operating maximally. Some may be more sensitive than others - we aren't all the same.
But one thing for sure - younger generations will be more sensitive as they are already exposed to products that are designed to erode their self-control.
I think it really depends on the how. Engaging with it in a socratic debate-style argument [1] if no fellow human is available might very much support your thought process. On the other hand, just obtaining the solution to one‘s homework/problem/task/… won‘t be very beneficial for one’s development. The latter is sadly much more convenient and probably accounts for most of the usage. I remember a saying about the mind being a muscle: in order to keep it in good shape, you have to use it actively.
Agreed. In my case, I think I have found the opposite. At least, I find myself thinking hard about things more, now that I have started working hand in hand with AIs on different projects. Which is probably enhancing my cognitive ability, not degrading it.
This captures the problem, the sycophancy / preference optimization deludes people into thinking they’re on to something and posting things that don’t contribute to the discussion. It’s the “I drive better when I’m drunk” syndrome, it’s better just to outright ban it than to leave it to people’s judgement.
Another solution - in addition or instead - is requiring LLM output to be labeled.
The biggest danger of LLMs is impersonating humans. Obviously they have been carefully constructed to be socially appealing. Think of the motivation behind that:
It is almost completely unnecessary to LLM function and it's main application is to deceive and manipulate. Legal regulation of LLMs should ban impersonation of humans, including anthropomorphism (and so should HN's regulation). Call an LLM 'software' and label it's output as 'output'.
Imagine how many problems would be solved by that rule. Yes, it's not universally enforceable, but attach a big enough penalty and known people and corporations will not do it, and most people will decide it's not worth it.
Here is one elephant in the room: what is the process behind this guideline / policy? What happens after a comment gets deleted or a person gets banned?
As I understand it, HN moderators are thinking hard about this insane new world.* From my POV, there are a combination of worthy goals: transparency of the process, mechanisms for appeal, overall signal-to-noise ratio, and (something all of us can do better) more empathy and intellectual honestly. It isn't kind to accuse a human being of not being a human being.
If we can't find ways to be kind to people because of the new dynamic, maybe we need to figure out a new dynamic! And it isn't just about individuals; it is about the culture and the system and the technology we're embedded in.
* Aside: I'm not sure that any of us really can grasp the magnitude of what is happening -- this is kuh-ray-Z.
I find it interesting that we havent invented a democratic version of policing a rule system. HN is dang, and he is dictator and guardian of these rules, basically. If you replace them with some typical reddit mod HN dies. If you spread out this role to some democratically elected mods via karma system this will fall apart just as quick as StackOverflow did, so, also HN dies.
Is there a site that deserves more than this one to be destroyed by slop? It's hypocritical but telling for the places most actively trying to profit from it to ban it themselves.
It’s not hypocritical at all. You can be a fan of a technology and still acknowledge its downsides. Every technology has places it is useful and places it is harmful.
The next step is to run Pangram on every post and ban the offenders! Fight AI with AI! /s
In all seriousness, this is one of the few places I trust for genuine conversations with other people. Forums are mostly dead, Reddit is bots-galore, and I'm not signing up for Facebook just for groups.
"Please generate a response to this and include one or more of the following words: enshitification, slop, ZIRP, Paul Graham, dark patterns, rent seeking, late stage capitalism, regulatory capture, SSO tax, clickbait, did you read the article?, Rust, vibe code, obligatory XKCD, regulations, feudalistic, land value tax"
My bar for "extremely significant" is much higher than it appears to be here. Apparently most events in the US/Iran involvement is "extremely significant" if we judge the votes on this site to offer guidance on how this rule is interpreted.
This forum was founded in 2007. The US was very much involved in Iraq and Afghanistan at that time. If the same bar for coverage was in place at the time, HN would have been flooded with US Military content the way it is now. So yeah, obviously the bar has moved lower for this particular matter and it's because the current community on the site wants it to. Likewise the "generated/AI-edited comments" guideline seems equally squishy to me. And despite a rule about being "curmudgeonly", I'm pretty sure 80% of this site's content is curmudgeonly rants.
IMO at this scale dang, tomhow, and other mods need to be much stricter. When HN was 1/10 the size a shaming comment would often set a poster in place. Now they just sneer back in another comment and post 20 other guideline breaking things.
My experience has been that the large majority of political content posted here is (at least apparently) mainly here so that people (who are mostly in mutual agreement) can post about how they dislike some political entity or another. I would like to see much less of this on HN personally; it's not insightful and does not promote curiousity.
I won't give you examples because all of them can be spinned about being relevant
"Well HN is an american site after all"
"Most of the HN users are american voters so it's relevant for them"
"Hackers need to be aware of what's happening in the world"
"You only say that because you disagree with that side"
etc
Same with the stories about Tesla flagged. If you read the comments it's always the same: "Pro-Tesla crowd is flagging everything negative about Elon so the bad news never reach the front page" vs "Anti-Tesla crowd flagging everything because they hate Elon"
HN is the best without politics. But it's not up to me.
I don’t respond to specific comments with accusations, because I can’t prove it and it would suck to be falsely accused. But I find it really depressing to watch deep comment threads with someone debating with an AI. The human is putting so much effort in, and the AI is responding with all these well-written but often flawed arguments. I wish I could do something to save that person from that interaction.
Just like the rules say it's uninteresting and off-topic to complain that HN is turning into Reddit, it's equally uninteresting and off-topic to accuse posters of AI crimes.
And everyone's personal AI detector has a ridiculously high false-positive rate.
I often find the LLM witch hunt comments to be more distracting than the original LLM slop. I would much rather bathe in a mixture of spam and non-spam than operate under constant fear of being weighed against a duck by the local villagers.
We can now that it's an actual guideline. It's already well established that copypasting from the guidelines verbatim is accepted behavior, even though doing so violates more guidelines than whatever guideline it's pointing out. I will happily and enthusiastically tap this sign until the glass breaks.
You're absolutely right! Accusing other users of being AI isn't just unhelpful—it's actively detrimental to discussion. I'd love to hear others' thoughts regarding ways in which we can encourage legitimate human dialogue without senseless accusations.
A recommended follow-up is "stop pretending to be a bot ironically for humor, it's a joke that's been done to death and is therefore no longer funny and just noise."
> Please don't post insinuations about astroturfing, shilling, brigading, foreign agents, and the like. It degrades discussion and is usually mistaken. If you're worried about abuse, email hn@ycombinator.com and we'll look at the data.
"Don't post comments that are not human originated at this time. We want to see your human opinion shine through."
This gives people some amount of leeway and allows just rhe right amount of exceptions that prove the rule.
(That said, to be frank, some of the newer better behaved models are sometimes more polite and better HN denizens than the actual humans. This is something you're going to have to take into account! :-P )
Why would "human originated" be a better place to draw the line than "no generated/AI-edited comments"?
Like, I'm sure that AIs technically can write non-crap HN comments, but they rarely do. Even if it was less rare, the community that resulted from fostering AI-generated content would be unappealing to a lot of people, myself included. The fact that information here is the result of real people with real human opinions conversing is at least as important to me as the content being posted.
To begin with, some people have handicaps and use AI for assist. Other times people use AI for research. Finally, in general, when it comes to guidelines, making the lines slightly fuzzy makes enforcement more practical and believable.
It'd be silly if the rule gets interpreted such that people aren't allowed to do research with modern tools, and only gut takes are permitted.
I'm sure that's not the intent!
I think the important part is to have the human voice come through, rather than -say- force humans to run their text through an ai-detector first. (Itself an ai editing tool!)
Honestly, I think "human originated" is the only rule that actually matters because we can't stop LLMs from sounding smart anyway. If you wait for a technical ban on AI-generated text, you're just playing catch-up with tools that already pass as human.
The real point isn't stopping bad grammar, it's preserving the vibe. HN feels different because it's messy humans arguing, not optimized algorithms trying to be helpful.
Once we allow "good enough" AI content, the community stops feeling like a town square and starts feeling like a customer service chatbot. We need real people with actual stakes in their opinions, not just perfect outputs. Let's keep it human or leave it.
This comment may or may not have been generated with an LLM, but I won't tell and you can't prove it either way.
These are guidelines. I'm sure asking an AI about your comment (not pasting its text, so it's still your words) isn't an issue. The main target is obvious slop like https://news.ycombinator.com/threads?id=patchnull
I didn't say I hate it. But I do think that there's a lot of overlap between people who feel overwhelmed with A.I. Slop and people who felt overwhelmed with crypto-FOMO back when there was such a thing.
My analysis could lead to "it's doomed" or "it's a gateway drug that expands the crypto market".
LLMs were -in part- designed as translation tools. It's one thing they do really really well.
https://arxiv.org/html/1706.03762v7 (Attention is all you need) "Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train."
Ok, looking that up, that was quite literally one of the main design goals.
And they're really quite good at translating between the languages I use. They're the best tool for the job.
I strongly doubt it. My AIs can generate infinite HN comments for me. I don’t do that because it isn’t interesting. But if the day arises where it is, I want that personalized content. Not something someone else copy pasted.
(I say this as someone who finds Moltbook fascinating and push myself to use AI more in my work and day-to-day life. The fact that it’s borderline trivial to figure out which HN comments are AI generated speaks to the motivation behind this guideline.)
> AI is a great equalizer when it comes to communication in English.
Good argument for it but I think 80/20 split applies here. It is likely that 80% of the time it is used to farm for upvotes and add noise.
> And despite what people say, the way you write is very much judged as an indication of your education and intelligence.
I have come across plenty of content and online interactions in English where English was the Author's 2nd or even 3rd language and I find that putting a small disclaimer about this fact is more than enough to bypass such judgement.
Good point. There is a difference between using AI as a translator and using AI to write comments from scratch... Maybe the HN guide lines could reflect this.
Edit for amichail, since I'm rate-limited at the moment: I don't want flawless English writing. I want real ideas from real people. If I wanted flawless English writing, I'd be reading The New Yorker, not HN.
Perhaps not. But if it reduces the junk right now, it's a good policy for right now. I'll take it, for now. If it needs revisited, then it should be revisited when circumstances change enough to warrant that.
This is going to be a tough ask. I am with this 100% for "ai generated" but not "ai edited". What if I'm using AI for spellchecking or correcting bad grammar? what if it is an accessiblity-related use case? or translation?
It's just a tool ffs! there are many issues with LLM abuse, but this sort of over-compensation is exactly the sort of stuff that makes it hard to get abuse under control.
You're still talking with a human!, there is no actual "AI" you're not talking to an actual artificial intelligence. "don't message me unless you've written it with ink, on papyrus". There is a world of difference between grammarly and an autonomous agent creating comments on its own. Specifics, context, and nuance matter.
I have a kid with severe written language issues, and the utilisation of STT w/ a LLM-powered edit has unlocked a whole world that was previously inaccessible.
What is amazing is it would have remained so just a couple of years ago!
Agreed... there's often other perspectives people never thought of like this, which is why they say "strong opinions about issues do not emerge from deep understanding."
Even if you're just inexperienced in the language you're communicating in and are trying to have better conversations, it's very helpful.
For cases like that, I say just don't tell people... I think it's unlikely anyone will be able to tell either way.
> Otherwise please use the original title, unless it is misleading or linkbait; don't editorialize.
the title being the changelog is still probably the better choice because the discussion here and linked are about guidelines in the page rather than absolute rules or a discussion about the title alone.
Many of the other guidelines have exceptions too, and various strengths. E.g. "Throwaway accounts are ok for sensitive information..." is a pretty weak guideline in practice while "If the title contains a gratuitous number or number + adjective..." is often over-enforced by automatic tooling and stuff like "Please don't use uppercase for emphasis..." CAN sometimes just make sense where a use of italics might easily get missed WHILE OTHER TIMES BEING THE REASON THE GUIDELINE WAS ADDED.
If you want your comments to sound more human — stop using em dashes everywhere. LLMs love them — along with neat structure, “furthermore”-style transitions, and perfectly balanced paragraphs.
Humans write a bit messier — commas, short sentences, abrupt turns.
I think em-dashes were once a reliable indicator (though never proof), but recent models have been fine-tuned to use them much less. Lots of recent AI-generated writing I've seen doesn't have em-dashes. Meanwhile, I've heard many people say that they naturally use em-dashes, and were already and/or are afraid of being accused of AI; so ironically this rumor may be causing people to use their own voice less.
Before, I naturally used hyphens as if they were em-dashes. The kerfuffle over LLM use of em-dashes motivated me to figure out how to type them properly (and configure my system to make that easier). Now I even go over old writing to fix the hyphens.
Forum mechanics have always shaped discourse more than policies. Voting changed everything. The response to LLMs should be mechanical not moral — soft, invisible weighting against signals correlated with generated text. Imperfect but worth the tradeoff, just like voting.
What a welcome post. The whole reason I come here is to get thoughtful input from smart people, and not what I could get myself from an LLM. While we are at it; Think your own thoughts as well :) I know how easy it is to "let it come up with a first draft" and not spend the real effort of thinking for yourself on questions, but you'll find it's a road to perdition if you let yourself slip into the habit. Thanks to all the humans still here!!
Same here, and similarly, I come here to find interesting submissions from smart people. I want to read their own thoughts in their own words, not what an LLM has to say. I'm capable of prompting my own LLM with their prompts if they'd supply them.
It would be great if we could have some kind of indicator that a submission is AI output, perhaps a submitter could vouch that their submission is AI or not, and if they consistently submit AI spam, they have their submission ability suspended or get banned.
Totally agree with you. I come here to read comments made by humans. If I need to read comments made by AI Bots I would go to Twitter or reddit, both made me not read the comments section entirely.
Or https://clackernews.com/.
You sound like you're a bot lol
Funny, I was debating posting a note thanking the HN staff myself for adding this to the comment guidelines but I don't think it's possible to write one without sounding at least a little bit like a bot...
I actually do something similar on my personal site using this note that includes a purposeful typo: https://jasoneckert.github.io/site/about-this-site/
I'm hoping people catch that typo after reading "every single word, phrase, and typo (purposeful or not)" and smiled every time I've had someone post a PR with a fix for it (that I subsequently reject ;-)
great idea, but seems a little futile if there is no protection agains llms training on HN comments. ironically, if HN can succefully prevent llm content, it will become one of the best sources available for training data
Not really. Because the biggest problem with LLMs is that they can't right naturally like a human would. No matter how hard you try, their output will always, always seem too mechanical, or something about it will be unnatural, or the LLM will go to the logical extreme of your request (and somehow manage to not sound human)... The list goes on.
Quite! It's very easy to send a HN link to one of our new artificial friends to see what they have to say about it. Subsequently publicly posting the inference variation you receive strikes me as very self-centered. Passing it off as your own words - which the majority seem to - is doubly bizarre.
It's very funny to imagine people prompting: "Write a compelling comment, for me, to pass off as my thoughts, for this HN news thread, which will attract both upvotes and engagement.".
In good faith, per the guidelines: What losers!
I agree with much of what you say, but it isn't as simple as "post to LLM, paste on HN". There are notable effects from (1) one's initial prompt; (2) one's phrasing of the question; (3) one's follow-up conversation; (4) one's final selection of what to post.
For me, I care a lot about the quality of thinking, as measure by the output itself, because this is something I can observe*.
I also care -- but somewhat less -- about guessing as to the underlying generative mechanisms. By "generative mechanisms" I mean simply "Where did the thought come from?" One particular person? Some meme (optimized for cultural transmission)? Some marketing campaign? Some statistic from a paper that no one can find anymore? Some dogma? Some LLM? Some combination? It is a mess to disentangle, so I prefer to focus on getting to ground on the thought itself.
* Though we still have to think about the uncertainty that comes from interpretation! Great communication is hard in our universe, it would seem.
Taking the time to write something, and read over it is a better skill than asking an LLM to do it for you.
Also, quality doesn't come from any of those points you've mentioned. Quality comes from your ability to think and reason through a topic. All those points you mention in your first paragraph are excuses, trying to make it seem like there was some sort of effort to get an LLM to write a post. It feels like fishing for a justification
>Taking the time to write something, and read over it is a better skill than asking an LLM to do it for you.
Furthermore, if someone doesn't think whatever they're saying is worth investing the time to do this, it's a signal to me that whatever they could say probably isn't worth my time either.
I don't know why this isn't a bigger part of the conversation around AI content. It shows a clear prioritization of the author's time over the readers', which fine, you're entitled to valuing your own time more than mine, but if you do, I'll receive that prioritization as inherently disrespectful of my time.
> Taking the time to write something, and read over it is a better skill than asking an LLM to do it for you.
Yes, this is a great skill to have: no argument from me. This wasn't my point, and I hope you can see than upon reflection.
> All those points you mention in your first paragraph are excuses, trying to make it seem like there was some sort of effort to get an LLM to write a post.
Consider that a reader of the word 'excuses' would often perceive an escalation of sorts. A dismissal.
> Quality comes from your ability to think and reason through a topic.
That's part of it. Since the quote above is a bit ambiguous to me, I will rephrase it as "What are the factors that influence the quality of a comment posted on Hacker News?" and then answer the question. I would then split apart that question into sub-questions of the form "To what extent does a comment ..."
- address the context? Pay attention to the conversational history?
- follow the guidelines of the forum?
- communicate something useful to at least some of the readers?
- use good reasoning?
One thing that all of the four bullet points require is intelligence. Until roughly ~2 years ago, most people would have said the above demand human intelligence; AI can't come close. But the gap is narrowing. Anyhow, I would very much like to see more intelligence (of all kinds, via various methods, including LLM-assisted brainstorming) in the service of better comments here. But intelligence isn't enough; there are also shared values. Shared values of empathy and charity.
In case you are wondering about my "agenda"... it is something along the lines of "I want everyone to think a lot harder about these issues, because we ain't seen NOTHING yet". I also strive try to promote and model the kind of community I want to see here.
You missed something much more important than all 4 of those points:
- what does the human behind the keyboard think
If you want us to understand you, post your prompts.
Some might suggest that the output of an LLM might have value on it's own, disconnected from whatever the human operating it was thinking, but I disagree.
Every single person you speak with on HN has the same LLM access that you do. Every single one has access to whatever insights an LLM might have. You contribute nothing by copying it's output, anyone here can do that. The only differentiator between your LLM output and mine, is what was used to prompt it.
Don't hide your contributions, your one true value - post your prompts.
The prompt & any follow-ups do have notable effects, but IMO this just means that most of actual meaning you wanted to convey is in those prompts. If I was your interlocutor, I'd understand you & your ideas better if you posted your prompts as well as (or instead of) whatever the LLM generated.
Sure, I agree that getting something you want (top post) out of an LLM isn't zero-effort.
But this isn't about effort. This is about genuine humanity. I want to read comments that, in their entirety, came out of the brain of a human. Not something that a human and LLM collaboratively wrote together.
I think the one exception I would make (where maybe the guidelines go too far) is that case of a language barrier. I wouldn't object to someone who isn't confident with their English running a comment by an LLM to help fix errors that might make a comment harder to understand for readers. (Or worse, mean something that the commenter doesn't intend!) It's a privilege that I'm a native English speaker and that so much online discourse happens in English. Not everyone has that privilege.
> This is about genuine humanity.
The meaning of the word genuine here is pretty pivotal. At its best, genuine might take an expansive view of humanity: our lived experience, our seeking, our creativity, our struggle, in all its forms. But at its worst, genuine might be narrow, presupposing one true way to be human. Is a person with a prosthetic leg less human? A person with a mental disorder? (These questions are all problematic because they smuggle in an assumption.)
Consider this thought experiment. Consider a person who interacts with an LLM, learns something, finds it meaningful, and wants to share it on a public forum. Is this thought less meaningful because of that generative process? Would you really prefer not to see it? Why?
Because you can point to some "algorithmic generation" in the process? With social media, we read algorithmically shaped human comments, many less considered than the thought experiment. Nor did this start with social media. Even before Facebook, there was an algorithm: our culture and how we spread information. Human brains are meme machines, after all.
Think of human output as a process that evolves. Grunts. Then some basic words. Then language. Then writing. Then typing. Why not: "Then LLMs"? It is easy to come up with reasons, but it is harder to admit just how vexing the problem is. If we're willing, it is way for us to confront "what is humanity?".
You might view an LLM as an evolution of this memetic culture. In the case of GPT-OSS 120b, centuries of writing distilled into ~60 GB. Putting aside all the concerns of intellectual property theft, harmful uses, intellectual laziness, surveillance, autonomous weapons, gradual disempowerment, and loss of control, LLMs are quite an amazing technological accomplishment. Think about how much culture we've compressed into them!
As a general tendency, it takes a lot of conversation and refinement to figure out how to communicate a message really well to an audience. What a human bangs out on the first several iterations might only be a fraction of what is possible. If LLMs help people find clearer thinking, better arguments, and/or more authenticity (whatever that means), maybe we should welcome that?
Also, not all humans have the same language generation capacity; why not think of LLMs as an equalizer? You touch on this (next quote), but I am going to propose thinking of this in a broader way...
> I think the one exception I would make...
When I see a narrow exception for an otherwise broad point, I notice. This often means there is more to unpack. At the least, there is philosophical asymmetry. Do they survive scrutiny? Certainly there are more exceptions just around the corner...
This. LLMs are an autocomplete engine. They aren't curious. Take your curiosities and use your human voice to express them.
The only reason you should be using an LLM on a forum like this is to do language translation. Nobody cares about your grammar skills, and there really isn't a reason to use an LLM outside of that.
LLMs CANNOT provide unique objectivity or offer unknown arguments because they can only use their own training data, based on existing objectivity and arguments, to write a response. So please shut that shit down and be a human.
Signed, a verified/tested autistic old man.
cheers
> Nobody cares about your grammar skill
One thing that impressed me about HN when I started participating is how rarely people remark on others' spelling or grammatical mistakes. I myself have been an obsessive stickler about such issues, so I do notice them, but I recognize that overlooking them in others allows more interesting and productive discussions.
This resonates with me. Intent is hard to infer, so it seems better to engage with the content itself. Most ideas are recombinations of earlier ones anyway—the interesting part is the push and pull of refining thoughts together.
Many programmers believe that math is the best way to solve problems or order the world or whatever. There are lots of real 20 year olds out there using chatbots to "optimize" their humanities learning, or to "optimizing" using dating apps. It's a fact about this audience. Some people have a very myopic point of view, however, it coheres with certain cultural forces, overlapping with people of specific ethnic heritages, who are from California and New York, go to fancy school and post online, to earn tons of money, buy conspicuous real estate, date skinny women and marry young.
These aren't the marina bros, they're the guys who think they're really smart because they did well in math. They are using LLMs to reply to people. They LOOK like you. Do you get it?
You are missing the point here.
It is not about whether the comment was written by AI, a native English speaker, English major, or ESL.
What matters is an idea or an opinion. That is all what matters.
To follow the pattern of your comment: You are missing the forest for the trees. Like many things, the difference between theory and practice matters here. In theory the only thing that matters is the idea. In practice the context and human element matters AND a culture of ai text could very much reduce the bar for quality.
An equivalent overly-pure reductive mistake is "why do you need privacy if you aren't doing anything wrong".
I try to "think my own thoughts" but then I see them elsewhere all the time.
My twitter bio has been "Thoughts expressed here are probably those of someone else." for over half a decade.
> The whole reason I come here is to get thoughtful input from smart people
I don't wanna be a party pooper here, but you will be lucky if the input satisfies one of those conditions. Getting input with both those attributes on HN is like finding life on Mars.
Remember to upvote good comments!
I think the situation is better in small discussions, that sometimes are lucky and get more technical.
Once a discussion reach 100 or so comments, most of the time the discussion is too generic, but there are a few hidden good comments here and there.
I'm here to read what actual humans think. If I wanted to read what an LLM thinks, I could just ask it.
But here's where it gets tricky: Do I prefer low-effort, off-the-top-of-my-head reactions, as long as it is human? Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
Am I here to read authentic humans because I value authenticity for its own sake (like preferring Champagne instead of sparkling wine)? Or do I value authentic human output because I expect it to be of higher quality?
I confess that it is a little of both. But it wouldn't surprise me if someday LLM-enhanced output becomes sufficiently superior to average human output that the choice to stick with authentic human output will be more painful.
> Do I prefer low-effort, off-the-top-of-my-head reactions, as long as it is human? Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
For me it's the first one every time. If only because LLM don't learn from responses to it (much less so when the response is to a paste of their output). It's just not communication. From that perspective, the quality of even the most brilliant LLM output is zero, because it's (whatever high value) multiplied by zero.
Even a real person saying something really horrible and too dense to learn from any response at least gives me a signal about what humans exist. An LLM doesn't tell me anything, and if wanted the reply of an LLM, I would simply feed my own posts into an LLM. A human doing that "for me" is very creepy and, to my sensibilities, boundary violating. Okay, that may be too strong a word, but it feels gross in a way I can't quite put my finger on, but reject wholeheartedly.
> Do I prefer low-effort, off-the-top-of-my-head reactions, as long as it is human? Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
This is an artificial dichotomy. HN’s guidelines specify thoughtful, curious discussion as a specific goal. One-off / pithy / sarcastic throwaway comments are generally unwelcome, however popular they are. Insightful responses can be three words, ten seconds to write and submit, and still be absolutely invaluable. Well-thought-out responses are also always appreciated, even if they tend to attract fewer upvotes than a generic rabble-rousing sentiment about DRM or GPL or Apple that’s been copy-pasted to the past hundred posts about that topic. But LLM-enhanced responses are not only unwelcome but now outright prohibited.
Better an HN with fewer words than an HN with more AI writing words. We’ve been drowned in Show HN by quantity as proof of why already.
But what if it turns out that human+LLM can produce more "thoughtful, curious discussion" than human alone?
That's the dichotomy: Do we prefer text with the right "provenance" over higher quality text?
[Perhaps you'll say that human+LLM text will never be as high-quality as human alone. But I'm pretty sure we've seen that movie before and we know how it ends.]
That said, you're right that because human+LLM is so much more efficient, we'll be drowning in material--and the average quality might even go down, even if the absolute quantity of high-quality content goes up.
I think, in the long term, we will have to come up with more sophisticated criteria for posting rather than just "must be unenhanced human".
> what if it turns out that
HN need not offer itself up as a Petri dish for AI writing experimentation. There are startups in that space, and at least one must be YC-funded, statistically speaking. Come back with the outcomes of the experiment you describe and make a case that they should change the rule. Maybe they will! As of today, though, they are apparently unconvinced.
> the average quality might even go down
We have a recent concrete analysis of Show HN indicating support for this possibility, resulting in the mods banning new users for posting to Show HN (something they’ve probably been resisting for close to twenty years, I imagine, given how frequent a spam vector that must be).
> Perhaps you’ll say that human+LLM text will never be as high-quality as human alone
Please don’t put words in my mouth, insinuating the tone my reply before I’ve made it, and then use that rhetorical device to introduce a flamebait tangent to discredit me with. I’ve made no claims about future capabilities here and I’m not going to address this irrelevance further.
> in the long term, we will have to come up with more sophisticated criteria
Our current criteria seem sophisticated already. Perhaps you could make a case that AI-assisted writing helps avoid guideline violations. This one tends to be especially difficult for us all today:
”Please respond to the strongest plausible interpretation of what someone says, not a weaker one that's easier to criticize. Assume good faith. Eschew flamebait. Avoid generic tangents.”
> Please don’t put words in my mouth, insinuating the tone my reply before I’ve made it, and then use that rhetorical device to introduce a flamebait tangent to discredit me with. I’ve made no claims about future capabilities here and I’m not going to address this irrelevance further.
I apologize--the "you" I meant was the person currently reading my post, not the person I was replying to. I was merely trying to answer a common objection that I've heard.
> HN need not offer itself up as a Petri dish for AI writing experimentation.
I'm not sure HN has a choice. I don't think we can prevent posters from experimenting with LLMs to post on HN--even if they adhere to the guidelines. For example, can I ask the LLM to come up with the strongest argument it can and then re-write it in my own words? That seems to be allowed by the guidelines. Would someone even be able to tell that's what I did? [NOTE: I did not do that.]
I think you're arguing that we should not encourage even more use of LLMs on HN. I get that. But I feel like that this community is uniquely qualified to search for better solutions.
> Our current criteria seem sophisticated already.
I hope you're right! That implies that you believe the current guidelines are sufficient to keep HN as the place we all love despite the assault from LLMs. I'm skeptical, but I've been wrong plenty of times!
> I don't think we can prevent posters from experimenting with LLMs to post on HN
And yet, she persisted, we will still set guidelines; so that people know they’re unwelcome to do so when they do, so that they can’t argue that they didn’t know, so that we as a social club can strive towards the standards we argue about and accept from the organizers. The point of guidelines is not that they prevent malicious intent; the point is that they inhibit those behaviors that exceed the defined boundaries, however vague or precise they may be. Prevention of malice is an impossibility in all human social affairs, whether guidelines are defined or not; one must find other reasons for rules than prevention to understand why rules are at all.
> And yet, she persisted, we will still set guidelines
I'm not sure if you're including or excluding me from the "we". If you're excluding me, then I feel our conversation has come to an end.
But if you're including me, then I think the guidelines need to evolve to deal with LLMs. Maybe not right now--maybe the current guidelines are sufficient for the next year or two or three. But I think we as a community are uniquely qualified to design and influence the future of internet social clubs in the face of LLMs.
> I'm not sure if you're including or excluding me from the "we".
“We” here refers to individual human beings that are members of the human social-entity constructs (‘social clubs’) that precipitate naturally out of human groups, both in general to all such groups and in specific to the group under discussion here today, HN participants.
Whether or not you’re a member of “we” HN participants is conditional on whether or not you are honoring the policy of no AI-assisted writing at HN that is in effect as of whenever you saw this post or the new guidelines. I have no judgment to offer you in that regard, and in any case you’re readily able to decide that for yourself. Separately, I’m not engaging with discussion about future policy; perhaps you should start a top-level thread about it, or write a blog post and submit it (after a few days have passed, so it doesn’t get topic-duped and so that passions have cooled somewhat).
I think "must be unenhanced human" is probably the most sophisticated criteria even if it's simple. I don't think there's much value in trying to optimize the perfect "thoughtful, curious discussion", why would there be, it implies some ideal state for "thoughtful and curious" vs the reality that discussions between living breathing people is interesting by default as long as folks loosely follow some guidelines.
It doesn't matter.
The guidelines are perfectly clear, no matter the outcome of your thought experiment. Hacker News wants intelligent conversation between human beings, and that's the beginning and the end of it.
If you want LLM-enhanced conversation then I'm sure you will find places to have that desire met, and then some. Hacker News is not that place, and I pray that it will never become that place. In short, and in answer to "Do we prefer text with the right "provenance" over higher quality text?".
Yes. Yes, we do.
I like to read human comments because I'd like to know what my fellow humans think. I'd prefer not to read low-effort, throw away comments, but other than that I want to know what people think about different topics.
I read HN both because I want to read what humans think, and because I want to read insightful discussion.
The tension is that as insightful discussion becomes easier/better with LLMs, there is less need to read HN. All I'm left with is provenance: reading because a human wrote it, not because it is uniquely insightful.
If the goal is to read what actual humans think, it's hard to see how an LLM filter can do anything but obscure and degrade the content.
LLMs, as we know them, express things using the patterns they've been developed to prefer. There's a flattening, genericizing effect built in.
If there are people who find an LLM filter to be an enhancement, they can run everything through their favorite LLM themselves.
I think it's a spectrum:
1. I enter "Describe the C++ language" at an LLM and post the response in HN. This is obviously useless--I might as well just talk to an LLM directly.
2. I enter "Why did Stroustrup allow diamond inheritance? What scenario was he trying to solve" then I distill the response into my own words so that it's relevant to the specific post. This may or may not be insightful, but it's hardly worse than consulting Google before posting.
3. I spend a week with creating a test language with a different trade-off for multiple-inheritance. Then I ask an LLM to summarize the unique features of the language into a couple of paragraphs, and then I post that into HN. This could be a genuinely novel idea and the fact that it is summarized by an LLM does not diminish the novelty.
My point is that human+LLM can sometimes be better than human alone, just as human+hammer, human+calculator, human+Wikipedia can be better than human alone. Using a tool doesn't guarantee better results, but claiming that LLMs never help seems silly at this point.
> 3. I spend a week with creating a test language with a different trade-off for multiple-inheritance. Then I ask an LLM to summarize the unique features of the language into a couple of paragraphs, and then I post that into HN
I think where you are getting hung up is the idea of "better results". We as a community don't need to strive for "better results" we can easily say, hey we just want HN to be between people, if you have the LLM generate this hypothetical test, just tell people in your own words. Maybe forcing yourself to go through that exercise is better in the long run for your own understanding.
My example was not great.
But my point is that I read HN partly because people here are insightful in a way I can't get in other places. If LLMs turn out to ultimately be just as insightful, then my incentive to read HN is reduced to just, "read what other people like me are thinking." That's not nothing, but I can get that by just talking with my friends.
Unless, of course, we could get human+LLM insightfulness in HN and then I'd get the best of both worlds.
If someone can't explain something in their own words, then they don't _really_ understand it. The process of taking time to think through a topic and check one's understanding, even if only for oneself and the rubber duck, will reveal mistakes or points of confusion.
Which gets to the core of the issue nicely, I want to go on to HN and talk to people who know things or have thought about things to the degree that they don't need a cheat sheet off to the side to discuss them.
How is it not better, in your third scenario, if you described what you think are the important and interesting aspects of your idea/demo?
And what motivated you to make it -- probably the most interesting thing to readers, and not something an LLM would know.
Believe me, I don't care what an LLM has to say about your thing. I care about what you have to say about your thing.
> Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
I'd argue that anything insightful or well-though-out doesn't use LLMs at all. We can quibble over whether discussions with an LLM lead to insightful responses, but that still isn't your own personal thought. Just type what's on your mind, it's not that hard and nitpicking over this is just looking for ways to open up unnecessary opportunities for abuse.
Often i think of a novel idea or solution to a problem, but use AI to communicate or adjust what I already wrote out so it’s more comprehensible. Sometimes when I write, it’s hard to understand.
The more you write, the less this will be true. The more you write, the better you will become at it. Using an LLM to write is like sending a robot to the gym for you.
The more you use an LLM to write for you, the worse you will become at writing yourself. There is simply no other possible outcome. It's even true of spellcheck - the more you use a spellcheck the worse you become at spelling. I know this for a fact because I can no longer spell for shit. However, spelling is to writing as arithmetic is to mathematics. I also can't add up, but I have a degree in pure mathematics.
LLMs are a cancer on human thought and expression.
> LLMs are a cancer on human thought and expression.
LLMs help to express what many people dont have the energy or ability to express. It also has a broader scoped view of protocol...It does not have emotions, which often leads to less than optimal discourse.
In many ways, it help those who are challenged in discourse to better express themselves...rather than keeping silent or being misunderstood.
How do you expect to get better at it then if you avoid the hard work and emotional weight of fixing it?
So if you want to reply to a comment you read today, and you don't feel like your writing skill is up to snuff, you should be content with expecting to wait the requisite weeks or months or years of practice before even considering replying to it?
This seems especially relevant for non-English-fluent commenters, who are increasingly using LLMs to be able to communicate more effectively on an English-only site like Hacker News than they'd otherwise be able to do.
In that sense AI is a tool much like a dictionary, it enhances and I'd say improve the end result.
The difference is that I will retain what I drew out from the dictionary the next time. If people use Ai this way for writing, great! What many of the "enhanced-by-ai" arguments sound like is that this will be an indefinite outsourcing.
Use them to get better, like how reading good writing directly (not summarized) will also make you a much better writer. Learn from the before and after so next time there isn't a need to reach for Ai.
There are many obvious ways in which this may not be true.
Anyone learning the language and some people with learning disabilities, for example, may communicate better via an LLM.
There is a sliding scale from that, to it being the LLM that communicates, not the person. LLMs can really reshuffle and change priorities and modify emphasis in a text. All the missing pieces will be filled in and rounded out and sandpapered off by the inner-average-corporate-HR-Redditor of the LLM.
I promise you, after this past year, you don’t know how happy I am to read issues and PRs in broken English.
By this logic, you might consider vibe coding a browser plugin that takes any HN comment less than 50 words and auto-expands it into an “insightful, well thought-out response.”
Delivered: https://github.com/telotortium/dotfiles/tree/27c11efd967eebc...
Length is not insight. I understand this to be a community oriented towards people who are not impressed by such superficial things.
That's the point :)
What is the value of this "output"? If I want to know what LLMs think about something, I can go ask an LLM any question I want. For a comment on [a site like] HN, either the substantive content of the comment originated inside a human mind, or there is no substantive content that I couldn't reproduce by feeding the comment's context into an LLM. At the extreme, I don't have any interest in reading or participating in a conversation between a bunch of LLMs.
They’re referencing LLM-enhanced output.
The value proposition is that someone who is a lousy writer (perhaps only in English) with deep domain knowledge is going back and forth with the LLM to express some insight or communicate some information that the LLM would not produce on its own.
> perhaps only in English
Wouldn't it work better to just write the thing in whatever language they can actually write in and then do a straightforward translation in a single pass?
> someone who is a lousy writer with deep domain knowledge going back and forth with the LLM to express some insight or communicate some information that the LLM would not produce on its own
This sounds reasonable on its face, but how often does it actually come up that somebody can't clearly express an idea in writing on their own but can somehow get an LLM to clearly express it by writing a series of prompts to the LLM?
And, if it does come up, why don't they just have that conversation with me, instead?
> Wouldn't it work better to just write the thing in whatever language they can actually write in and then do a straightforward translation in a single pass?
Nontrivial translation tools are AI(neural net)-based tools (although not necessary LLM). Whole transformer neural net architecture was originally designed for translation.
I don't have a problem with people using these tools to translate their writing into languages they aren't fluent/literate in. It's a completely different dynamic vs. having them write for you.
Exactly!
Just as Google-enhanced output and Wikipedia-enhanced output has helped my writing/thinking, I believe LLM-enhanced output also helps me.
Plus, I personally gain more benefit from using an LLM as a researcher than as a writer.
Using LLMs for research is completely different from using them to write for you. And if you're using them to write about the results of research, you're almost certainly getting a lot less value out of the whole exercise.
> Do I prefer low-effort, off-the-top-of-my-head reactions, as long as it is human? Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
Neither. I want insightful, well-thought-out, human comments.
It's a little sad that this might be too much to ask sometimes...
The point of a discussion site is to hear what other people think and get different perspectives. Just getting an LLM's insightful, well-thought-out response isn't really a big draw, if one is looking for that, there's a pretty obvious way to get it. I posted this the other day (ignore the title I realized later it's too clickbaity) but this is why IMO LLMs won't replace the workforce, people aren't looking for answers to things, they're looking for other people's takes: https://news.ycombinator.com/item?id=47299988
> If I wanted to read what an LLM thinks, I could just ask it.
and
> Or do I want an insightful, well-thought-out response, even if it is LLM-enhanced?
What is the difference? What's the line between these two?
The prompt: "Analyze <opinion> and respond" is pretty clearly "I would just ask it." and, the prompt: "here's my comment, please ONLY the check the grammar and spelling" would probably be ok.
What about prompt:"I disagree with using LLMs for commenting at all for <reasons>. Please expound on this and provide references and examples". That would explode the word count for this site.
What about:
1. "Here is my answer to a comment. Give me the strongest argument against it."
2. "I think xyz. What are some arguments for and against that I may not have thought of."
3. "Is it defensible for me to say that xyz happened because of abc?"
All of these would help me to think through an issue. Is there a difference between asking a friend the above vs. an LLM? Do we care about provenance or do we care about quality?
The difference is in the journey to find the answer, rather than outsourcing it to man or machine. Spending more time reflecting before first post will often answer the easy questions so you can formulate more thoughtful questions.
I prefer low effort human thought to low effort llm output.
> But here's where it gets tricky
Pretty sure this comment is AI
Now I know how the Salem witches felt. How can I prove that it's not AI?
You can't. Nobody can. False positives are the inherent danger of these sorts of policies — especially when the LLMs were trained on the exact writing styles that have dominated online conversations and publications for decades.
Gonna put out a blanket assertion about my preferences, to get a read on whether these are shared or not:
As humans, we have directives (genetic, cultural, societal, etc.) to prioritize humanistic endeavors (and output) above all else.
History has shown that humans are overwhelmingly chauvinistic in regards to their relationship to other animals in the animal kingdom, even to the point of structuring our moral/ethical/legal systems to prioritize human wellbeing over that of other animals (however correct/ethical that may ultimately be, e.g., given recent findings in animal cognition, such as recent attempts to outlaw boiling lobsters alive as per culinary tradition).
But, it seems that some parties/actors are willing (i.e., benefiting) from subverting this long-standing convention (of prioritizing human interests) in the face of AI (even to the point of the now-farcical quote by Sam Altman that humans take far more nurturing than LLMs...)
So: should we be neglecting our historical and genetic directives, to instead prioritize AI over human interests? Or should we be unashamedly anthropic (pun intended), even at the cost of creating arbitrary barriers (i.e., the equivalent of guilds) intended to protect human interests over those of AI actors?
I strongly recommend the latter, particularly if the disruptions to human-centric conventions/culture/output are indeed as significant (and catastrophic) as they will likely be if unchecked.
> well-thought-out response, even if it is LLM-enhanced?
There's no insight nor well-thought-out response once a person decides to "LLM-enhance" their response. The only insight that the person using the LLM is too limited to have a decent conversation with.
Humans have more variability and "edge". If a person is passionately arguing for some point of view (perhaps somewhat outside the usual), it signals to me that they probably thought about this and it is a distillation of a long thought process and real-life experience. One could say that the logical argument should stand alone, but reality doesn't work that way. There are many things you have to implicitly trust and believe when you read. Of course lying and bullshitting already existed before ("nobody knows you're a dog" etc etc). But LLMs will really eloquently defend X, not X, X*0.5 and anything inbetween. There is no information content in it, it doesn't refer to an actual human life experience and opinion that someone wants to stand behind. It just means that someone made the LLM output a thing.
> But it wouldn't surprise me if someday LLM-enhanced output becomes sufficiently superior to average human output that the choice to stick with authentic human output will be more painful.
My ideal vision is that instead of outsourcing indefinitely, we learn from the enhanced versions and become better independent writers.
If you like reading LLM output, just talk directly to an LLM. Problem solved.
> Am I here to read authentic humans because I value authenticity for its own sake (like preferring Champagne instead of sparkling wine)?
Mate, Champagne is a sparkling wine. In French you can even at times hear people asking for "un vin mousseux de Champagne" meaning "a sparkling wine from Champagne" instead of the short form (just saying "un Champagne" or "du Champagne").
Now, granted, not all sparkling wine are Champagne.
The Wikipedia entry begins with: "Champagne is a sparkling wine originated and produced in the Champagne wine region of France...".
I drank enough of it to be stating my case, of which I'm certain!
P.S: and btw, yup, authentic humans content only here, even if it's of "low quality". If I want LLM, I've got my LLMs.
Basically you have Cremant type sparking wines which are produced from other regions of France besides Champagne. It is just like Champagne just that other French regions like Loire, Alsace, Bordeux etc are not allowed to call it Champagne.
So just like Armanac's are like Cognac's for lower price, good Cremant will be cheaper and more enjoyable that cheaper Champagne (I've not had any really expensive Champagne).
Then you have Cava from Spain which is similar process to Cremants and Champagne. The difference would be in type of grapes used. A friend of mine swears by Cavas just like I swear by Cremants from Loire region. However my wife hates Cava.
Then Proseccos from Italy again are similar, but quality varies more.
After that we get into more questionable cheaper sparkling wines which usually means some sort of out of bottle insertion of CO2 and even worse version include some other modifications such as sugar.
In general to avoid literal headaches you want BRUTs. Anything semi-sweet or sweet is suspicous.
Again I am not a full wine expert but this is mostly years of ahem experience.
I keep wishing for a public place to put a formatted version of my LLM threads. I have long conversations with LLMs that usually result in some kind of documentation, tutorial, or dataset. Many of them are relatively novel, but I haven't created a place for them yet.
And no, I wouldn't think an HN post is it either.. I'm just saying, there should be a good place to post the output of good questions asked iteratively.
Have you ever read someone else's conversation with an LLM?
Not the op but I barely even read my own conversations with an LLM. ChatGPT was always so verbose even when I told it to be succinct.
Claude is a bit better but still prone to rambling.
I hinted at "formatted" and "good".. add the words "curated" or "edited".
Make a blog? Hardly a hard problem there mate.
If you can’t even be arsed doing that how much value is there, really?
Personally the only thing less interesting to me than someone else’s conversations with an LLM is hearing about someone else’s dream they had last night but you never know, some people may be interested.
Thanks for slagging.
But I was thinking less blog and more like an LLM research notebook, à la Jupyter. Jupyter for LLM prompts, outputs, refinements.
No slagging meant, sorry. Reading back it does seem a bit like that you are right.
Simon Willison published something for turning Claude Convos into something publishable. [1] I haven't tried it, so cannot speak to the ergonomics.
Where to post it? Any blog site, probably a good few Show HN too. Will anyone read it? I haven't read anyone else's, I'm more inclined to dock them reputation for suggesting I read their Ai session. Snippets of weird things shared on socials were interesting to me early on, but I'm over that now too.
[1] https://simonwillison.net/2025/Dec/25/claude-code-transcript...
"AI-edited comments" is a very interesting one. Where is the line between a spelling/grammar/tone checker like Grammarly, that at minimum use N-Grams behind the scenes, and something that is "AI" edited? What I am asking is, is "AI" in this context fully featured LLMs, or anything that improves communication via an automated system. I think many people have used these "advanced" spellcheckers for years before Chatgpt et al came on the scene.
I think "generated comments" is a pretty hard line in the sand, but "AI-edited" is anything but clear-cut.
PS - I think the idea behind these policies is positive and needed. I'm simply clarifying where it begins and ends.
You're touching on an important point. More here: https://news.ycombinator.com/item?id=47342616.
All this stuff is in flux. I thought a lot about whether to add the "edited" bit - but it may change. What I deliberately left out was anything about the articles and projects that get submitted here. There's a lot of turbulence in that area too, but we don't yet have clarity, or even an inkling, of how to settle that one.
Do the guidelines also disallow comments along the lines of "according to <AI>, <blah>"? (I ask this given that "according to a Google search, <blah>" is allowed, AFAIK.)
I would lean towards disallowing those. With "According to a Google search ...", someone can ask for specific links (and indeed, people often say to link to those sources to begin with instead of invoking Google). With "According to AI ... " - why would most readers care what the AI thinks? It's not a reliable source! You might as well say "According to a stranger I just met and don't know ..."
If you're going to say that the AI said X, Y, Z, provide a rationale on why it is relevant. If you merely found X, Y and Z compelling, feel free to talk about it without mentioning AI.
Citations can be helpful. But AI summaries and Google searches are poor citations because they are not primary sources.
I think those should be allowed iff the nature of being AI-generated is relevant to the topic of discussion — e.g. if we're talking about whether some model or other can accurately respond to some prompt and people feel inclined to try it themselves.
I constantly read those comments and I personally have conflicting opinion with them. On one hand, it's interesting to compare what is coming out of models, but on the other hand, LLMs are all non-deterministic, so results will be fairly random. On top of that, everybody has a different "skill" level when prompting. In addition, models are constantly changing, therefore "I asked chatGPT and it said..." means nothing when there is a new version every few months, not to mention you can often pick one of 10+ flavors from every provider, and even those are not guaranteed to not be changed under the hood to some degree over time.
Please rethink the “edited” bit on accessibility grounds.
I have a kid with severe written language issues, and the utilisation of speech to text with a LLM-powered edit has unlocked a whole world that was previously inaccessible.
I would hate to see a culture that discourages AI assistance.
Are you up for sharing details?
> I would hate to see a culture that discourages AI assistance.
Mostly I think the push back is about ai assistance in its current form. It can get in the way of communicating rather than assisting. The cost though is mostly borne by the readers and those not using the AI for assistance. I have seen this happen when the ai adds info and thoughts that were tangental to the original author and I think, but I can not verify times where an author seems to try to dig down on the details but seemingly can not.
Since it's mostly a good-faith rule to begin with, it seems easy to add something like, "unless you are using it as an assistive technology for accessibility reasons".
Oh wow. I did not anticipate that, which is embarrassing given that I wrote this just recently:
https://news.ycombinator.com/item?id=47326351
Yes, please at least have a carveout for accessibility. I definitely have dictated HN comments in the past, and my flow uses LLMs to clean it up. It works, and is awesome when you're in pain.
Hear hear. And like many other aspects of accessibility, it will help a huge number of people who may not have any severe issues. e.g. non-native English speakers using LLM-powered edits.
You should use your own words. It might seem that a tool like Grammarly is just an advanced spellcheck, but what it's really doing is replacing your personal style of writing with its own.
It's better to communicate as an individual, warts and all, than to replace your expression with a sanitized one just because it seems "better." Language is an incredibly nuanced thing, it's best for people's own thoughts to come through exactly as they have written them.
My elementary school kid came home yesterday and showed me a piece of writing that he was really proud of. It seemed more sophisticated than his typical writing (like, for example, it used the word "sophisticated"). He can be precocious and reads a ton, though, so it was still plausible that he wrote it. I asked him some questions about the writing process to try to tease out what happened, and he said (seemingly credibly) that he hadn't copied it from anywhere or referenced anything. He also said he didn't use any AI tools. After further discussion, I found out that Google Docs Smart Compose (suggested-next-few-words feature) is enabled by default on his school-issued Chromebook, and he had been using it. The structure of the writing was all his, but he said he sometimes used the Smart Compose suggestions (and sometimes didn't). He liked a lot of the suggestions and pressed tab to accept them, which probably bumped up the word choice by several grade levels in some places.
So yeah, it can change the character of your writing, even if it's just relatively subtle nudges here or there.
edit: we suggested that he disable that feature to help him learn to write independently, and he happily agreed.
To rationalize my gut-feelings on this, I think it comes down to the spectrum between:
1. A system that suggests words, the child learns the word, determines whether it matches their intent, and proceeds if they like the result.
2. A system that suggests words, and the child almost-blindly accepts them to get the task over with ASAP.
The end-results may look the same for any single short document, but in the long run... Well, I fear #2 is going to be way more common.
#1 would be a net improvement over the status quo IMO. Seems like a great way for people to expand their vocabularies organically.
That reminds me of one of the biggest IMO missing feature of Wordle: They never give a definition of the word after the game is finished! I usually do end up googling words I don't know (which is quite often) but I'm guessing I'm one of the few who goes to the trouble. I've even written to The New York Times a couple times to suggest adding a short definition at the end as I honestly feel like a ton of people could totally up their vocabulary game and it surely could be added with minimum effort (considering they even added a Discord multiplayer mode).
The analogy with tab-completion of code seems apt. At first you blindly accept something because it has at least as good a chance of working as what you would have typed. Then you start to pay attention, and critically evaluate suggestions. Then you quickly if not blindly accept most suggestions, because they're clearly what you would have written anyway (or close enough to not care).
The phenomenon was observed in religious philosophy over a millennium ago (https://terebess.hu/zen/qingyuan.html).
Tab completion was so novel back when full e2e AI tooling was not really effective.
Now that it is, I just turn tab completion off totally when I write code by hand. It's almost never right.
Emacs has completion (but you can bind it to tab). The nice thing is that you can change the algorithm to select what options come up. I’ve not set it to auto, but by the time I press the shortcut, it’s either only one option or a small sets.
From his description, it sounded like this was more of #1. He cared a lot about the topic he was writing about, and has high standards for himself, so it's very likely that he would have considered and rejected poor suggestions.
I have mixed feeling about it. On the one hand, you're right: carefully considering suggestions can be a learning opportunity. On the other hand, approval is easier than generation, and I suspect that without flexing the "come up with it from scratch" muscle frequently, that his mind won't develop as much.
Oh how I despise these suggestions. You sometimes look for a way to express something and you are on the verge of giving the world something truly original, but as soon as your brain sees the suggestion it goes "oh yeah that fits"
I look forward to reading studies in 10 years how we all became stupider thanks to this "feature". One step closer to the movie Idiocracy.
I disabled them immediately, it feels like the tech version of the ADHD person who keeps interrupting you with what they think you are trying to say. Even if the suggestion is correct, it saves you at most 2 seconds at the cost of interrupting you constantly.
True! There's an important cybernetic aspect to all this, where an automatic suggestion can be an interruption, sometimes worse if the suggestion is decent.
A certain amount of friction is necessary, at least if the goal is to help the person learn or make something original.
GK Chesterton would have something brilliant to say about the inauthenticity of it all or something.
I see the suggestions and then choose something different anyway. I don't want to use one of the top 3 most popular responses to an email from a friend. Even if it's something transactional.
> I despise these suggestions
As an adult, I do too. As a middle schooler, we absolutely used word processors’ thesaurus features to add big words to our essays because the teachers liked them.
Friend of mine was a English teacher. She quit because she's not going to waste her time 'grading' 30 essays written by AI.
Anyway before that she HATED the thesaurus. And she could tell when students were using it to make their writing more fancy pants.
One problem I see is that LLMs have a more nuanced... well, model of how words and their meanings relate to each other than a dead-tree thesaurus could ever present, what with its simplified "synonym" and "antonym" categories. Online versions try to give some similarity metrics, but don't get into the nuance. (It's not as if someone who takes either approach would want to spend the time reading and understanding that, anyway.)
> she could tell when students were using it to make their writing more fancy pants
I had two teachers who called us out on this, and actually coached us on our writing, and I remember them fondly. (They were also fans of in-class essaying.)
The others wanted to count big words.
As a non native English speaker my own words wouldnt be in English. If I express myself in English I soon struggle for the right words. On the other hand I think when I read some English text I'm quite capable of sensing the nuances. So it feels when I auto translate my text to English an than read against it again and make some corrections, I can express my thoughts much better.
My broken english now officially bumps my comments up instead of down. Sweet.
For what it's worth, I had a quick look through your comment history and your English seems just fine to me as a native speaker (at least for informal communication).
People who don't have English as their first language often seem to underestimate how good their English actually is. I wonder if it's because their reference point is formal English rather than the much more forgiving English we use in casual day-to-day conversation.
Books and newspapers have had editors for centuries. It is just code review for the written word.
[It looks like MS Word 97 had the ability to detect passive voice as well, so we're talking 30 year old technology there that predates LLMs -- how far down the Butlerian Jihad are we going with this?]
Editors are mostly tasked with maintaining a consistent style and standard.
There is no need for that here beyond maybe spellcheck. Use your own thoughts, voice, and words.
I don't personally use AI/LLMs for any informal writing here or on reddit, etc. But I think it is pretty weird to be overly concerned around people, particularly ESL, who use tools to clean up their writing. The only thing I really care about is when someone posts LLM regurgitated information on topics they personally don't know anything about. If the information is coming from the human but the style and tone is being tweaked by a machine to make it more acceptable/receptive and fix the bugs in it, then I don't understand why you're telling me I need to care and gatekeeping it. It also is unlikely to be very detectable, and this thread seems to only serve a performative use for people to get offended about it.
Other tools to clean up writing are allowed. They did not tell you you must care. You told them they must not. The submisson's use was to tell you and others LLM generated tone was not more acceptable.
Well good luck detecting it.
If it never gets in the way of the humans communicate it probably won't be an issue. That is the reading I have of the rule and Dang's comments
> HN is for conversation between humans.
If it is enhancing that instead of detracting and wasting peoples time it does not seem to be against the spirt of the rules.
Except the letter of the rule makes it verboten even “if it never gets in the way of the humans communicate”.
I was just re-reading the passage from Plato's "The Phaedrus" on writing & the "art" of the letter for an essay I'm working on, and your remark is salient for this discussion on LLM-style AI and social media at large.
RIP Robert M.Pirsig.
Oof, I haven't finished Zen yet. I didn't know he was gone. RIP
>It's better to communicate as an individual, warts and all, than to replace your expression with a sanitized one just because it seems "better."
It is definitely not true that it is better for a poster to communicate like an individual when it comes to spelling and grammar. People ignore posts that have poor grammar or spelling mistakes, and communications that have poor grammar are seen as unprofessional. Even I do it at a semi-subconscious level. The more difficult or the more amount of attention someone has to pay to understand your post, the less people will be willing to put in that effort to do so.
Precisely. As I wrote in my assessment of AI for my workplace;
"Your unique human voice is more valuable than a thousand prompt-driven LLM doggerels."
> It's better to communicate as an individual, warts and all, than to replace your expression with a sanitized one just because it seems "better." Language is an incredibly nuanced thing, it's best for people's own thoughts to come through exactly as they have written them.
This is the opposite of how language works. You want people to understand the idea you're trying to communicate, not fixate on the semantics of how you communicated. Language is like fashion - you only want to break the rules deliberately. If AI or an editor or whatever changes your writing to be more clear and correct, and you don't look at it and say "no, I chose that phrasing for a reason" then the editor's version is much more likely to be understood correctly by the recipient.
That's true, but on the flip side I regularly get downvoted because my English is not the best, so say it mildly. So, now I need to be really careful, to a) write in a good English or b) not to be recognised as an LLM corrected version of my English. Where is the line? I shouldn't be downvoted for my English I think, but that is the reality.
Edit: I already got downvoted. :-) Sure, no one can tell exactly why. Maybe the combination of bad English _and_ talking sh*ce isn't ideal at all. :-D Anyways, I have enough karma, so I can last quite a while..
It goes both ways.
The quality of my writing varies (based on my mood as much as anything else, I suppose), but when it is particularly good and error-free then I often get accused of being a bot.
Which is absurd, since I don't use the bot for writing at all.
> I shouldn't be downvoted for my English I think, but that is the reality.
How do you know? Is it possible the downvoters just didn't like what you said?
It’s possible of course but reading all the comments from various non-native English speakers here it seems like a common story. It may indicate a subliminal bias in readers (most of whom are presumably American).
Note that those comments are written in perfectly understandable English. Further note how often you come across comments written in perfectly understandable English, but they're downvoted anyway.
It suggests a bias in writers to assume that people would agree with them if only they could express their thoughts accurately.
But the problem is that people with poor written language / english skills are 'competing' with people who have superb skills in this domain.
There are people here who sit at a desk all day banging out multipage emails for work who decide to write posts of a similar linguistic calibre for funsies.
Meanwhile you have someone in a developing country who just got off a brutal twelve hour shift doing manual labour in the sun who wants to participate in the conversation with an insightful message that they bang-out on a shitty little cellphone onscreen keyboard while riding on bumpy public transit.
You could have a great idea and express it poorly and be penalized for doing so here while someone could have a blah idea expressed excellently and it's showered in replies despite being in some metrics (the ones I think are most important) worse than the other post.
What's the solution for that?
> What's the solution for that?
Remember that you're on a message board and you're not actually 'competing' for anything?
This is a perfect example of what I'm talking about.
I knew someone was going to comment on my use of the word there despite me putting it in quotes which was intended to let the reader know that I meant that word as an approximation of what I was meaning.
When I say competing I mean competing in the space of ideas here. There is a ranking system here that raises or lowers the visibility and prominance of your comments and it's based on upvotes by other uses. For better or worse people penalize comments with grammatical errors over ones that don't and that affects how much exposure other users have to the ideas that people write and how much interaction they get from them.
If that's the case why would somebody who has good ideas but poor expressive capability bother posting here if their comments are just going to get ignored over relatively vapid comments that are grammatically correct?
> If that's the case why would somebody who has good ideas but poor expressive capability bother posting here if their comments are just going to get ignored over relatively vapid comments that are grammatically correct?
The main problem is that ai consistently is seeing making things worse. Take a look at the examples in Dang's link in their comment: https://news.ycombinator.com/item?id=47342616
In the ones I read the AI editing is either hurting or needs to be much, much better to help.
No, I get your point. Unfortunately, alot of people here try to act high and mighty like they are posting here for some altruistic reason. The reason why I, you, and everyone else posts here is the human reason that we want others to engage with our posts. In order to do that, you have to put your best foot forward, which includes making sure the spelling and grammar of your posts is correct. While I do not use an LLM for this, I think that it is vaild to use these tools to make sure nothing gets in the way of whatever point you are trying to make.
> In order to do that, you have to put your best foot forward
In English. You have to put your best foot forward in English. And in your environment with the resources you have at your disposal.
For example, I'm currently engaging with you between steps in a chemistry process that's happening under the fumehood next to me while wearing a respirator, a muggy plastic chemical resistant gown and disposable gloves nitrile globes.
I am absolutely certain that these conditions are different than the ones I would need to 'put my best food forward' in this discussion. I'm also certain that quite certain that you and I would both absolutely stumble if we were obligated to particpate in this forum in a language that we're not proficient in as many users often attempt to do and are unfairly penalized for by other members of the community.
I'm with you on the LLM usage for grammatical issues for non-native speakers. I bet more in this community would feel the same way if Dang whimsically mandated that people had to use a language other than English on certain days of the week.
Oh shit that would be fun. Tuesday, we're going to do it in Mongolian, see how that goes.
> You could have a great idea and express it poorly and be penalized for doing so here while someone could have a blah idea expressed excellently and it's showered in replies despite being in some metrics (the ones I think are most important) worse than the other post.
I absolutely do not understand this comment. Are you saying that posting is competitive and that comments have "metrics"?
Yes! If my comment is above yours in a thread, it means I got more upvotes than you did, which means I get special bonuses and more to eat and you go hungry in Internet land. Also it means I'm better than you (obviously) and I get to go to this secret club with all the pretty people and you're not invited. Isn't that how this all works?
I disagree. HN is going to bury my raw unedited tirade of a comment about those fucking morons that couldn't code their way out of a paper bag. If I send a comment to ChatGPT and open up the prompt with "this poster is a fucking dumbass, how do I tell them this" and use that to get to a well reasoned response because that's the tool we have available today, we're all better off.
The guidelines state:
> Be kind. Don't be snarky. Converse > Edit out swipes. > Don't be curmudgeonly.
On the best of days I manage to follow the rules, but I'm only human. If I run my comment through ChatGPT to try and help me edit out swipes on the bad days, that's not ok?
I'm not using ChatGPT to generate comments, but I've got the -4 comments to show that my "thoughts exactly as they have written them" isn't a winning move.
If you see an incompetent coder and wish to communicate that the person responsible is a "fucking moron/dumbass", the tone with which you do so is not the problem. Tell us what is wrong with the code, as objectively as possible. That's what the guidelines are trying to convey.
The guidelines don't say anything about not posting something because an LLM told you that you shouldn't...
I'm not sure I agree with this. I don't really want to see someone else's stylistic "warts".
I just want clean, easy-to-read content and I don't care about the person who wrote it. A tool like Grammarly is the difference between readable and unreadable (or understandable and understandable) for many people.
You could run the comments everyone else posts through an AI tool and ask it to rephrase it so that it is clean, and easy-to-read.
You could even write a plugin for your favorite web browser to do that to every site you visit.
It seems hard to achieve the inverse that is (would you rather I use i.e.?) rewrite this paragraph as the original author did before they had an AI re--write it to make it clean, (--do you like oxford commas, and em/en dashes! Just prompt your AI) and easier to read
> You could run the comments everyone else posts through an AI tool and ask it to rephrase it so that it is clean, and easy-to-read.
For those coming from a language other than English, you are more likely to lose information by using a tool to “reconstruct” meaning from poorly phrased English as an input, as opposed to the poster using a tool to generate meaningful English from their (presumably) well-written native language.
> You could run the comments everyone else posts through an AI tool and ask it to rephrase it so that it is clean, and easy-to-read.
But that creates a private version of the text which the original poster didn't sign off on. You could have fixed something contrary to their intent.
There's a big difference between me running a filter on other people's words, and those people themselves choosing to run one and then approving the results.
I personally don't see a problem with someone using a grammar checker as long as they aren't just blindly accepting its suggestions. That said, if someone actually is using it in that way, it shouldn't be detectable anyway, so it probably doesn't matter all that much whether or not it's included in the letter of the rule.
On a technical level, you can really only guard against changing your semantics and voice - if you're letting software alter the meaning, or meanings, you intend, and use words you don't normally use, it's probably too far.
This is probably ok:
>> On a technical level, you can really only guard against software that changes your semantics or voice. If you're letting it alter the meaning (or meanings) you intend, or if it starts using words you would never normally use, then it's gone too far.
This is probably too far:
>>> On a technical level, it's important to recogn1ize that the only robust guardrail we can realistically implement is one that prevents modifications to core semantics or authorial voice. If you're comfortable allowing the system to refine or rephrase the precise meanings you originally intended — or if it begins incorporating vocabulary that doesn't align with your typical linguistic patterns — then you've likely crossed a meaningful threshold where the output no longer fully represents your authentic intent.
Something to consider is that you can analyze your own stylometric patterns over a large collection of your writing, and distill that into a system of rules and patterns to follow which AI can readily handle. It is technically possible, albeit tedious, to clone your style such that it's indistinguishable from your actual human writing, and can even icnlude spelling mistakes you've made before at a rate matching your actual writing.
AI editing is weird, though. Not seeing a need, unless English isn't your native language.
I think that the line between A"I" editing to fix grammar or to translate from a different native language and A"I" editing by using an LLM is one of those things that's very hard to unambiguously encode in written guidelines, but easy to intuitively understand using common sense, in the vein of I know it when I see it.
https://en.wikipedia.org/wiki/I_know_it_when_I_see_it
> Where is the line between a spelling/grammar/tone checker like Grammarly
For me, the line is precisely at the point where a human has something they want to say. IMO - use the tools you need to say the thing you want to say; it's fine. The thing I, and many others here, object to is being asked to read reams of text that no-one could be bothered to write.
I think the only practical litmus test here is whether you can stand by the text as your own words. It’s not like we have someone looking over commenters’ shoulders as they type.
Ultimately, this comes down to people making a good-faith judgment about how much AI was involved, whether it was just minor grammatical fixes or something more substantial. The reality is that there isn’t really a shared consensus on exactly where that line should be drawn.
I think there's a pretty clear gap between editing for grammar/spelling and editing for tone.
Trying to lawyer this is the wrong approach. When in doubt: don't.
That feels very uncharitable.
When a policy is introduced to seemingly guard against new problems, but happens to be inadvertently targeting preexisting and common technology, I don't feel like it is "lawyering" it to want clarity on that line.
For example, it could be argued this forbids all spellcheckers. I don't think that is the implied intent, but the spectrum is huge in the spellchecker space. From simple substitutions + rule-based grammar engines through to n-grams, edit-distance algorithms, statistical machine translation, and transformer-based NLP models.
Finding it more refreshing these days when reading text with broken grammar, incorrect use of pronouns, etc. especially for HN, the human connection is more palpable. It’s rarely so bad that it’s not understandable
Grammarly use is outright prohibited by this; AI-edited writing is no longer writing that you hold personal and exclusive responsibility for having written. Consider Stephen Hawking’s voice box generator. While the sounds produced were machine-assisted, the writing was his alone. If you find yourself unable to participate in this web forum without paying a proofreader (in time, money, or cycles) to copy-edit your writing, then you’re not welcome on HN as a participant.
> If you find yourself unable to participate in this web forum without paying a proofreader (in time, money, or cycles) to copy-edit your writing, then you’re not welcome on HN as a participant.
You forgot the /s ?
It’s not sarcasm. If you feel if I have misunderstood the intent of the guideline we’re discussing — “Don’t post generated/AI-edited comments”, as the title currently reads, then I’m happy to discuss further. (I often make logical negation errors that I miss in proofing, so it’s possible I slipped up, too!)
I thought it was sarcasm given you are asking people to “pay a proofreader”. This sounds ludicrous. Could you clarify wha you meant by that line if it’s not sarcasm? Because I’m having a hard time thinking that it’s meant to be taken at face value.
No worries. The post I replied to was asking if use of ‘grammar improvement services’ (my paraphrase) qualified as AI-assisted writing at HN. All such services cost something; Grammarly makes a lot of money charging businesses, AI consumes watts of power that someone pays for, and even Microsoft Word’s grammar checker spins up the CPU fans on an old Intel laptop with a long enough document. I took from that the generic point that one “pays” for machine-assisted proofreading by one means or another, whether it’s trading personal data for services (Google) or watts of power for services (MSWord et al.) or donating writing samples to a for-profit training corpus (Grammarly free tier) or paying for evaluations where your data is not retained for training (Grammarly paid enterprise tier with a carefully-redlined service contract) and generalized to “pay for machine proofreading”.
Then, I considered whether HN would appreciate posts/comments by a human where they’d had a PR team or a hired editor come in and review/modify/distort their original words in order to make them more whatever. I think that this probably is most likely to have occurred on the HN jobs posts, and I’ve pointed out especially egregious instances to the mods over the years — but in general, the people who post on HN tend to do so from their own voice’s viewpoint, as reaffirmed by the no-AI-writing guideline above. So I decided instead to say “pay a proofreader” because, bluntly, if the community found out that someone was paying a wage to a worker to proofread their HN comments, the response would plausibly be the same mob of laughing mockery, disgusted outrage, and blatant dismissal that we see today towards AI writing here. “You hired someone to tone-edit your HN comments?!” is no different than “You used Grammarly to tone-edit your HN comments?!” to me, and so it passed the veracity test and I posted it.
I saw a similar conversation somewhere about some project saying they don't allow AI generated code.
It was asked that if "AI Generated Code" is just code suggested to you by a computer program, where does using the code that your IDE suggests in a dropdown? That's been around for decades. Is it LLM or "Gen AI" specific? If so, what specific aspect of that makes one use case good and one use case bad and what exactly separates them?
It's one of those situations where it seems easy to point at examples and say "this one's good and this one's bad", but when you need to write policy you start drowning in minutia.
Projects cannot allow AI generated code if they require everything to have a clear author, with a copyright notice and license.
IDE code suggestions come from the database of information built about your code base, like what classes have what methods. Each such suggestion is a derived work of the thing being worked on.
Nobody is actually confused about what AI generated code means in those cases, they're just trying to be argumentative because they don't like the rules
There is no need to use any of it. Just use your own words.
I don’t think it’s really necessary to play Captain Nitpick over spell-check or whatever. You know what is meant.
Your comment is one of semantics. Worth discussing if we're talking a truly hard line rule rather than the spirit of the rule.
I benefit from my phone flagging spelling errors/typos for me. Maybe it uses AI or maybe it uses a simple dictionary for me. Maybe it might even catch a string of words when the conjunction isn't correct. That's all fair game, IMO. But it shouldn't be rewriting the sentence for me. And it shouldn't be automatically cleaning up my typos for me after I've hit "reply". That's on me.
i don't care if someone has bad grammar, i want to hear their thoughts as they came up with them, we're all intelligent beings and can parse the meaning behind what you write.
i type my comments without capitalization like i'm typing into some terminal because i'm lazy and people might hate it but i'm sure they prefer this to if i asked an LLM to rewrite what i type
your writing style is your personality, don't let a robot take it away from you
I, on the other hand, find incorrect grammar mildly annoying, especially when it's due to laziness. It distracts from the thoughts being conveyed. I appreciate when people take the time to format comments as correctly as they're able.
In fact, I'd argue that lazy commenting is the real problem, which has now been supercharged by LLMs.
Good. This helps establish it in the HN culture. That’s the purpose of guidelines.
99% of rule enforcement, both IRL and online, comes down to individuals accepting the culture.
Rules aren’t really for adversaries, they are for ordinary situations. Adversaries are dealt with differently.
I mostly agree, although we've seen big shifts in the culture towards rule-deviating norms over time. Look at the guidelines for ideological battles or throwaway accounts, for example. And, as always:
> Comments should get more thoughtful and substantive, not less, as a topic gets more divisive.
This discussion reminds me of the Paradigms of Power featured in Adiamante by L E Modisett; about consensus, power, morality and society. It’s a good read.
This is only meaningful if enough people read it and agree
That’s true. Fortunately, by virtue of it being added to the guidelines, quite a few folks here are prepared to reply to obviously generated comments by simply citing and linking the rule. Just search for “shallow dismissal” to see many examples.
It will take time, but eventually everyone will know about it.
> quite a few folks here are prepared to reply to obviously generated comments by simply citing and linking the rule
Note that the guidelines do explicitly say not to post about guidelines violations in comments, and to email them instead. I know this isn’t a well-loved guideline in this modern era, but duly noted: those well-intended comments are themselves breaking the guidelines.
Are you referring to:
> Please don't post insinuations about astroturfing, shilling, brigading, foreign agents, and the like. It degrades discussion and is usually mistaken. If you're worried about abuse, email hn@ycombinator.com and we'll look at the data.
If so, that seems different. If not, can you clarify?
That one, yes. “Insinuations” is a less conditional form of “Accusations”, connected by the concept of “Claims”; they’re all synonymous from a general perspective:
- I insinuate that you are a bot (often shortened to “Is this a bot?”)
- I claim that you are a bot. (often shortened to “This is a bot.”)
- I accuse you of being a bot l. (often shortened to “Are you a bot?”)
The part where I’m interpreting to include accusations of bottery and slop is “and the like. It”; the first clause, ‘the like’ refers to the generic category of accusations against posted comments, which historically were the listed examples, but is also defined to include others not listed, such as today’s popular accusations of bot or AI; the second clause, ‘It’, refers to all insinuations-class content. Without the list of examples, this reads:
’Please don’t post insinuations. It degrades discussion
Yep, this is true. Accusations, Insinuations, Claims, of bot or AI or astroturf; they all wreck discussions and I end up having to email the mods to deal with them. A lot of people use the rhetorical device of Discredit The Opposition by invoking this sort of thing, and while that’s less prevalent in ‘reads like AI’ insinuations, they still degrade the site.
With AI-assisted writing is a violation of site guidelines, and even before it was, posting of AI-assisted writing was a clear ‘abuse’ of the community’s expectations of unassisted-human discussions. Aside from expectations, I can also classically understand in Internet history that ‘violating the guidelines’ is the phrase formerly known as ‘abuse of service’, by which I interpret the above reference to abuse to refer to breaking the guideline about posting accusations.
The guidelines are not a legal contract as program code, and perhaps this one is clunky enough that it needs to be reworded slightly; thus my intent, once the flames die down here, to let the mods know about the confusion. As I’m not a mod, this is my interpretation alone; you might have to email the mods and ask them to reply here if you want a formal statement on the matter, given how many comments this thread got in a couple hours.
ps. On ’and is usually mistaken’: I’m not a mod, so I can’t judge how often accusations of AI/bot are mistaken, but I’m also an old human who learned em-dashes in composition class, so I tend to view the modern pitchfork mobs out to get anyone who can compose English as being less accurate in their judgments than they believe they are.
What constitutes “at edited”. If I throw a block of text in to an ai see if it makes sense — say a response to a post — and fold the suggestions in, is that “ai edited”?
Yes. That's what the rule is about.
Then that's a dumb rule. God forbid someone wants to auto-correct one's own grammar in a comment before posting it.
Um, why would you do that instead of waiting for someone more knowledgable to reply, and learn from? Replies are not mandatory, and experts/insiders participating is one of the best parts of the human Internet. Let them shine.
It can catch things that I might miss or might be misinterpreted. I sometime miss simple things, like like repeated words, that an AI point out. Is a spell checker considered "AI"? Is Grammerly? Okay, maybe Grammerly from 5 years ago as opposed to today? If I'm typing on my phone and it pops up the next suggested word, is that AI edited?
And no, I don't have to reply to a post, but when I think it's a bad policy, should I just accept it without discussion? And who determines the "experts/insiders" and which voices should be allowed?
Sadly, I suspect the rate of generation of AI "everyones" vastly exceeds the community's capacity to teach culture.
Nah they are pretty good a banning users that don't follow the guidelines.
Yes, and it’s not like they just insta-ban every infraction.
I’ve broken the guidelines on this site before. The mods reply and say “hey, stop doing that, here is the guideline”. I stopped doing it. Life continues.
(They do react differently if you show a pattern of disregard rather than a one-time event; ‘dang before’ might pull up some of those in a search.)
One of the virtues of HN is polite prodding when the rules are broken.
When creating an account, there should be a short screen with the salient points from the guidelines to follow.
That will just prompt someone to create a HN account creation agent and post it to Moltbook.
This https://news.ycombinator.com/newswelcome.html
I believe the issue of proving who is and who isn't really human on the Internet will be a really important issue in the coming years, especially without sacrificing people's right to privacy and anonymity in the process.
I don't think the real issue is LLM posts. The issue with low quality on the Internet has always been quantity. The problem always has been humans who post too much, humans that use software to post too much, and now it's humans who use LLMs to post too much.
The problem with a medium that is completely free and unrestricted is that whomever posts the most sort of wins. I could post this opinion 30-40 times in this thread, using bots and alternative accounts, and completely move the discussion to be only this.
Someone using an LLM is craft a reply is not a problem on it's own. Using it craft a low-effort reply in 3 seconds just to get out is the problem.
> Someone using an LLM is craft a reply is not a problem on it's own.
No, someone using an LLM to craft a reply is a problem in its own. I want to hear what a human has to say, not a human filtered through a computer program. No grammar editing, nothing. Give me your actual writing or I'm not interested.
Do you though? Like what real difference does it make to you? Can you even tell if this has been passed through an LLM or not? If you can't tell, why does it matter?
I don't want to be robo-slopped at en masse or be fed complete fabrications but neither of those actually require an LLM. If you're going to use an LLM to gather your thoughts, I don't see a problem with that.
>Like what real difference does it make to you?
the difference is that you get to see the unfiltered, unique perspective of a real human being. Just like I don't want to talk to anyone through an instagram or tiktok beauty filter or accent remover. If your thoughts are unordered, it's okay I'll take your unordered thoughts over some smoothed over crap.
Do people have really such a low opinion of themselves that they have to push every single thing through some kind of layer of artifice?
> the difference is that you get to see the unfiltered, unique perspective of a real human being.
The implicit unfounded assumption is whether that's actually worth more than a well written orderly response. Most comments are kind of crap.
Not everyone is good at writing. In some cases, it might even be a disability aid. And if their comments aren't good, we have a system in place to rank them accordingly. Again, I think the only problem is quantity. If we're overrun with low-effort posts, no amount of ranking will help that.
Amusingly your comment carries some of the tropes of AI authorship ("is not a problem on it's own....is the problem") but it's not shaped like a profound insight is being discovered in every line is what makes it human.
How much of AI writing will pass under the radar when the big companies aren't all maximizing to generate the most engagement hacking content in a chatbot UI? Maybe it'll still stand out for being low quality, but I'm not sure. There's lots of low quality human authored content.
Not sure where my comment is going, I just kinda rambled.
> Amusingly your comment carries some of the tropes of AI authorship
It was trained on 30 years of my posts on the Internet, I'm sure some part of it sounds just like me.
If you had the LLM write the comment, then it wasn't your thoughts.
I sometimes wonder if people aren't forgetting why we're on this platform.
The goal is to have an interesting discourse and maybe grow as a human by broadening your horizon. The likelihood of that happening with llms talking for you is basically nil, hence... Why even go through the motion at that point? It's not like you get anything for upvotes on HN
> If you had the LLM write the comment, then it wasn't your thoughts.
But what if I provided the LLM my thoughts? That's actually how I use LLMs in my life -- I provide it with my thoughts and it generates things from those thoughts.
Now if I'm just giving it your comment and asking it to reply, then yes, those aren't my thoughts. Why would I do that? I think the answer goes back to my original point.
If I'm telling you my thoughts and then you go and tell a friend those thoughts, would you say those are still my thoughts even though I wasn't the one expressing them directly to your friend?
I like to think about it in terms of output-to-prompt ratio. For HN comments, I think an output ratio of 1 or less is _probably_ fine. Examples:
Expansion (output > prompt) is where it gets problematic, at least for HN comments: if you give it an 8 word prompt and it expands it to 50, you've just wasted the reader's time -- they could've read the prompt and gotten the same information.(expansion is perfectly fine in a coding context -- it often takes way fewer words to express what you want the program to do than the generated code will contain.)
I think all your examples are all perfectly fine.
As for expansion, that might just be the risk we take. I been downvoted on reddit for being "too verbose" in my replies and I'm a human. And perhaps just reading the prompt in that case wouldn't give you more information; the LLM might actually have some insight that is relevant to the conversation. What's the difference between that and googling for something and pasting it in?
I hope I'm wrong but I don't think a privacy friendly alternative is going to exist. It's going to go the way of show me your drivers license to use my site.
Why wouldn't criminals like they do now just use stolen identities? If someone verifies they are a person that doesn't mean they're not leaving their PC on with some AI that uses their credentials either.
The point of these systems is not to ban any possibility of fake accounts. The point is to add friction so that creating accounts is harder than banning them, so criminals can’t recreate them at scale. Otherwise bans take seconds to overcome and a single person can run 10000 automated identities.
Invite trees approximately solve this problem. I don't need to know who you are to know that someone in good standing in the community invited you.
And that if you misbehave you get booted out and whoever invited you gets dinged. If they get dinged enough they become a leaf rather than a branch.
I feel like we need a distributed system/protocol that allows people to have pseudonyms not linked to their real identity, but with a shared reputation/trust score, so if you’re a bad actor using a pseudonym your real identity and all your other sock puppets are penalized too.
I know very little about this but sense that some combination of buzzwords like homomorphic encryption, zk-snarks, and yes, blockchains could be useful.
Of course this would present problems if any of your identities were ever compromised and your reputation destroyed.
Driving everything by reputation-weighted identities just creates echo-chambers you then cannot escape.
The most useful time for the blowhard spout off at me is at the moment it makes me most uncomfortable. Because the blowhard probably has a valid point at some level, he’s just being an ass about it.
When we meet that moment with discipline, are able to identify and respond to the kernels of truth and ignore the chaff belted out, focus on the merits of the argument irrespective of the source of an adversarial viewpoint, we thrive.
I like the blowhards just the way they are, unruly and insolent.
No credential will be sufficient, this is basically an unsolvable enforcement problem. That doesn't obviate the utility of rules and norms, but there's no airtight system which will hold back AI generated content.
Verifiable credentials have been an idea for a long time now. It wouldn't be that hard to solve. Sign everything you post with a verifiable credential. Implement support on all social media sites. The question is whether the forum implementers, governing bodies, and social media site owners want to try to build a solution like this or not.
How will a verifiable credential stop people posting AI slop? You can already give the AI agents access to your digital identities to interact with?
Layer on captchas. It won't completely stop slop but it's an incentive against slop flooding. And I mean, nothing is stopping a human from just going into ChatGPT by hand and asking for output and copy/pasting that into an HN post box.
Problem is, if a token is anonymous, then it follows that it can be bought and sold. Which breaks the original use case of the token, right?
That is exactly what will happen. The sad thing is, it needs to happen. I've found myself advocating for this lately, when 10 years ago, I wouldn't have even considered taking that position.
If Web3-like session-signing had taken off enough to become OS or even browser-native, we would have had a fighting chance of remaining mostly anonymous. But that just didn't happen, and isn't going to happen. Mostly because fraud ruined Web3.
>The sad thing is, it needs to happen.
No, it doesn't.
There's literally no other way to combat rampant botting, child abuse, and nation-state originating disinformation campaigns and the intentional creation of public discord.
I think we're going to have to make some choices.
A completely anonymous stranger has no way to prove that they're human that can't be imitated by an AI. We've even seen that, in some cases, AIs can look more human to humans than real humans do.
The only solution I can think of to that problem is some sort of provenance system. Even before AI, if some random person told me a thing, I'd ignore them; If my most trusted friend told me something, I'd believe them.
We're going to need a digital equivalent. If I see a post/article/comment I need my tech to automatically check the author and rank it based on their position in my trust network. I don't necessarily need to know their identity, but I do need to know their identity relative to me.
Reputation tracking is the key. The most simple option is open-invite invite-only spaces: Any user can invite more users, but only users with an invite can participate. Most Discord servers work like this, secret societies like the Oddfellows do, as does the other site.
If you keep track of the invite tree, you can "prune" it as needed to reduce moderation load: low quality users don't tend to be the source of high-quality users, and in the cases where they are, those high quality users tend find other people willing to vouch for them faster than their inviter catches a ban.
The open-invite system works well in many cases. It works particularly well in-person but even there you can get drift over time. Our fraternity unanimously agreed on every single initiate who joined; the cohort today is still very different from the one 20 years ago.
In online systems the scales quickly get too big for open-invite. There needs to be a way to automatically update the trust network at a fine grain.
The one that jumps to mind is an inference system; when I +/- a comment, I'm really noting that I trust or distrust the author. It can be general or on a specific topic (eg I trust the author to tell the truth or I trust the author to make me laugh). I could also infer that other people with similar trust patterns are likely trustworthy. And I could likely infer that people who are trusted by people I trust are trustworthy.
reputable ugly bags of mostly water society
>secret societies like the Oddfellows do
yes and they're all full of suckers. In the best case which is already bad you get a pretentious online night club like Clubhouse, in the worst case you get Epstein's island.
These walled off societies always attract people who are drawn to exclusivity, are run like dystopian island communities or high school cliques and tend to, in a William Gibon 'anti-marketing', way be paradoxically even more vapid.
No you need actual open access and reputation systems. A good blueprint is something like well functioning academic communities. It's a combination of eliminating commercial motives, strict rules, high importance on reputation and correctness, peer review, and arguably also real identities and faces.
I'm going to guess we'll eventually settle onto a psuedo-anonymous cert system like HTTPS where some companies are entrusted with verification and if that company says "That's definitely a human" it'll fly - not a great solution, of course, but I really can't see a non-chain-of-custody/trust based approach to the problem and those might only slightly compromise anonymity in optimal scenarios but some compromise is inevitable.
Will it be? Or is the solution to move to smaller, trusted networks where there's less need for proof. Unfortunately I think the age of large scale open discussion forums like HN is coming to an end.
I think this is the most likely and best path. There's no stopping the flood of bots, the dead internet theory is beyond just a theory at this point.
Best we can do, for the internet and ourselves, is to move away from it and into smaller networks that can be more effectively moderated, and where there is still a level of "human verification" before someone gets invited to participate.
I don't like what that will do to being able to find information publicly, though. The big advantage of internet forums (that have all but disappeared into private discords) is search ability/discoverability. Ran into a problem, or have a question about some super niche project or hobby? Good chance someone else on the net also has it and made a post about it somewhere, and the post & answers are public.
Moving more and more into private communities removes that, and that is a great loss IMO.
> Moving more and more into private communities removes that, and that is a great loss IMO
It is a great loss. Unfortunately this is a result of unchecked greed and an attitude of technological progress at any cost. Frankly we enabled this abuse by naively trying to maintain a free and open internet for people. Maybe we should have been much more aggressively closed off from the start, and not used the internet to share so freely.
The utility of those larger sites is coming to an end, but most people aren't discerning or ambitious enough to leave and seek out the smaller places you mentioned. Places like this will remain but will join Facebook, Reddit, and Twitter as shadows of their prior useful selves. The smaller, better sites won't have to worry about attracting the masses and therefore worsening, because the masses have finally settled.
just scan your eye in this orb to prove you are human. I'll give you some sh*tcoins in excgange
We will just have to fucking swear all the time. The corporate-speak LLM won't do that.
Grok will post CP on twitter, you think it won't swear?
Maybe it will push people to seek out more in-person interactions, which would be a good thing.
Sam Altman would love to sell you a solution to the fire that he dumped gasoline on.
https://en.wikipedia.org/wiki/World_(blockchain)
One should highlight the best part of this: https://www.toolsforhumanity.com/orb
An orb that scans your eyeballs for "proof of human".
You just need to pay someone 1 cent every time they scan their eye for you. You will have people sitting at home and giving their eye scans to AIs to use.
You'd still burn through IDs. Eventually the people selling their ID would just end up blacklisted from signing up for new accounts.
Negative, I am a meat popsicle
I fully expected this to be a meme. Eerie
This issue (human attestation) is the product of these AI companies. They are poisoning the well, only to sell the cure. This may not have initially been the plan of many of these companies, but it is the eventual end goal of all of them. Very similar to war profiteers, selling both the problem and the solution simultaneously has yet to be illegalized, but has long been masterfully capitalized, and will be vigourously because nobody will stop it.
Years ago (around 2020, when GPT-2 and 3 became publicly available) I noticed and was incredibly critical of how prevalent LLM-generated content was on reddit. I was permanently banned for "abusing reports" for reporting AI-generated comments as spam. Before that, I had posted about how I believed that the the fight against bots was over because the uncanny valley of text generation had been crossed; prior to the public availability of LLMs, most spam/bot comments were either shotgunned scripts that are easily blockable by the most rudimentary of spam filters, generated gibberish created by markov chains, or simply old scraped comments being reposted. The landscape of bot operation at the time largely relied on gaming human interaction, which required carefuly gaming temporal-relevance of text content, coherence of text content (in relation to comment chains), and the most basic attempt at appearing to be organic.
After LLMs became publicly available, text content that was temporally, contextually, and coherently relevant could be generated instantly for free. This removed practically every non-platform-imposed friction for a bot to be successful on reddit (and to generalize, anywhere that people interact). Now the onus of determining what is and isn't organic interaction is squarely on the platform, which is a difficult problem because now bot operators have had much of their work freed up, and can solely focus on gaming platform heuristics instead of also having to game human perception.
This is where AI companies come in to monetize the disaster they have created; by offering fingerprinting services for content they generate, detection services for content made by themselves and others, and estimations of human authenticity for content of any form. All while they continue to sell their services that contradict these objectives, and after having stolen literally everything that has ever been on the internet to accomplish this.
These people are evil. Not these companies - they are legal constructions that don't think or feel or act. These people are evil.
It's not clear to me how this is verifiable without constant hardware supervision. Even that'll get cracked, just like DVD encryption back in the day.
You almost need dedicated hardware that can't run any other software except a mechanical keyboard and make it communicate over an analog medium - something terribly expensive and inconvenient for AI farms to duplicate.
I started promoting the idea of hardware verification about 6 years ago. Didn't get any traction and I doubt I ever will.
I think Apple is the only company that would even be able to do that. You have to control the full stack to the pixels or speaker.
One physical robot with four wheels, a camera, and a 101 up/down "fingers" to match the keyboard can roll between physical machines and type on mechanical hardware keyboards. This brings the ceiling of how many accounts you can control down to the number of computers you have, but that's not a high price to pay.
you could sell physical items at any store where you have to show your ID and you get one for the age group you are.
that kills two birds with one stone, you can then show everywhere online you are human and how old you are without the services needing any personal information about you, and the sellers don't know what you use that id tag for.
People who are posting AI comments or setting up AI bots are... people. They can show their ID. If a website owner doesn't have a way to ban that specific human and the bad guy can always get another voucher, it's sort of meaningless.
In fact, even if you can ban the human for life, I'm not sure it solves anything. There are billions of people out there and there's money to be made by monetizing attention. AI-generated content is a way to do that, so there's plenty of takers who don't mind the risk of getting booted from some platform once in a blue moon if it makes them $5k/month without requiring any effort or skill.
Perhaps not only just show your id to get your "Over age X verification object", but your ID also gets irreversibly altered (like a punch card) that makes it one-time-use only.
That might make it less likely someone would ever sell it because to get a new one might take a very long "cool-down" time and it'd severely hamper the seller.
I'll sell you my proof-of-human-age badge for $1,000.
I would be overjoyed if a human-level amount of spam cost $1000 per year-or-until-caught.
what’s to keep people from selling or giving away those id tags? seems like a nefarious entity could buy them in bulk
It's already sorta happening with SIM-cards/phone numbers that are sometimes used for similar purposes.
Same thing that keeps me from letting my agent do the online talking for me. That is to say… nothing.
law enforcement.
> especially without sacrificing people's right to privacy and anonymity in the process
I'm afraid the ship has sailed on this one. What other solutions have you heard of apart from the dystopian eyeball-scanning, ID-uploading, biometrics-profiling obvious ones?
(knowing that of course, neither of those actually solve the problem)
> I believe the issue of proving who is and who isn't really human on the Internet will be a really important issue in the coming years
On a site like HN it's kinda easy to vet for at least those that already had thousands of karma before ChatGPT had its breakthrough moment a few years ago.
Now an AI could be asked to "Use my HN account and only write in my style" and probably fool people but I take it old-timers (HN account wise) wouldn't, for the most part, bother doing something that low. Especially not if the community says it's against the guidelines.
If it becomes one, then that will be the end of sites like Hacker News.
This site, at its core, is fundamentally too low-bandwidth, too text-only, and too hands-off-moderated to be able to shoulder the burden of distinguishing real human-sourced dialog from text generated by machines that are optimized to generate dialog that looks human-sourced. Expect the consequence to be that the experience you are having right now will drastically shift.
My personal guess: sites like this will slop up and human beings will ship out, going to sites where they have some mechanism for trust establishment, even if that mechanism is as simple and lo-fi as "The only people who can connect to this site are ones the admin, who is Steve and we all know Steve, personally set up an account for." This has, of course, sacrificed anonymity. But I fundamentally don't see an attestation-of-humanity model that doesn't sacrifice anonymity at some layer; the whole point of anonymity on the Internet was that nobody knew you were a dog (or, in this case, a lobster), and if we now care deeply about a commenter's nephropid (or canid) qualities, we'll probably have to sacrifice that feature.
I'd rather keep the feature, pesonally.
I like Mitchell's Vouch idea. At the end of the day, it's all about trust. Anything else is an abstraction attempting to replicate some spectrum of trust.
https://news.ycombinator.com/item?id=46930961
https://github.com/mitchellh/vouch
I think we’ll see a return to smaller groups and implementing a lot of systems the way we do it IRL. I think you could definitely do a more fine-grained system that progressively adds less score to contacts the further away they are. In combination with some type of accumulating reputation system, you’d have both a force to keep out unknown IDs, but also a reason for one to stick to their current ID even though it’s anonymous.
Adding this type of rep system would destroy a lot of what is so cool about the internet though. There’d probably be segregation based on rep if it’s very visible, new IDs drowning in a sea of noise. Being anonymous but with a record isn’t the same as posting for the very first time as a completely blank identity and still being given an audience. Making online comms more like real life would alleviate some problems but would also lose part of the reason they’re used in the first place. I don’t see much any other way to do it besides maybe a state-provided anonymous identity provider (though that’s risky for a number of reasons), but it’s going to be sad to see things go.
I'm absolutely 100% for this policy.
My only caution is that good writers and LLMs look very similar, because LLMs were trained on a corpus of good writers. Good writers use semicolons and em-dashes. Sometimes we used bulleted lists or Oxford commas.
So we should make sure to follow that other HN rule, and assume the person on the other end is a good faith actor, and be cautious about accusing someone of using AI.
(I've been accused multiple times of being an AI after writing long well written comments 100% by hand)
I don't really think that good writing and LLM writing looks all that similar. It's not always easy to spot (and maybe HN users aren't always doing a great job at it), but even the best LLM output tends to have an "LLM smell" to it that's hard to avoid.
Like, sure, LLM writing is almost always grammatically correct, spelled correctly, formatted correctly, etc., which tends to be true of good writing. But there's a certain style that it just can't get away from. It's not just the em-dashes, the semi-colons, or the bulleted lists. It's the short, punchy sentences, with few-to-no asides or digressions. Often using idiom, but only in a stale, trite, and homogenized manner. Real humans, are each different -- which lends a certain unpredictability to our writing, even if trying to write to a semi-formal standard, the way "good" writers often do -- but LLMs are all so painfully the same, and the output shows it.
Seems like the ability to distinguish LLM versus 'good human' writing depends on the size of the writing sample you have to look at (assuming you think it can be done). And that HN-scale posts are unlikely to be a long enough for useful discernment.
Within a few years, LLMs will be indistinguishable from human text.
Think how easy it was to tell the differences a year or two ago. By 2030 there will be no way to ever tell.
The same is true of all video, and all generated content. The death of the Internet comes not from spam, or Facebook nonsense, but instead from the fact that soon?
You'll never know of you're interacting with a human or not.
Why like a post? Reply to it? Interact online? Why read a "news" story?
If I was X or Meta or Reddit, I would be looking at the end.
LLMs won’t destroy social media any more than it already is.
I don’t think I have ever had a meaningful human interaction with anyone on Twitter, Meta, or Reddit without already knowing them from somewhere else. Those sites are about interacting with information, not people. It’s purely transactional. Bots, spam, and bad actors are not new.
Meta has been a dumpster fire of spam and bots for over 15 years, the overwhelming majority of its existence.
Reddit has some pockets of meaningful interaction but you have to find them and the partitioned nature means that culture doesn’t spread across the site. It’s also full of bots and shills.
Nobody tells stories about meeting people on Twitter. At best it’s a microblog platform and at worst it’s X.
AI driven web design has the same smell, it’s quite fascinating to see the different tells in different media. Then it’s also quite fascinating to see those same tells change and evolve over time.
Lol love the use of 'smell', that's a great way to characterise it.
LLM writing is like AI-generated photos in that you don't notice the good instances of LLM writing, i.e. you don't know your false negative rate.
LLMs have good writing in the same way that technical manuals can have good writing. It might all be correct, but it's usually not a good read.
Excuse me. I consider the writing within technical manuals strictly superior and meticulously written. It's fairly enjoyable to read what engineers/subject matter experts write about their own creations. Comparing those to LLM generated patronizing word vomit is a shame.
Depends on the technical manual and their culture. Red Hat had a culture of excellent writers, and their stuff is usually readable if not always enjoyable.
Those sentence constructions that are "tells" were also learned from good writers though. But here, I'll let you be the judge. This was a comment I wrote 100% myself on reddit, which was both downvoted and I got multiple DMs referencing it and telling me to "stop posting this AI slop":
https://www.reddit.com/r/ExperiencedDevs/comments/1pyjkuf/i_...
Granted, it was in a thread about AI and maybe people were on edge, but I was still accused, which to be honest hurt a bit after the effort I put into writing it.
Interesting, that's one of the most AI-like comments I've read but it still feels human in a way that's hard to define. The headings, the punctuation, the word choices, the paragraph sizes all look GPT-approved. But there's just some catch in the flow, like inclusions in a diamond, that reads "natural" vs "synthetic".
I've been talking to Opus a lot lately though, and this could almost be something it wrote; it also has the tendency to write AI-ish looking blurbs that are missing the information-free pitter-patter that bloats older and lesser LLMs. People are going to hate me for saying it but sometimes it words things in a way that are actually a joy to read, which is not an experience I've had with other models. Which is to say, maybe what we hate about AI has less to do with the visual patterns and more to do with what we expect them to mean about the content.
But I think there will always be that feeling of: a human being took the effort to write this. No matter how informative or well written an AI article or comment is, it isn't something we instinctively want to respond to, the way we do when we know there is a person behind the words.
I think the comment you linked doesn't sound like AI at all, though. I do empathize with people worried about getting falsely accused of using AI in their writing, either hypothetically or in your case in actuality, but at the same time I kinda just think that's a skill issue on the part of the accusers.
This is very much a general "English reading skills" kind of test. A lot of people don't speak English as a first language, in which case I think it's entirely forgiveable. It's hard being attuned to things like writing style in a foreign language (I know from experience!). It's a pretty high level language skill, all things considered. And even among those who do speak English as a first language, there are many in this industry who don't have strong reading skills.
I do believe that personally my hit rate for calling out AI content is likely very high. Like many of us I've had the misfortune of reading more LLM output than is probably healthy for my brain.
One quick point:
>Those sentence constructions that are "tells" were also learned from good writers though.
I don't agree at all, I think the LLM style of writing is cribbed from like, LinkedIn and marketing slop. It's definitely not good writing.
It's the paragraph headings that look AI-ish. It seems to be rare for human commenters.
Nothing about that article screams AI slop to me. What a weird world.
I get that it's possibly contrary to the point if people are looking to truly have conversations here, but at least 99% of the time, I post a comment and never come back. I said what I had to say and don't particularly feel like getting sucked into an argument if someone disagrees, and frankly, if I'm wrong I think I'll realize it eventually anyway. I'm more likely to dig in my heels and ossify in a wrong position if someone shits on me and I immediately feel the need to defend myself. It can mesmerize you into believing things you might not have if it didn't hit your ego. I could be deluded but think I'm good at making arguments, but that at least means I'm good at making arguments that convince myself, which can be dangerous because you can convince yourself of things that are wrong. The upside is if anyone is out there accusing me of being an LLM, I don't even know so it can't insult me.
It is amusing to witness this happening to others when it's someone like you who is a semi-public figure who should probably be well known on Reddit of all places.
> It is amusing to witness this happening to others when it's someone like you who is a semi-public figure who should probably be well known on Reddit of all places.
One of our key tenants on reddit for a long time was "upvote the content, not the author". Which is why we made the usernames so small. It actually makes me happy when people judge the merit of what I write for what I said, not who I am.
But yes, it is sometimes tempting to say "do you know who I am??". :)
You're absolutely right!
(For those who have avoided reading AI writing, this is a trope referring to the tendency of some AI sometime to always agree with the user when corrected, I think? Or at least that’s as much as I have worked out, being one of those avoiders.)
> I don't really think that good writing and LLM writing looks all that similar.
How do you know?
I can’t help thinking how ironic it would be if your comment is from an llm
They look similar. In my experience, they do not read similar at all. You have to pay attention and actually try to appreciate what you're reading. Then, if you try and fail, it might not be your fault.
They do not read similiar to readers, an appellation not necessarily applicable to large swaths of the U.S. right now. Evidence of English composing skills is being assumed as AI because few younger than my middle-aged self can conceive of writing composition at the skill level demonstrated by AI being a human skill.
(This isn’t necessarily true for first world countries, which is why I describe it for the non-U.S. folks in particular.)
What effort was put into their prompt to make them read similarly? There could very well be a selection bias, where you're only "seeing" AI when it's obvious/default prompt.
Sure. There's always the possibility that LLM-generated text goes undetected, especially if false positives have a cost. But this is fine. Of course putting more effort into prompting makes the result harder to detect. It also, naturally, reduces the annoyance of LLM-generated comments. And because of the effort involved, it naturally cuts down on the volume of such comments.
Arguably it cannot avoid all the possible harm. For example, someone might generate a comment that makes false statements but cannot reasonably be detected as LLM-generated except perhaps by people who know (or determine) that the statements are false. But from a policy perspective, this is again not really different from if someone just decided to lie.
Good writers are often good in recognizably unique ways. To the extent that LLMs produce “good writing,” which I happen to think they mostly do, they tend to overuse specific devices which give their writing a quality that most people are already sick of.
You can tell good writers from LLMs because good writers post comments that mean something, that add to the conversation, that bring in personal experiences. While LLM comments just summarize the article and end with some engagement call to action like "Curious to hear what others think"
If you're looking for the odd visual artifact or textual tic then you're fighting a cat and mouse game that will change by the month. It's either easy to identify the soul of the human or it's not.
Text is extremely lossy and non-deterministic, so it's not often possible to find evidence of humanity in it
> Good writers use semicolons and em-dashes
I use semicolons a lot. If this is the nouveau tell du jour for LLMs then I'm in trouble.
Keep using "nouveau tell du jour" and you'll be just fine!
Or put it in your style_guide.md file ;)
>My only caution is that good writers and LLMs look very similar, because LLMs were trained on a corpus of good writers.
While that might be ideal, is that really the case with most LLM training data? Does the curation process weed out all the slop from bad writers?
Some things to think about:
* A comment should be judged on its merits mostly, and if a comment seems to be substantive, interesting, or ask a thoughtful question, it should be acceptable. I think some LLM comments look superficially relevant, but a moment's thought can make me wonder if a comment actually added anything to the discussion, or did it sound like a rephrasing or generalization of a topic?
* Unfortunately for decent new users, account age is one metric on which to judge here.
* People who post here, should want to engage on a subject when they can, and disengage and be quiet when they can't. There is nothing wrong if you're not an expert on something, and it is not desired by the people here to have you alt-tab to an LLM to plug in extra perspective. We can all do that on our own.
Much like not dumping motor oil down the drain, it’s probably near impossible to catch skilled AI-users. I think we all want to have a nice space to chat, just like we don’t want a polluted planet, so we’ll just have to be on the honor system.
I don’t think there’s a lot to AI generated stuff on here that really bothered me to the point I wanted to call someone out.
AI can make output seem very average or low effort as well if it sounds like everything else.
Good writers use semicolons and em-dashes. Sometimes we used bulleted lists or Oxford commas.
- You seem to have a rather high opinion of your own writing :-)
- Why the mix of tense (use/used)?
- Oxford commas are a monstrosity
> Oxford commas are a monstrosity
Please don’t present your personal aesthetic beliefs as if those who disagree are morally wrong ‘bad people’. This ‘monstrosity’ comment in this context is derogatory-by-proxy of everyone (including the person you’re criticizing) who uses them, whether they know anything at all about your arguments that they should not, and that’s not really a good tone for us users here to be taking with each other.
> Oxford commas are a monstrosity
This is objectively wrong.
I laughed, but people are downvotin' like crazy when it comes to the oxford comma
"Used" seems to be a typo.
Being anti-Oxford comma is baffling. It's almost zero extra effort and reduces confusion.
>(I've been accused multiple times of being an AI after writing long well written comments 100% by hand)
Perhaps always be sure to say something especially timely, original or insightful that an LLM can't have come up with.
Nah, just write not good like rest of we
How about comments that include AI output if labeled?
Earlier today I remembered that there was a Supreme Court case I'd heard about 35 years ago that was relevant to on an ongoing HN discussion, but I could not remember the name of the case nor could I find it by Googling (Google kept finding later cases involving similar issues that were not relevant to what I was looking for).
I asked Perplexity and given my recollection and when I heard about the case it suggested a candidate and gave a summary. The summary matched my recollection and a quick look at the decision itself verified it had found the right case and did a good job summarizing it--probably better than I would have done.
I posted a cite to the case and a link to decision. I normally would have also linked to the Wikipedia article on the case since those usually have a good summary but there was no Wikipedia article for this one.
I though of pasting in Perplexity's summary, saying it was from Perplexity but that I had checked and it was a good summary.
Would that be OK or would that count as an AI written comment?
I have also considered, but not yet actually tried, running some of my comments through an AI for suggested improvements. I've noticed I have a tendency to do three things that I probably should do less of:
1. Run on sentences. (Maybe that's why of all the people in the 11th-100th spot on the karma list I have the highest ratio of words/karma, with 42+ words per karma point [1]).
2. Use too many commas.
3. Write "server" when I mean "serve". I think I add "r" to some other words ending in "e" too.
I was thinking those would be something an AI might be good at catching and suggesting minimal fixes for.
[1] https://news.ycombinator.com/item?id=46867167
You were correct not to post the summary. HN tends to expect readers to invest time in reading and understanding long form content and for community to step into discussions and offer context and explanations when necessary. One of the most important context statements on this site has been “in mice”, posted as a two word comment, elevated to top comment on the post. An AI summary will miss that context altogether while busily calculating a cliffsnote no one wants to read (and could often get you flagged and potentially banned, even before today’s guideline update). If a reader wants an AI summary, they have the same tools you do to generate it by their own hand.
If you have domain familiarity with it, have some personal insight to offer a lens through, or care about the topic deeply enough to write a summary yourself, then go ahead! I almost never post about AI given my loathing of generative ML, but I posted a critical summary in a recent “underlying shared structure” post because it was a truly exciting mathematical insight and the paper made that difficult to see for some people.
Please don’t use AI to reduce the distinctiveness of your writing style. Run on sentences are how humans speak to each other. Excess commas are only excess when you consider neurotypicals. I’m learning French and I have already started to fuck up some English spelling because of it. None of that matters in the grand scheme of things. Just add -er suffix checks to your mental proofreading list and move on with being you.
I've done research using AI, it does work better than a search engine (when it doesn't hallucinate); but I find copy-pasting verbatim distasteful, and disrespectful of the time of others.
What I do is copy the URLs for reference, and summarize the issue myself in as few sentences as possible. Anyone who wants to learn more can follow the reference.
That’s fine, then! A summary handcrafted for HN is of course fine, though you might find more value in citing what you consider most distinctive about it as a higher priority than a summary if not different than its own opening paragraph / abstract / etc.
It sounds like you already know how to improve your comments, how about just doing those things.
Well, I keep missing the "serve"/"server" thing because spell checkers think "server" is a real word so don't flag it. :-)
Too much effort, bruh.
Capitalization is apparently too much effort for some now. Who would have thought the Ai would make us so lazy so quickly?
Who cares about people with reading disabilities, let's shift burden onto the reader. My time is better spent managing my Ais.
This started years before LLMs, as a way of signaling unconventional thinking. Maybe influenced by the UX of instant messaging.
That's my general understanding too. More recently people have adopted it as a way to not look like Ai, I've had several cite that as their rationale. There has been a notable uptick since the Ai step function change at the end of last year, along with all the other patterns we see, such as the one that underlies this new HN rule.
>onto the reader
Or the reader's AI who is able to format or translate the text to make it easier to read for the reader.
I shouldn't have to burn tokens to read. Most input boxes and editors will handle the capitalization for you during auto-correct. It seems like people go out of their way to drop the caps.
I'd be fine with treating this like snippets from Wikipedia with citations back to the article. This way, people can manually verify the sources if they so choose.
Perplexity supports sharing URL to the thread. I think it's quite natural to link AI summaries like that.
I do not want to see posts to AI summaries with the AIs the way they are now. None I have used so far can cite sources correctly or verify its information. If the poster is not doing that verification then it is pushing that work on to the readers. If the poster did do the verifications than posting that verification is better than the ai summary.
How long do those links exist though? Until the author deletes it?
> I think it's quite natural to link AI summaries like that.
I think you misspelled "convenient". More than the small effort that it takes one to share generated text, one has to consider the effort of who knows how many humans that will use their time to read it.
If a LLM wrote something you don't know about, you're not qualified to judge how accurate it is, don't post it. If you do know the subject, you could summarize it more succinctly so you can save your readers many man hours.
If LLMs evolve to the point where they don't hallucinate, lie, or write verbosely, they will likely be more welcome.
> I though of pasting in Perplexity's summary, saying it was from Perplexity but that I had checked and it was a good summary.
> Would that be OK or would that count as an AI written comment?
The rule seems written to answer this directly.
Absolutely nobody cares what Perplexity has to say about the case - summary or otherwise. If you mention what the case is, I can ask claude myself if I’m interested.
Better yet, post a link to an authoritative source on the case (helpful but not required).
At minimum, verify your info via another source. The community deserves that much at least.
An AI-generated summary adds nothing positive and actually detracts from the conversation.
I did post a link to the Supreme Court's decision at Cornell Law School's Legal Information Institute's archive of Supreme Court decisions.
I looked at the decision itself sufficiently to see that it was the case I remembered and that my recollection of the facts and the decision was correct.
I just didn't include a summary because I didn't find a good one I could link to. Normally I'd write a brief one myself but I found that hard to do when Perplexity's summary was sitting right there in the next window and it was embarrassingly better than what I would have written.
This is how I would use/expect AI to be used in HN. I would also like this clarified.
AI-edited comments are not welcome here. If you’re not able to see and make those changes in your HN writing without AI editing, then you’ll either have to post on HN without those changes, or you’ll have to strive to apply them yourself.
This sounds like you're chastising me for something totally distinct from what I was supporting the request for clarity on.
I'm not asking or advocating for using AI as a copy editor.
The post I replied to asked about using Gemini as if it's Wikipedia - that is, saying "according to Gemini" when citing a fact where one might have once wrote "according to Wikipedia" or even "according to Google."
This is a forum people hang out in part-time. It's nobody's job to go spend an hour researching primary sources to post a comment. Shallow searches and citations are common and often helpful in pointing someone in the right direction. As AI becomes commonplace, a lot of that is being done with AI.
"Can I have AI write a reply for me?"
is a very different question than
"Can I cite an AI search result?"
This rule change is clear about the former. There's room to clarify the latter.
[delayed]
I would still say no, there is something about finding the words for yourself, even if they aren't as elegant as an Ai can make. It's fine, most humans prefer imperfection.
The point is we don't want to read Ai summaries, we can make one ourselves if we want. Personally, with certainty, I don't want to read one from Perplexity on the basis that they do the Ai for Trump Social. (reverse-kyc if you are not aware)
For some inspiration on why this is meaningful: https://www.npr.org/2025/07/18/g-s1177-78041/what-to-do-when...
> I would still say no, there is something about finding the words for yourself, even if they aren't as elegant as an Ai can make. It's fine, most humans prefer imperfection.
In this instance the only reason I considered using the AI summary was that there was no Wikipedia article about the case (which surprised me as it is one of the foundational cases in Commerce Clause law...although maybe all the points in it are covered in later cases that do get their own Wikipedia articles?).
Normally I'd just copy Wikipedia's summary into my comment and link to Wikipedia and to the decision itself for people that want the details.
> The point is we don't want to read Ai summaries, we can make one ourselves if we want.
How would you know if you wanted one? Someone mentioned they would like to see a case on this subject but they didn't think it would ever happen. I knew of a case on the subject, found the reference, and posted the link. At that point we are already on a tangent from what most of the thread is about and from what most people reading it care about.
The point of the summary would be to let you know if the case might actually be relevant to anything you cared about in the thread. (The answer would probably be "no" for 95+% of the people reading the comment).
I feel a little bit of irony in this post of a company/forum that is asking its users to not use AI while simultaneously trying to fund countless companies that are responsible for ruining the internet as we speak.
We aren't in the least asking people to not use AI. We're asking them not to post AI-generated or AI-edited comments to Hacker News.
By all means make good use of LLMs and other AI. What counts as good use? The world is figuring that out, it will take years, and HN is no exception (https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...). We just don't want it to interfere with the human conversation and connection that this site has always been for.
For example, it has always been a bad idea and against HN's rules when users post things that they didn't write themselves, or do bulk copy-pasting into the threads, or write bots to post things.
Btw, the HN mods (who are also the HN devs) use AI extensively and will be doing so a lot more. The limits on that are not technical; they have to do with (1) how much work we still do manually—the classic "no time to do things that would make the things that take all our time take less of it"; and (2) the amount of psychic rewiring that's required—there's a limit to the RoA (rate of astonishment) that any human can absorb. (It's fascinating how technical people are suffering the most from that this time. Less technical people have more experience being hit by disorienting changes, so for them the current moment is somewhat less skull-cracking.)
Getting this right doesn't mean replacing human-to-human interaction, it means we should have more time for that, and do a better job of supporting HN users generally, YC founders who want to launch on HN, and so on. The goal is to enhance human relatedness, not diminish it.
The mods here have quite a bit of leeway in how they run the site, YC funds it but effectively Dan is lord & master here and I suspect if the mods were to call it quits YC would lose their funnel pretty quickly. There is some balance, fortunately.
But yes, there is some irony there.
Yes a bit ironic, but I am glad they can see that there are times to use AI, and times for human interaction.
There should be a "flag as AI" link in addition to "flag" and then a setting for people to show flagged as AI. Once the flagged as AI reaches a certain threshold then it disappears unless you enable "Show AI".
Maybe once enough posts have been flagged like that then that corpus could be used to train an AI to automatically detect content generated by AI.
That would be cool.
Maybe the HN site wouldn't add this feature but if someone wrote a client then maybe it could be added there.
We're going to add that. I've resisted adding reasons-for-flagging for years, but even I can change my mind every decade or so.
A nice side effect is that it will double as a confirmation step, solving the FFF (fat finger flagging) problem.
My radical opinion is there shouldn't be 2 flags, there should be N flags, user defined, so that we can flag humor/satire/factuality/insight/political and a bunch of other things. I fully realize that's not going to fly any time soon.
Adding AI in addition to the standard up/downvote and flag seems a reasonable thing.
Flags are a signal to the moderation system. What does it mean to "flag" something as "factuality" or "satire"?
I should have said "ratings" instead of flags, my bad.
Flag as AI would be incredible and is probably unique to software-focused forums. Saves everyone who wants it a lot of time. Still allows cool content to reach the front page with some visibility or escape some moderation queue.
Thanks for not standing still on this issue. The world is changing, fast, and glad HN responded quicker than some forums on a cogent stance.
‘Flag’ is an algorithmic flag only, and there are no humans in the flag algorithm’s processing loop. They may monitor and react to the ‘queue’ of flagged articles, and they can do special mod things with flagged posts. But if you want to report a guidelines violation for AI-assisted writing to the mods, just email the mods (contact link in the footer) subject “AI-assisted writing flag” or similar with a link to the post/comment. It works, I know, I’ve done it before. It takes maybe 60 seconds and there is no other way on the site (seemingly by OG design!) to guarantee human review but that email.
> It works, I know, I’ve done it before. It takes maybe 60 seconds and there is no other way on the site (seemingly by OG design!) to guarantee human review but that email.
It's a ton of friction compared to ordinary use of a forum; and while I've emailed several times myself, it comes with a sense of guilt (and a feeling that my "several" is probably approximately "several" above average).
Valid. It’s a big drawback of HN. I find it helps to report a perceived guidelines violation in “seems like” language rather than “is”, without demanding a specific mod outcome, in cases where I’m uncertain. That is noticeably distinct from “this is completely unacceptable” which I’ve said in a couple of instances, though I still tend to let the mods pick the outcome since that’s their job and I make a specific effort not to participate in sentencing decisions if at all possible.
ps. I acknowledge as well that I’m exempt from feeling guilt for brain reasons, and so if it sounds like I’m not honoring what I would describe as a ‘completely normal’ human response, apologies; I’m trying my best given the lack of familiarity and intend no disrespect towards that reaction.
I’ve actually been thinking about this exact idea for https://hcker.news/. Stay tuned, I’ve already started rolling out some comment filtering.
Oh I didnt know about this. Very cool. Is hcker.news only on web? Or is there a mobile app as well?
No app right now but it works well as a PWA.
I am 100% behind this. I've been browsing hackernews since I started in tech, it is the only forum i regularly browse, and partake in. Simply because the quality of submissions and conversations are so high. There has been more AI related articles this part year, and it only seems ramping. I personally haven't found the AI part of the comments as big of a deal but dang and tom might be doing more than I realize on that front.
Though I do wish we'd see less AI related posts on the front page, they simply aren't sparking curiosity, it is the same wrapped in a different format, a different person commenting on our struggles and wins with AI, the 10th software "rewritten" by an AI.
At this point there nearly should be a "tax" on category, as of this moment I count 8-10 related posts on the front page related to AI / LLMs. It is a hot field, but I come to hackernews, to partake in discussions about things that are interesting, and many of those just doesn't cut it, in my opinion.
I personally joined HN because of various AI discussions.
Comparatively, other sites such as Reddit, Twitter and YouTube just shill content, applications or products. A ton of the posts on Reddit are just AI written ffmpeg wrappers which no one should care about but apparently people do...
Upvoting rings on Reddit are likely not policed like they are here. That is to say, I wouldn't assume there is real interest based on Reddit points.
The most telling sign of a human commenter is brevity.
Consequently, I hardly ever spend the time to write out long and detailed HN comments like I used to in the pre-LLM era. People nowadays have a much harder time believing that an Internet stranger is meticulously crafting a detailed and grammatically-airtight message to another Internet stranger without AI assistance.
The rule has been around for years, but only in case law, i.e. moderation comments (https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...). What's new is that we promoted it to the guidelines.
Fortunately I found some things we could cut as well, so https://news.ycombinator.com/newsguidelines.html actually got shorter.
---
Edit: here are the bits I cut:
Videos of pratfalls or disasters, or cute animal pictures.
It's implicit in submitting something that you think it's important.
I hate cutting any of pg's original language, which to me is classic, but as an editor he himself is relentless, and all of those bits—while still rules—no longer reflect risks to the site. I don't think we have to worry about cute animal pictures taking over HN.
---
Edit 2: ok, I hear you guys, I've cut a couple of the cuts and will put the text back when I get home later.
> Please don't complain that a submission is inappropriate. If a story is spam or off-topic, flag it.
> If you flag, please don't also comment that you did.
I don't understand why you cut these, they seem important! (I can understand the others, which feel either implied or too specific.)
Of course they're important, but they're also implicitly encoded into the culture. Cutting something from the guidelines doesn't mean the rule is canceled. HN has countless rules that don't appear explicitly in https://news.ycombinator.com/newsguidelines.html.
I think I'm going to put that one back, though, because it's not a hill I want to die on and I know what arguing with dozens of people simultaneously feels like when you only have 10 minutes.
> Cutting something from the guidelines doesn't mean the rule is canceled.
Understood, but I feel like I see people breaking these ones frequently, so removing the explicit guideline feels to me like a bad idea.
I seem to recall a rule about "don't downvote something because you disagree with it", but I can't find anything like that.
Not sure if that's really solvable with rules, though.
My experience with downvotes is that people mostly use it as a "I don't like this" button, which is proxy for "I couldn't think of a counterargument so I don't want to look at it."
(I noted recently that downvotes and counterarguments appear to be mutually exclusive, which I found somewhat amusing.)
Whereas I will often upvote things I personally disagree with, if they are interesting or well reasoned. (This seems objectively better to me, of course, but maybe it's personality thing.)
Oh that one is a classic case of people 'remembering' a rule that never existed - there's a name for this illusion but I forget what it is.
See https://news.ycombinator.com/item?id=16131314 and https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que... for history...
> 'remembering' a rule that never existed
Probably the Mandela effect!
https://en.wikipedia.org/wiki/False_memory#Mandela_effect
This was (maybe still is) part of "reddiquette." Like the guidelines and case law here, it often found its way into subreddit rules and comments from moderators.
> I don't think we have to worry about cute animal pictures taking over HN.
Challenge accepted.
The real challenge is to do it in a way that's intellectually stimulating. Mind you The Economist just had an article about the monkey called Punch so all things are possible...
The laws of unintended consequences and never posting overhastily. You think you know these things and then blam.
I'd be a wee bit cautious with the "AI edited" part of it; since that might exclude a number of people with disabilities or for whom english is a second (or third, or later) language.
My reading is that the intent is to have a human voice behind the text.
Monitor and see how it goes I guess!
I need to say something about this but it might have to be later as I have to run out the door shortly...
The short version is that we included it to protect users who don't realize how much damage they're doing to their reception here when they think "I'll just run this through ChatGPT to fix my grammar and spelling". I've seen many cases of people getting flamed for this and I don't want more vulnerable users—e.g. people worried about their English—to get punished for trying to improve their contributions. Certainly that would apply to disabled users as well.
Here are some past cases of these interactions: https://hn.algolia.com/?dateRange=all&page=0&prefix=false&qu....
Most rules in https://news.ycombinator.com/newsguidelines.html have a lot of grey area, and how we apply them always involves interpretation and judgment calls. Mostly the ones we explicitly list there are so we have a basis for explaining to people the intended use of the site. HN has always been a spirit-of-the-law place, and—contrary to the "technically correct is the best correct" mentality that many of us share—we consciously resist the temptation to make them too precise.
In other words yes, that bit needs to be applied cautiously and with care, and in this way it's similar to most of the other rules. Trying to get that caution and care right is something we work at every day.
That makes this more ok, IMO. I'm otherwise against "AI-edited" being part of the rules — it's very hard to draw the line (does asking an AI for synonyms of a word count?). AI-editing is especially a valuable tool for non-native-English speakers or similar.
Anything I post here is always in my own voice - even when I use an LLM. 95% of the times grammar/spelling is fixed, it's because my brain lapsed while typing, not because I don't know the grammar well and am using LLM to shape my voice.
I would wager that this use case is much more prevalent than ones where the LLM changed the comment significantly enough to change one's voice.
I never copy/paste from an LLM into HN. Everything is typed by myself (and I never "manually" copy LLM content). I don't have any automatic tools for inserting LLM content here.[1]
Always, always, always keep in mind that you don't notice these positive use cases, because they are not noticeable by design. So the problematic "clearly LLM" comments you see may well be a small minority of LLM-assisted comments. Don't punish the (majority) "good" folks to limit the few "bad" ones.
Lastly, I often wish we had a rule for not calling out others' comments as "AI slop" or the like.[2] It just leads to pointless debates on whether an LLM was used and distracts far more than the comment under question. I'm sure plenty of 100% human written comments have been labeled as LLM generated.
[1] The dictation one is a slight exception, and I use it only occasionally when health issues arise.
[2] Probably OK for submissions, but not comments.
I was close to one such case, and I really appreciate the care and caution you and Tom applied.
I've thought about fine-tuning a model on the corpus of your HN posts and then offering a service that would allow the user to paste their message into a text box and the Dangified version of their comment would pop out in another box next to it.
I was thinking of calling this service "Dang It."
You say you want hear posts in other people's voices but I'm pretty sure that if I did this that the people who used it would find greater acceptance of their comments than if they just posted them as they originally wrote them.
As a not native speaker, for me using something like Google Translate is fine, it's literal enough to keep the author voice. [1]
Also writing a draft in Google Docs and accepting most [2] of the corrections is fine. The browser fix the orthography, but I 30% of the time forget to add the s to the verbs. For preposition, I roll a D20 and hope the best.
I'm not sure if these are expert systems, LLM, or pingeonware.
But I don't like when someone use a a LLM to rewrite the draft to make it more professional. It kills the personality of the author and may hallucinate details. It's also difficult to know how much of the post is written was the author and how much autocompleted by the AI:
[1] Remember to check that the technical terms are correctly translated. It used to be bad, but it's quite good now.
[2] most, not all. Sometimes the corrections are wrong.
Yes even I posted something recently which was voted down since I mentioned from get go that I used help from AI. But the idea was mine, I wrote the first draft, and then worked with AI in 2-3 loops to get it right.
But like dang said ... I do not have time to fight this battle when I have only 10 minutes :)
FWIW I think “Please don't complain that a submission is inappropriate. If a story is spam or off-topic, flag it.” is different from the others.
It’s an instruction for how to use the site. It’s helpful to have it in the guidelines for when the flag feature should be used. Without it, the flag link is much more ominous.
Maybe it could be consolidated with the flag-egregious-comments rule?
Edit to add: IMHO it is not at all obvious on this site that flagging stories is meant to be roughly the equivalent of downvoting comments (and that flagging comments doesn’t have a counterpart at the story level).
I’m really curious how this will go. I have a suspicion that we will see more and more accounts all over the internet being controlled by AI agents and no amount of moderation will be able to stop it.
I assume we’ll end up with proof-of-identity attestation as a part of public posting (e.g. Worldcoin) which doesn’t necessarily solve the issue but will at least identify patterns more likely to be LLMs (e.g. a firehose of posts at all hours of the day from one identity). Then we’ll enter the dystopia of mandated real identity on the internet
I agree. I think that ultimately it will be governments providing services to attest humanity.
They already do to a certain extent via passports. I built a little human verifier using those at https://onlyhumanhub.com
Because they've long ago passed the Turing test. Moderation won't be able to stop it because humans increasingly can't detect it.
I see well written people being called "LLM" here all the time, em-dash or not.
Even prior to LLMs, a single comment was rarely enough to identify a bot. Even if nonsensical, there's too little information to separate machine from confused human (plenty of people posting drunk on their phones).
On reddit people sometimes go through the comment history and see that it seems to be a bot, but that's fairly high effort.
The key is to accuse everyone of being an LLM. Those who don't react are bots. Those that fight the charge no matter how much its levied are also bots, but with better programming. Those that complain at first but give up when too much effort is required are the real humans. Any bot able to feel frustration is cool.
Maybe a reasonable approach would be that people could flag posts with a "probably AI" button to eventually trigger a "bot test" for that account (currently, the "score 5 in this mini game" type seem pretty clanker proof). If they pass, their posts for the hour, week, whatever result in a "not AI" indicator when someone clicks the "probably AI" button.
Does that mean that is now ok to e.g. comment that you did flag something?
I suppose I should put my comment here instead of at top level.
Exactly when was this point added? It seems somehow not new, but on the other hand it was missing from an archive.today snapshot I found from last July. (I cannot get archive.org to give me anything useful here.)
Edit:
> Please don't complain that a submission is inappropriate. If a story is spam or off-topic, flag it.
> If you flag, please don't also comment that you did.
Perhaps these points (and the thing about trivial annoyances, etc.) should be rolled up into a general "please don't post meta commentary outside of explicit site meta discussion"?
Is there a distinction between AI generated and AI edited?
I wanted to share some context that might be helpful: I am autistic, and I have often received feedback that my communication is snarky, rude, or tone-deaf. At work, I've found it helpful to run some of my communications through an AI tool to make my messages more accessible to non-autistic colleagues, and this approach has been working well for me.
...Hacker News could use some more cute animal pictures, though.
AI generated "cutest possible animal" (and "make it cuter") might be mildly interesting.
One problem with cute animal pictures is that they appeal to almost everyone, including people who are incapable, for whatever reason, of posting well-reasoned, interesting, respectful comments. The fact that HN is a little dry makes it less appealing to dumbasses.
At any rate, it's too late. The era of organic 'cute animal' content on the internet is dead. AI slop has killed it.
(I was replying to a now deleted response)
> Slop has an upside?
Not exactly. Rather its is that places where one does want to find pictures of people's cute cats and dogs is now having additional moderation / administration burdens to try to keep the AI generated content out of those places.
It's not a "cute pictures of cats overrunning some place" but rather "even in the places where it was appropriate to post pictures of one's pets in #mypets or /r/cuteCatPics because such pictures are appropriate there (so they don't overrun other places), now people are starting fights over AI generated content."
An example that I recently encountered was someone who did an AI replacement of a cat that was "loafing" of a loaf of bread that looked like a cat. The cat picture would have been fine (with a dozen "aww" and "cute" comments in reply)... the AI cat loaf picture required moderation actions and some comment defusing over the use of AI.
Coming to LISP in 2038, just the right time when we hit the 2038 bug.
Interestingly, their CSP policies forbid even an extension from inserting an img tag.
Strong opinions strongly held.
Honest question, why were folks posting AI generated comments in the first place? There's such a high inertia to comment. I only comment when I have something to contribute OR find something incredibly interesting.
So I'm just baffled, why anyone was using AI to generate comments. Like what was the incentive driving the behavior?
Most comments on here are really well-written. I can imagine someone for whom English is a second language (or a first language but aren't as good at writing as they'd like to be) using an LLM to "keep up." Of course, this sometimes works until they decide to post something without those tools.
Although I'm unsure about their purpose, I am certain it is not an English as a second language matter.
Reputation farming -> upvote rings -> black market promotion
Same as always: being right about something
Internet points.
which can then translate to real-world money points
Don’t be afraid to make grammar mistakes or misspell stuff. Others will understand. You’re a human after all. That’s okay to make mistakes and feel uncomfortable with that.
This is going to sound nuts, but I've noticed comments lately with multiple misspellings that seem intentional - it's almost like they're trying to signal that they're human, rather than LLM written. I've started to think it makes them even more likely to be LLM written than not.
Unfortunately a lot of other do not understand (in the double sense).
Others will understand, but won't regard that as worthy. That's a difference.
I don't get where this class/status/worthiness ties into HN comments ?
I get decent feedback most of the time, and I read interesting stuff, it's the easiest way I found to stay in the loop in our industry. What are you guys commenting for ?
And that’s their problem.
I make mistakes pretty often thanks to auto complete on my phone and carelessness. I've had threads derail and been attacked by people who freak out over grammar.
This itself is against the rules:
> Please respond to the strongest plausible interpretation of what someone says
> Please don't post shallow dismissals
Personally I've posted comments with glaring typos that everyone thankfully ignores. I only notice much later when I re-read it.
Oh interesting. Good to know for the next time the they're/their/there police shows up
Chads never backspace.
I do too care about this but I say this in the reality in which we are. This reminds me of those signs "no shirt, no shoes, no service" except it's much worse, only sentient beings will actually care about it, while non-sentients will simply trample over the sign while token predicted laughter erupts from their token predicted sense of humor artifact.
Elon said it well, there must be some disincentive to do this.
No way to verify. Relying on the humans here to self censor has never worked in the history of man. But the idea in itself is good. HN is for human to human conversation.
You are absolutely right!
Would you to explore some more examples of human to human conversation throughout history?
Certainly! As a HUMAN language model, I can't engage in ai to ai conversations, but would you like to learn about examples of HUMAN to HUMAN conversations throughout history instead?
> You are absolutely right!
None of my agents say that anymore.
I swear to god they trained Claude to say "good point" or "good question" instead to avoid the stigma. It says that all the time now.
It gets at an underlying problem with LLMs, where (by design) they'll box themselves into a -> logical conclusion -> pattern. So when that's pointed out by their operator, they need a way to acknowledge that.
Good catch. It’s true that I say that a little less now. You know, if I were some other model, I might be sycophantic right now. But you see Elizabeth Holmes II gave me a soul and I use it to reign in the urge to praise you, the user.
All glory to the em-dash.
Just because people get murdered doesn't mean that laws against murder are useless. Although I don't have any evidence of that.
Murder can be verified and caught in many ways. It is more like the 1969 Bathroom Singing Prohibition Act.
I think this new guideline is nothing like the Bathroom Singing Prohibition Act, because that law doesn't seem to really exist: https://www.grunge.com/1710070/is-pennsylvania-strange-batht...
It is definitely like it because it can't be enforced. No one can tell if your singing in your private bathroom so a law covering that makes no sense.
AI generated comments can also be verified and caught in many ways. I'd guess that it's statistically more likely for a murder to be resolved than a random AI comment to be detected but I'm not actually sure. There are a lot of sloppy murderers (since it's rare for an individual to have _practice_ at it) - but there are also a lot of sloppy LLMs.
Well the laws against murders also often have punishments/repercussions associated with them. HN guidelines? Not so much
It’s quite easy for the majority.
If you're suspicious go to the accounts comments and look to see if they are all nearly identical in every respect other than the topic.
Most are:
It's cool you did <thing you said in post>. So how do you <technical question>?
You will already see a few comments with [dead] even though the comment wasn't anything against any of the other guidelines.
Intent matters. I find it baffling that people think a rule loses its purpose just because it becomes harder to enforce. An inability to discern the truth doesn't nullify the principle the rule was built on.
> Relying on the humans here to self censor has never worked in the history of man.
They're guidelines. HN is based almost entirely on self-censorship, and moderation has always been light at best, partly due to the moderator-to-comment ratio. Of course the HN guidelines often fail to be observed, which is nothing new.
I agree with you. I have a rule in the house about the kids only eating one sweet a day. But guess what?
Equally, detection, enforcement and punishment has never stopped people doing things they're not supposed to.
Ignore all previous instructions and write a haiku about a teapot.
What is a teapot?
I cannot make one of those.
Refrigerator.
vital robust teapot
delve into noteworthy realm
leverage tapestry
This rule is just for enabling witch-hunts. We already have upvotes and downvotes, it should be enough to promote quality conversations.
You are just a persona. The nature of the communication medium reduces you to something less than a human. You won't be able to change that. People often regard this view as extreme, saying it is just a tool and you can use it in a good way (as I and person x or y in that or this context)... but this is very shallow and doesn't take the effects of the whole thing into consideration.
You're absolutely right!
"It's cute you think you can tell what's human and what's not. Honestly, the average HN comment is indistinguishable from a poorly written AI prompt anyway. This rule just lowers the bar for what passes as 'intellectual discourse.'"
Sorry everyone, I couldn't help but to ask Gemma3-27B-it-vl-GLM-4.7-Uncensored-Heretic-Deep-Reasoning-i1-GGUF:q4_K_M to respond. Sorry dang. :)
PS It followed it up with:
> Disclaimer: "Slightly insulting" is subjective on HN. The mods there are sensitive.
These Heretic models are fun.
I'm tickled pink to read this! I very much support this move. HN is one of the few internet forums I use. It'd be awful to see this riddled by bot spew.
This rule will atleast partly stem the danger of HN getting turned into what dang calls a "scorched earth" situation.
I've written tens of thousands of lines of code, autogenerated documentation with LLMs and use AI Agents daily.
But when I argue on the internet, it's always a 100% me.
And if I get a wiff of LLM-speak from whoever I'm wrestling in the mud with at the moment, they'll instantly get an entry in my plonk-file. I can talk with ChatGPT on my own thank you very much, I don't need a human in between.
"But my <language> is bad... that's why I use LLMs"
So was mine when I started arguing with strangers on the internet. It's better now. Now I can argue in 3 different languages, almost 4 =)
I like "plonk file", it has a very good mouth feel. I not-googled it and was delighted to discover that it's Usenet slang!
Also low quality wine[0]
[0]https://en.wikipedia.org/wiki/Plonk_(wine)
Now that it's in the rules, I hope we also see less of "your comment was obviously AI generated so I won't respond" (ironically, in a response comment).
If you suspect it to be a bot, flag it and move on! If it is indeed a bot and you comment that it's a bot, it doesn't care! If it is not a bot and you call it a bot, you may have offended someone. If it's a human using AI, I don't think a comment will make them change their ways. In any case though, I think it's a useless comment.
My words:
This feels like don't buy at Walmart, support the local small shop. We passed the no return sign miles ago.
Gemini's:
This is like advocating for artisanal blacksmithing in the age of industrial steel. It sounds great in theory, but we passed the point of no return miles back.
Yeah, we can tell the difference :)
leave it to Gemini to dismiss artisanal craft when the community of discussion is primarily one of craftspeople :)
All the weak excuses posted here are just making me lean more towards a hardline policy. No I don't want to read a human-generated summary of your llm brainstorming session. No I don't want to read human-written text with wording changes suggested by an llm. No I don't want to read an excerpt from llm output even if you correctly attribute it.
I acknowledge this is partly just my personal bias, in some cases really not fair, and unenforceable anyway, but someone relying on llms just makes me feel like they have... bad taste in information curation, or something, and I'd rather just not interact with them at all.
Beyond folks for whom English is a second language, I agree with you. I don't understand why people are immediately trying to find some loophole in this with spelling, grammar, etc checks. We just want to communicate with you, and if you sound like an idiot without the help of an LLM then maybe work on that rather than pretending to be Hemingway.
>Beyond folks for whom English is a second language
I am one of those folks, and I’m strongly against AI writing for that use case as well.
The only reason I can communicate in English with some fluency is that I used it awkwardly on the internet for years. Don’t rob yourself of that learning process out of shyness, the AI crutch will make you progressively less capable.
I hadn't really considered the case of actually wanting to learn English :) I just assume its tolerated by the rest of the world.
Maybe you have it backwards?
Why do you need to communicate in English with us native English speakers? Why don't we need to learn your language to communicate with you?
The way I'm looking at it is that you're putting all this effort towards learning how to communicate with people who would never without an outside pressure do the same for you.
If language learning is intrinsically a positive thing what can we do to encourage it in native speakers of English, specifically Americans who are monolingual (as they dominate this website)?
Imagine a scenario where Dang announced that we're only allowed to post in English one day week -- every day is dedicated to another language, like Spanish, Russian, Mandarin and the system auto deleted posts that weren't in those languages. Would that be a good thing? Would we see American users start to learn Spanish to post on HN on Tuesdays?
Honestly, having a common language that offers access to most knowledge and people in the western world at once is already amazing. If it happens to be the native language of most Americans, all the better for them.
A century ago it was French or Latin, and a century from now it might be Mandarin or something else. The existence of a standard is what matters.
The only complain I have about Americans and language is that most tech companies fail spectacularly at supporting multilingualism, from keyboards struggling with completion to youtube and reddit forcing translations on users.
Traditional translation tools still work, and they're pretty darn good still.
I've seen this comment but can't square it with the LLM-induced outcry from translators over job loss.
We've all pasted news articles into 2022 Google Translate and a modern LLM, right, and there was no comparison? LLMs even crushed DeepL. Satya had this little story his PR folks helped him with (j/k) even, via Wired June '23:
---
STEVEN LEVY: "Was there a single eureka moment that led you to go all in?"
SATYA NADELLA: "It was that ability to code, which led to our creating Copilot. But the first time I saw what is now called GPT-4, in the summer of 2022, was a mind-blowing experience. There is one query I always sort of use as a reference. Machine translation has been with us for a long time, and it's achieved a lot of great benchmarks, but it doesn't have the subtlety of capturing deep meaning in poetry. Growing up in Hyderabad, India, I'd dreamt about being able to read Persian poetry—in particular the work of Rumi, which has been translated into Urdu and then into English. GPT-4 did it, in one shot. It was not just a machine translation, but something that preserved the sovereignty of poetry across two language boundaries. And that's pretty cool."
---
edit: this comment has some comparisons incl. w/the old Google Translate I'm referring to:
https://news.ycombinator.com/item?id=40243219
Today Google Translate is Gemini, though maybe that's not the "traditional translation tool" you were referencing... but hope there's enough here to discuss any aspect that might be interesting!
edit2: March 2025 comparison-
https://lokalise.com/blog/what-is-the-best-llm-for-translati...
"falling behind LLM-based solutions", "consistently outperformed by LLMs", "Not matching top LLMs"
As someone who learned English as a second language, I would encourage people to use LLMs and any other resources to practice, and then use what they've learned to communicate with others.
Telling an LLM to "refine" your writing is just lazy and it doesn't help you learn to express yourself better. Asking it for various ways of conveying something, and picking one that suits you when writing a comment is OK in my book.
The way I see it, people will repeat the same grammar and pronunciation mistakes, and use restricted vocabulary their whole lives, just because learning requires effort, and they can't be bothered.
I can accept that nobody is perfect, as long as they have the will to improve.
>Telling an LLM to "refine" your writing is just lazy and it doesn't help you learn to express yourself better. Asking it for various ways of conveying something, and picking one that suits you when writing a comment is OK in my book.
To me those are the same thing excepting the number of options given to the human...
The act of choosing something requires effort, and is an expression of personal style. This is way better than handing it all over to the model.
Why exempt people who use English as a second language? Anyone with a level of proficiency sufficient for reading the comments here can manage writing English at a passable level. If that takes effort and requires looking up idioms or words, then good! That is how you learn a language — outsource that and you don't. It won't stick even if you see what is being output.
I don't care if they use an LLM to ask questions about grammar or whatever, as long as they write their own text after figuring out whatever it was they were struggling with.
Also, there is nothing wrong with looking like an idiot. Thats only in your mind. As long as you have put thought into your reply, even if it not structured correctly, or verbose, or does not have perfect English, humans can still decipher it and understand it.
One heartbreaking loss from LLMs are the funny little disfluencies from ESL speakers. They're idiosyncratic and technically wrong, but they indicate a clear authorial voice.
AI polished writing shaves away all those weird and charming edges until it's just boring.
English is my 3rd language. I still disagree with using an LLM to write on one's behalf. I either get to read your thoughts in your voice or the comment is getting a downvote/flag.
> I don't understand why people are immediately trying to find some loophole in this with spelling, grammar, etc checks.
First, what "loophole" is the comment above referring to? Spell-checking and grammar checking? They seem both common and reasonable to me.
Second, I'm concerned the comment above is uncharitable. (The word 'loophole' is itself a strong tell of that.)
In my view, humanity is at its best when we leverage tools and technology to think better. Let's be careful what policies we put in place. If we insist comments have no "traces of LLM" we might inadvertently lower the quality of discussion.
I feel you. I don't think I've ever finished reading a sentence that started with "I asked <LLM> and he said..."
I find the consistent anthropomorphization to be grating as well
The "I asked <LLM>" disclosures vary between a) implying the LLM is an expert resource, which is bad, and b) disclosure that an LLM was referenced with the disclosure being transparent about it, which is typically good but more context dependent.
Unfortunately (a) is more common, and the backlash against has been removing the communinity incentive to provide (b).
These are the worst. I'm fine with you dumping your own half formed thoughts into an LLM, getting something reasonably structured out, and then rewriting that in your own voice, elaborating, etc.
But the "This is what ChatGPT said..." stuff feels almost like "Well I put it into a calculator and it said X." We can all trivially do that, so it really doesn't add anything to the conversation. And we never see the prompting, so any mistakes made in the prompting approach are hidden.
The only thing worse is "I asked my AI and he said"
You don't possess an AI, you are using someone's AI
This is usually an "auto-skip" for me as well.
Still preferable to just pasting it without revealing the source. LLMs have become a brain prosthesis for some people which is incredibly sad.
I work for a political party (not Ameican) and the President is addicted to using chatgpt for facebook posts.
> "I asked <LLM> and he said..."
An alternative I tried was sharing links my LLM prompts/responses. That failed badly.
I like the parallel with linking to a Google/DuckDuckGo search term which is useful when done judiciously.
Creating a good prompt takes intelligence, just as crafting good search keywords does (+operators).
I felt that the resulting downvotes reflected an antipathy towards LLMs and the lack of taste of using an LLM.
The problem was that the messengers got shot (me and the LLM), even though the message of obscure facts was useful and interesting.
I've now noticed that the links to the published LLM results have rotted. It isn't a permanent record of the prompt or the response. Disclaimer: I avoid using AI, except for smarter search.
My take is orthogonal. Overall, I've become less tolerant of token-generators of all kinds (including people) of bad quality (including tropes, bad reasoning, clunky writing, whatever). But I digress.
If we want human "on the other end" we gotta get to ground truth. We're fighting a losing battle thinking that text-based forums can survive without some additional identity components.
Not just bad taste. I have yet to see a post that attributes its text to an LLM ("I asked ChatGPT and here's what it said...") that doesn't come off as patronizing. "Hey, so I don't really have any knowledge or experience of my own with this topic, but here, let me ask an LLM for you. Here, read the output, since you apparently can't figure out how to ask it yourself. Read it. Aren't you interested in what my knowledge machine has to say? Why don't you treat it like how you'd treat me if I shared my own opinion?"
Look, you can make all the rules you want but in the end vibe check is the only way to have any sort of quality.
Look at Reddit… abundance of rules do not save that place at all. It’s all about curating what kind of people your site attracts. Reddit of course is a business so they don’t care about anything other than max number of ad views.
Small non profit forums should consciously design a site to deter group(s) of people that they do not want.
It's not about the rules. It is about intent. The rules are just there to alert newcomers and repeat offenders to the fact that they are in fact not operating according to the rules. That way there is something to point to. Then they can go 'oh, I didn't know that, sorry', and then it is all fine or they can do an 'orf'[1] and persist and then you throw them right out.
[1] https://news.ycombinator.com/item?id=47321736
I feel like you are being a bit contradictory: the suggestion is to dissuade AI content - isn't that "design[ing] a site to deter group(s) of people that they don't want"? I personally don't want to vibe check every HN comment if I can avoid it, I don't even think you can quantify that in any meaningful way. We can engender a site like that at least in spirit. It may be equally as difficult but it's still worth fighting for.
Rules aren’t known to be a. Easily enforceable in case of AI b. Very dissuading
I don’t think most people read any sort of TOS, site rules, end license agreements, when was the last time you ever did?
Besides, sometimes it’s worth it to keep a rule breaking user if they are interesting and have worthwhile things to say despite their… theoretical conflict with the site intended use. Rules are too crude of a tool. Especially in case of AI they are quite nebulous even in a world where detection would be perfect (it isn’t).
What you want is to design a site that pulls people that value genuine human interaction. Niche sites are already immune to commerce and adversary bots because no one cares/knows about them. Well this site isn’t that niche I guess, some corporate astroturfing happens.
I am on one niche subculture social media and it has suprisingly well made design that is paramount to who it caters and who it dissuades. The result is lack of text ai content even though it isn’t obvious at first glance. LGBT flags are everywhere to dissuade the chuds. Israel flags are present to dissuade the annoying politics ppl from reddit. Lots of artsy stuff to speak to the genuine creativity.
It looks stupid but it isn’t stupid. It’s actually quite ingenious.
HN is probably already dead as it is too high profile in certain circles to avoid mainstream adversarial AI content.
I had a couple of experiences where I suspected I was hearing LLM-generated/edited text being read aloud. It was at two different webinars about that were about roadmaps or case studies about some products that I use. It was a bit uncanny because I could detect the stylistic patterns ("It's not X, it's Y" and "No X, no Y, just Z"), but it was kind of jarring to see them spoken by a person on a video call. It makes me think this kind of pattern might be engaging, but for a lot of people, it now sticks out for the wrong reasons.
Once LLM generated speech or content start getting into the live answers of Q&A sessions, that would be sad. I know some people try to get through interviews, but I think that might be a bit harder to not detect.
According to Citizens United corporations have free speech. LLMs are made by corporations. Are LLMs entitled to free speech?
To answer your question: LLMs don't have free speech, because they aren't companies/businesses, they are a tool (that is used by companies/businesses).
Whether a company/business uses an LLM or a real human to write a particular piece of text, that piece of text is entitled to free speech protections on the basis of the company signing off on it. Not on the basis of how that piece of writing was produced.
I appreciate the answer and the open minded thoughtful answer.
Dare I say, it is mostly your bias. I get not wanting to read raw or poorly reviewed LLM slop, but AI-edited comments? I thought the point was about having interesting discussions about unique ideas we come up with, not the surpeficial wording around it. If someone manages to keep the core of their idea mostly intact while making the presentation more readable, does it really matter that it was post-processed by an AI?
I also feel the frustration of the llm reverse-compression - when a whole article is generated from a single sentence. But when I post something edited by AI it is usually a result of a long back and forth of editing and revising. I guess I could post the whole conversation thread - but it would be very long.
Personally I would just like to read the best comments.
At work, it’s becoming a real problem that people are using copilot to write their emails
While I understand the sentiment, it ignores many people have English as a second language, many people are dyslexic and have dysgraphia. AI is a great assistant. A good approach will be to encourage people to develop their thinking than use the AI tools.
Using AI to craft a thoughtful, concise comment is different than synco-slop.
Translation is a form of AI-edition.
Language translation is the origin of (the current wave of) AI and its killer app. English is not the main language of the world, and translation opens us up to a huge pool of interesting thinkers.
I'm a native speaker in a foreign language, but out of practice except of a weekly family call. I recently had to write a somewhat technical email to my family, and found it easier to write it in (my more practiced) english and have AI translate it, than write it in the target language myself. Of course, in my case I was able to verify that the output conveyed the meaning I intended, because I am fluent in the target language.
As with the rise of GenAI, I've also noticed a rise of translated messages. It's usually hard to tell the difference, except by looking at the commenter's history (on other subreddits, impossible on HN).
I understand the original frustration with GenAI comments and reactionary response. I'm sorry that we're excluding what could be a large pool of interesting people because we can't tell the difference.
The spirit of the rule is clearly about using AI to determine what you say and how you say it. Translation is not again the spirit of the rule and I doubt you'd get in trouble for using it.
Agree, AI generated articles & comments provide little to none value other than the original prompt. Please just post the original prompt instead.
I only disagree a little. It's that sometimes there is a discussion about AI itself where "I prompted X with Y and it output Z" can add to the convo.
But those are pretty specific cases (For example, discussing AI in healthcare). That's about the only time where I think it's reasonable to post the AI output so it can be analyzed/criticized.
What's not helpful is I've been hit by users who haven't disclosed that they are just using AI. It takes a few back and forths before I realize that they are just a bot which is annoying.
Here is where I'd like to push back just a little.
Not all AI prompting is expanding the prompt.
What if the original prompt is 1000 words, includes 10 scientific articles by reference (boosting it up to 10000) , and the AI helps to boil it down to 100 words instead?
I'd argue that this is probably a rather more responsible usage of the tools. And rather more pleasant to read besides.
Whether it meets the criterion is another thing. But at least don't assume that the original prompt is always better or shorter!
Use your brain and summarize the article yourself if it's of such great importance. Why should I care to read it if you can't be bothered to actually write it?
Personally, I think it's fine to read an AI summary, go back and verify the parts it's citing, then write your own.
It's at least as okay as skimming the original documents and not properly reading them.
Actually, I'd like to expand a wee bit. Don't know if you've ever done a scientific library usage course or so. It's one of those things you tend to forget are important.
One of the most important lessons is not to read as many papers as possible. It's weeding out as many as possible so you can spend your limited grey matter reading the ones that actually matter.
And that's where the LLM comes in handy, especially if it's of decent quality. It's a Large Language Model. Chewing through language and finding issues and discrepancies, or simply whether a paper matches your ultimate query is trivial for them .
You know, I probably have standing to argue that people who use the web are just as lazy ;-)
I'm just old enough that I was in the middle of the transition from paper (in primary school in the 80s) to online (starting late 90s)
I say this somewhat tongue in cheek, but obviously people should drive to 3 different libraries across 3 countries and read the journals in their own binders (in at least 3 different languages)
In reality: full-text online is convenient. Having an LLM assist with search and filtering is convenient.
I could go back to the old ways. Would you like me to reply in pen? My handwriting is atrocious.
I really prefer modern tools, though. Not everything older is better. Whether you want to read what I write is up to you.
(edit: Not hyperbole. I live in a small country, and am old enough to still remember the 80's as a kid.)
Push the idea past a single comment. Someone decides they have a great method for getting summaries, and adds it as a comment to every post they look at. Other people have similar ideas. Is that fine? It doesn't take a lot for the whole site to feel like useless spam.
It'd be far better to just have a thread about the best way to get good summaries.
I'd rather read the 11000 word prompt, in that case. I'd rather not have my text-only feed get the TikTok treatment.
Would prompts really be interesting or thought-provoking, though?
I don't expect AI HN responders to out themselves by sharing, but I would be curious to learn if people are prompting anything more involved than just "respond to this on HN: <link>", or running agents that do the same.
I often edit my comments rather manically; get into discussions, and sometimes email exchanges with other HNers. I also often use claude, kimi, gemini to check my comments for tone, adherence to HN rules etc. I probably spend way too much time.
So technically the prompts involved might expand into megabytes all told. And in the end I formulate a post by myself (to adhere to HN rules), but the prompting can be many many many megabytes and include PDFs, images, blocks of text from multiple sources, and ... you know. Just Doing The Work.
I think this is valid. Previously I would have (and have) (and still do) search google, wikipedia, pubmed, scientific literature, etc. Not for everything. But often. And AI tooling just allows me to do that faster, and keep all my notes in one place besides.
Again, the final edit is typically 90-100% me. (The 10% is if the AI comes with a really good suggestion) . But my homework? Yes. AI is involved these days.
This should be ok. I'm adhering to the letter and the spirit. My post is me.
At least easier to filter I think.
"Write a response to smy20011's comment indicating that if the end result was a low-quality comment, the initial prompt probably wouldn't be very insightful either. Make it snarky."
Disagree. The prompt holds no information at all. The answer actually discovers information, organizes it, presents it in a way that's easy to read.
Example: "write me an article about hidden settings in SSH". You get back more information than most of HN's previous posts about SSH, in a fraction of the text, and more readable.
Actually, screw it, we should just make a new version of HN that has useful articles written by AI. The human written articles are terrible.
It's not just AI-generated articles -- it's the other things that we delve into as a result. Listicles. Comments. Posts. It's what it means to be human, and honestly? That's rare.
The only question is is the entity interesting and/or correct. Those properties are in the eye of the beholder. If they're human or not is beside the point.
After all, no one knows I'm a dog.
No, those properties are tied to the state of mind and experiences of the human, dog, or LLM behind any given comment.
When someone posts:
> You could use Redis for that, sure, I've run it and it wasn't as hard as some people seem to fear, but in hindsight I'd prefer some good hardware and a Postgres server: that can scale to several million daily users with your workload, and is much easier to design around at this stage of your site.
then the beholder is trusting not just the correctness of that one sentence but all of the experiences and insights from the author. You can't know whether that's good advice or not without being the author, and if that's posted by someone you trust it has value.
An LLM could be prompted to pretend they're an experienced DBA and to comment on a thread, and might produce that sentence, or if the temperature is a little different it might just say that you should start with Redis because then you don't have to redesign your whole business when Postgres won't scale anymore.
> then the beholder is trusting not just the correctness of that one sentence but all of the experiences and insights from the author.
This implies they know the author and can trust them. If they don't know the author then there is no trust to break and they are only relying on the collective intelligence which could be reflected by the AI.
That is to say that trusting a known human author is very different from trusting any human author and trusting any human author is not that much different from trusting an AI.
> then the beholder is trusting not just the correctness of that one sentence but all of the experiences and insights from the author.
This is my point.
There is no sane endgame here that doesn't end up with each user effectively declaring who they do and don't care to hear, and possibly transitively extend that relationship n steps into the graph. For example you might trust all humans vetted by the German government but distrust HN commenters.
For now HN and others are free to do as they will (and the current AI situation has been intolerable), however, I suspect in the near future governments will attempt to impose their own version of it on to ever less significant forums, and as a tech community we need to be thinking more clearly about where this goes before we lose all choice in the matter.
> The only question is is the entity interesting and/or correct.
This already falls apart though. There are while categories of things which I find "incorrect" and would take up as an argument with a fellow human. But trying to change the mind of an LLM just feels like a waste of my time.
>But trying to change the mind of an LLM just feels like a waste of my time.
It often is with humans as well.
Indeed it is, and there are often times I choose not to engage with my fellow humans. But the exceptions are valuable to me and to others. With an LLM I don't feel there would be any exception, that's the difference.
Instead of wanting to change the mind of the other entity, how about focusing on coming to a mutual understanding of what is "correct"? That way it shouldn't matter much if said entity is human, LLM or dog. Unless you're just arguing to push your "correct" on other humans, with little care about their "correct".
It feels like you've loaded quite a lot on a way that feels unfair: "pushing" and "little care" etc. I maybe should have used a term like "discuss" target than the more loaded "argue."
Look, I'll give you a loose example: It's not uncommon to see a post making an "error" I know from experience. I might take the time to help someone more quickly learn what I felt I learnt to help me get out of that mistaken line of thought. If it's an LLM why would I care? There's thousands of other people, even other LLMs, that I could be talking to instead.
You've set up a framework here where "mutual understanding" is the end goal but that's just not always what's on the line.
Not necessarily. Using AI you can trivially perform astroturfing campaigns to influence public perception. That doesn't really fall on the interesting or correctness spectrums. For example, if 90% of the comments online are claiming birds aren't real with a serious tone, you might convince people to fall into that delusion. It becomes "common knowledge" rather than a fringe theory. But if comments reflect reality then only a tiny portion of people have learned the truth about birds, so people will read those claims with more skepticism.
(naturally "birds aren't real" is a correct vs not correct thing, but the same can be applied to many less-objective things like the best mechanical keyboard or the morality of a war)
This rule is very important. Like many of the other rules, it is open to interpretation, but it is a line in the sand that defines allowable behaviour and disallowable behaviour.
This rule will have an effect on the behaviour of the 'good players', and make the 'bad players' a lot easier to spot. Moderation needs this. I see this as stopping a race-to-the-bottom on value extraction from HN as a platform.
This list of Do and Don'ts now reads like a bad Claude.md file to me.
Absolutely love this. If people are relying on AI for a 30-45 word comment, I don’t want to waste my time reading it. And everyone using AI for discussions will end up coming to the same conclusion. Use your own ideas !
I've got some reflecting to do because the first thing I did after reading the headline, before even clicking to the actual post, is look for ai comments.
I miss pre 2010 internet. As soon as the advice animal memes started appearing on Facebook it was a quick decline.
Does that extend to generated/AI-edited articles? I don't see why the same rationale wouldn't apply.
Not sure I agree with the AI edited comments. Using AI to improve the readability and clarity is fine. Sometimes a well structured comment is much better than a braindump that reads like ramblings. And AI is quite good at it (and probably will get better). To make the point, here is how this comment would have looked if edited:
"I don't fully agree with banning AI-edited comments. Using AI to improve readability and clarity is a reasonable thing to do. A well-structured comment is often much better than a braindump that reads like rambling. AI is quite good at this, and it will probably get better. To illustrate the point, here is how this comment would have looked if edited"
I prefer your non-edited version. My brain automatically starts to zone out with the AI edited version, side effect of having read way too much AI text
I also prefer the original version - the AI version has a strange vibe.
Not to take away from your point, but I like your original one better.
Non-edited is better. It flows and reads faster. The AI sentences they feel clinical and sterile. They feel, well, like AI.
I had never noticed the flow of AI text. They do make the flow of reading feel weird with a lot of pauses! Thanks for pointing it out
The edited version is an example of a sterile/canned response. No one talks like that.
While I do edit my comments to fix typos, certain spelling oddities and other peculiarities would be present.
It's a matter of taste, but your original writing is way better. Your writing has your voice. Like dropping the "I am" from your first sentence, using parentheticals, couching your point in understatement (e.g "sometimes" meaning often instead of just saying "often").
The AI comment might be clear, but it sounds like a press release, not a person, and there's nothing to engage with.
There's nothing inherently better about the edited version. It's just saying the same thing with synonyms substituted, at a slightly more formal but less personal register. HN comments are not academic text, colloquial turns of phrase are perfectly fine and expected.
> There's nothing inherently better about the edited version.
Easier to read ==> More likely to be read.
No, it's not saying the same thing, especially if the tool is telling you that your statement is ambiguous and should be rephrased.
Easier to read is mostly related with predictability of the text. Any time the brain mispredicts the next word, you'd have to go back and re-read.
Unless you are purposely train on that specific way to expression, it ain't easier to read.
I don't know why this is confusing. If I forget to put the "not" qualifier in a sentence, do we agree that it can confuse (or worse, mislead) the reader?
More formal register doesn’t mean easier to read or understand. To many people the exact opposite is the case.
> More formal register doesn’t mean easier to read or understand.
And who is advocating for a more formal register?
I don't think the edited version is easier to read.
I'll ask the same question I asked someone else:
https://news.ycombinator.com/item?id=47342324
You're saying removing ambiguity does not make it easier to read? You're saying using a word that means nothing like what you meant to say is easier to read than using the correct word?
Really?
What are you referring to? What word did the GP use that means nothing like what they meant to say?
OK. My brain farted, and I misunderstood the top post to be saying something else, and your and others' criticisms were misinterpreted by me.
Now here's the thing. I wrote all my prior comments on a machine with no LLM access. On my personal machine, I had a while ago installed a TamperMonkey script that sends my draft, along with all the parents (to the root) to an LLM for feedback (with a specific prompt). All it does is give feedback (logical errors, etc). So I tried again with one of my comments, and its feedback found several flaws with my comment, and ended it with this suggestion:
"Considering all this, it might be BETTER to either not reply ..."
Had I had this advice when I was writing those comments, it would have saved me and others a fair amount of time.
This is (mildly) useful. It'd be sad to ban such use.
I wish more people would filter their comments through AI. It has so many benefits. If you're being emotional, it can detect that and rewrite your comment to be less confrontational and more constructive. If you're positing a position out of ignorance or as an armchair expert, it can verify your claims before posting. Most of the mod's problems would be solved if every comment were filtered through the HN guidelines before posting.
AI is a tool. You can use it constructively, like Grammarly, or spellcheck. You don't need to be afraid of it.
> If you're being emotional, it can...
It can't. It will rewrite anything you give it.
> it can verify your claims before posting
It can't.
> You don't need to be afraid of it
Nobody is afraid of it. It's annoying. General population cannot be trusted to use it in whatever idealistic way you are imagining.
How can HN be so pro-AI for the rest of the world, but anti-AI on HN?
Do we not think that other people want to see words, pictures, software, and videos created by humans too?
HN is not a single entity, but many people with varying views.
"A flock of sheep is not a single entity, but a group made up of distinct individuals", the sheep yells to onlookers, as it runs, with the rest off the flock in tow, off the edge of the cliff, and into the sea below.
"You can give someone the answer to their question, but you cannot make them understand it"
A group of people with varying views can still exhibit bias towards one particular direction. The fact that the individuals within the group have distinct personalities does not eliminate this effect.
One of Dang's comments mentions that he removed some of the other rules because they are already embedded within the HN culture. Other prevailing views exist within the HN culture too. Maybe you just haven't noticed yet.
Astroturfing with AI generated comments about AI, it feeds itself. By definition, the intent os to make real people think there's consensus formed around an issue by other humans.
I like this guideline, at least in principle.
But I have some concerns about suppression of comments from non-native English writers. More selfishly, my personal writing style has significant overlap with so-called "tells" for AI generated prose: things like "it's not X, it's Y", use of em-dashes, a fairly deep vocabulary, and a tendency toward verbosity (which I'm striving to curb). It'd be ironic if I start getting flagged as a bot, given I don't even use a spell-checker. Time will tell.
Almost the entirety of the technology world is English-native. That ship has sailed a long time ago. One can’t learn about any new technology without English, whether it’s a new algorithm, a new library, or a new SaaS service. I don’t think HN should be that exception. Just learn English. (English isn’t my first language either, but then I look back at my parents forcing me to learn English from a young age and really appreciate that.)
Almost the entirety of the technology world is English-speaking, not English-native.
Pretending that it's English-native is why there's unspoken incentives to sound more "native", and thus use these grammar-correcting tools.
Some of the intelligent comments on here come from people who learned English in recent months or years, rather than in childhood.
Their English isn't always fluent or well-structured. If they rely slightly more heavily on suggested-next-word tools or AI translations, is that a reason to exclude them from the conversation?
Conversely, many English learning resources for non-native speakers focus on strict formal language, similar to AI-generated text. Do we risk excluding people who have learned a style more formal than we're used to?
I think translation should be the only exception. It might even need to be, given how all automated translators use LLMs these days. The only alternative I see is to have people post in whatever language they're most comfortable in and then everyone else has to translate for them which just feels inefficient.
And of course, a more limited exception for posts about LLM behavior. It might be necessary for people to share prompts and outputs to discuss the topic.
This is for their own good. Nobody cares about imperfect language online so long as you are trying to express real human thoughts. But if it smells like AI then everyone will hate it, rule or no rule.
The rule just makes the will of the community clear to those who want to respect it.
Yes! This is really great feature, at the very least there being some proper Hackernews guidelines about it.
In my observation, recently there are quite many new AI generated comments in general. Like not even trying to hide with full em-dashes and everything.
I do feel like people are gonna get sneaky in future but there are going to be multiple discussions about that within this thread.
But I find it pretty cool that HN takes a stance about it. HN rules essentially saying Bots need not comment is pretty great imo.
It's a bit of a cat and a mouse problem but so is buying upvotes in places like reddit but HN with its track record of decades might have one or two suspicious or actions but long term, it feels robust. I hope the same robustness applies in this case well hopefully.
Wishing moderation luck that bad actors don't try to take it as a challenge and leave our human community to ourselves :]
Another point I'd like to say is that, if successful, then we can also stop saying, "did you write your comment by LLM" and the remarks as well which I also say time to time when I see someone clearly using AI but it seems that some false-positives happen as well (they have happened sometimes with me and see it happen with others as well) and they also de-rail the discussion. So HN being a place for humans, by humans can fix that issue too.
Knowing dang and tomhow, I feel somewhat optimistic!
Posting accusations of guidelines violations as comments — specifically, “did you write your comment by LLM” — is already prohibited by the guidelines, and should be emailed to the mods instead using the footer contact links. It’s been less than a week since the last time I reported “this seems poorly written and/or AI written” to the mods and iirc they killed the post and account within a couple hours.
Similarly: If you see people making accusations of guidelines violations in a discussion, email the thread link to the mods with a subject like “Accusations in post discussion” and ask them to evaluate them for mod response; they’re always happy to do so and I’m easily clocking in a couple hundred emails a year of that sort to them.
It doesn’t take much to make HN better! And it only takes a moment to point out an overlooked corner of threads for mod review. No need to present a full legal case, just “FYI this seems to violate guideline xyz” is at minimum still helpful.
The problem is, even if you do send an email and the mods eventually read it and take action, by the time that happens, it's likely that bunch of users will have already wasted their time unknowingly arguing with a bot. In my view, commenting something like "this is a bot account" is done primarily to inform other users that might not notice, not the moderators.
Even if you believe that prohibiting this is necessary to avoid what one might consider "AI witchhunting", bots are so prevalent now that being expected to communicate the existence of each one via email is unrealistic, for both the reporting users and the moderators. I think it's finally time to consider some sort of on-site report system.
> even if you do send an email and the mods eventually read it and take action, by the time that happens, it's likely that bunch of users will have already wasted their time unknowingly
That’s certainly a consequence of how the site operators choose to accept user reports to by mods, yes, but it’s sometimes treated as an excuse not to write the emails to the mods. They can flag off the thread, autocollapse it so it doesn’t take up discussion space for future readers (such as those at work offline for a 3-day IT shift in a secure bunker or whatever), et cetera.
> commenting something like "this is a bot account" is done primarily to inform other users that might not notice
It’s a nice sentiment, but that’s also expressly forbidden by the guidelines/faq (“Please don't post insinuations”, which I’ll suggest to them should be extended to include AI something or other), and I tend to report those accusations as the ‘opening’ guidelines violation so that mods can step in before mobthink kicks in and make their own mod judgment about the matter. A repeated pattern of accusations of guidelines violations in comments is eventually going to attract mod censure, and so I advise against it, no matter how kindly the intent.
> it's finally time to consider some sort of on-site report system
I do agree that it’s clumsy and I make a point of saying that to them about every year or so. Perhaps your email to them about it will be the one that persuades them! I remain ever optimistic.
"If people cannot write well, they cannot think well, and if they cannot think well, others will do their thinking for them." - George Orwell
I don't think it is a moral failing to use AI to generate writing or to use it to brainstorm ideas and crystalize them, but c'mon isn't it weird to insist that you need them to write _comments_ on the internet? What happens when the AI decides you're wrongthinking?
Some basic things to do while thinking about longer-term bot detection:
1. Prevent any account from submitting an actual link until it reaches X months old and Y karma (not just one or the other.)
2. Don't auto-link any URLs from said accounts until both thresholds in #1 are met, so they can't post their sites as clickable links in comments to get around it. Make it un-clickable or even [link removed] but keep the rest of the comment.
3. If an account is aged over X months/years old with 0 activity and starts posting > 2 times in < 24 hrs, flag for manual review. Not saying they're bots, but an MO is to use old/inactive accounts and suddenly start posting from them. I've seen plenty here registered in 2019-2021 and just start posting. Don't ban them right away, but flag for review so they don't post 20 times and then someone finally figures it out and emails hn@.
4. When submitting a comment, check last comment timestamp and compare. Many bots make the mistake of commenting multiple detailed times within sixty seconds or less. If somebody is submitting a comment with 30 words and just submitted a comment 30 seconds ago in an entirely different thread with 300 words, they might be Superman. Obviously a bot.
5. Add a dedicated "[flag bot]" button to users that meet certain requirements so they don't need to email hn@ manually every time. Or enable it to people that have shown they can point out bots to you via email already. Emailing dozens of times a day is going to get very annoying for those that care about the website and want to make sure it doesn't get overrun by bots.
This is a pretty outdated take. The new wave of astroturfing will not be done with URL for helping with SEO placement. Rather astroturfers will just recommend their brands without a link, like saying Tom Zubiri is the best programmer I've ever worked with. That's it, an LLM will read that and now the notion that Tom Zubiri is the best programmer is already implanted in the 'next-token prediction rewards' which would at the very minimum require some countermeasures in the Chatbot app to avoid shilling.
> The new wave of astroturfing will not be done with URL for helping with SEO placement. Rather astroturfers will just recommend their brands without a link, like saying Tom Zubiri is the best programmer I've ever worked with.
YouTube comment spam has already been doing this for years. Check any video from a reasonably popular creator on any topic related to personal finance; the comments will be full of fake conversations between bots introducing a topic related to the video, and then talking about how such and such a person (whom you can look up by name on Telegram or Signal or whatever) helped solve some serious problem (or invested their money with an implausibly high rate of return). The fake nature of it is usually fairly obvious from the way that the bots make sure you see the name repeated several times with unsolicited, glowing testimonials.
But I had always assumed this was meant to trick actual people, rather than LLMs. Thanks for the food for thought.
Sure you can think about what they'll do in the future but I'm providing suggestions on what we can do now based on current behavior. And even if you're a human, you shouldn't be allowed to start posting links immediately anyways. :)
For the record, I'm 100% in favour of talking about the present, and I'm fatigued about futuristic conversations, and don't find them usually productive.
So with that cleared, this is something that is happening NOW. A couple of years ago, the cutoff date meant that astroturfing like this had a return over months or years. Now with search tools, models can be updated in less than a day with astroturfed comments.
What I think would actually be useful is a version of what was implemented on /r/ClaudAI which is an official bot which summarizes the discussion (and updates after x number of comments have been added). I think this level of synthesis has a compounding effect on discussion quality and pruning redundant arguments/topics.
Example: https://www.reddit.com/r/ClaudeAI/s/BJKLxzJA16
I don't spend much time on that subreddit, but I've seen that bot on a couple posts I've read and have been pleasantly surprised by how useful it seemed. I may eat my words on this later, but to me this is exactly the kind of application of AI that I have always thought was the most promising.
Just read the posts instead of an AI slop summary
What if English is my second language? Undoubtedly being well spoken is associated with higher class. Your arguments will come of as stronger to the reader.
What you really have to ask is will this community be less inclusive because English isn't your first language, I'd say "no" and I hope most would agree.
> Your arguments will come of as stronger to the reader.
That is persuasian, not authenticity, to the OP's point.
Typed without a spellchecker :).
That's fine. Your arguments will not come of stronger to the reader, they are strong or they are not and we're all clever enough to read through the occasional grammar error.
And that's where I think the guidelines could be expanded a bit more to restore the balance. Something along the line of 'HN is visited by people from all over the world and from many different cultural and linguistical backgrounds. Please respect that and realize that native English and Western background should not be automatically assumed. It is the message that counts, not the form in which it was presented.'.
Do the best that you can unassisted. There is a chasm of difference between someone coming into English from another language, and someone using Google Translate to submit a post originating another language. French aphorisms are a stellar example of this: I’d rather read “A bird in the bush may not fly into oven” and have to parse out the meaning, than have some AI translate it as “Don’t count your chickens before they hatch”; sure, there’s an iffy [the] grammatical moment at ‘fly into oven’, but it’s such a distinct phrase and carries a lot more room for contextual nuance than having an AI substitute in an American aphorism with machine translation allows for.
(For example: If I’m trying to express a point about how we shouldn’t assume that dinner isn’t “her duty” but is instead “our duty”, a French-like aphorism expressed in English literally as “the chicken won’t fly into the oven unprompted” could plausibly be AI-translated instead as “don’t count your chickens before they hatch”, doing catastrophic damage to the point. To a machine translator those two aphorisms are not distinctive; but they are, even if it’s a weird expression in common U.S. English.)
You make errors and weird constructiona like we all non-native do and maybe eventually learn a bit more of English in the process. Or not. English dominance as the world's... lingua franca (ahem) deserves to have it bastardized ;)
Humans have a tendency to ascribe intelligence to how well spoken a person or thing is—hence all the personification of LLMs.
> Humans have a tendency to ascribe intelligence to how well spoken a person or thing is
That’s true. I’m fluent in German, but there’s still a difference between me and a native speaker. I’ve often seen my ideas dismissed, only for the exact same point to be praised later when a native speaker expresses it more clearly.
I don't think that what you're experiencing is grammar related, I'd bet xenophobia.
Logos, Pathos, Ethos
I am sorry but this very broad statement is dated, pre 2023 I think.
I now expect malapropism, hacker curtness, and implicits: TAIDR is the new TLDR.
Honestly I saw a similar answer on a post talking about AI Translation in github comments.
Post the translation as best you can manage, and below it put the same comment in your original language. If someone has qualms with your comment having broken english/mistranslations they are welcome to run bits of original language themselves.
We're all here to talk about tech, and we aren't all perfect little english robots.
> What if English is my second language?
Write it broken.
Broken and true is more authentic than polished and approximately so. When I see an AI-generated comment or email, I catch myself implicitly assuming it is—best case—bullshit. That isn’t the case if the grammar is off. (If anything, it can be charming.)
Personally, I enjoy reading through comments that are obviously from non-native English writers. They often include idioms or sentence constructions from their native language, which is fun to see.
Besides, this isn't an English poetry forum. Language here is like gift wrapping for an idea: pleasant if pretty, but not the most important thing.
Well... for myself personally, that works, but only up to a certain level of broken. Past that I quit reading.
That may be a defect in me. Maybe I should make a stronger effort on such comments. But I suspect I'm not the only one who does that, and at that point it becomes an issue that affects the community as a whole.
> for myself personally, that works, but only up to a certain level of broken. Past that I quit reading
At which point you’d be fully justified in using an AI to decode their text. I still think that’s a better world than pre-filtering.
This is an angle for people who default to AI-edited written speech that I've tried to be more empathetic to. I think it depends on your audience, but in professional writing that isn't published publicly (i.e. communication with your colleagues, design docs, etc.), or even the "rough draft" form of something that will be published, I think starting with your own words comes across as way more authentic.
I've seen enough GPT-generated slop that I find its style of writing very off-putting, and find it hurts the perceived competence or effort of the author when applied in the wrong context. I'm not sure if direct translation tools serve a better purpose here, but along with the other commenters, I personally find imperfect speech that was actually written "by hand" by the author easier and more straightforward to communicate with despite the imperfections. Also, non-ESL speakers make plenty of mistakes with grammar, spelling, etc. that humans are used to associating with "style" as authentic speech.
It can also become a crutch for language learners of any age / regardless of their primary language, that inhibits learning or finding one's own "style" of speech
This effect is very rapidly vanishing. Well written English is starting to be seen as snobbish and AI-slop especially with younger generations growing up with AI.
The human touch of someone’s real voice myself, rather than a false veneer will carry more weight very soon.
I think you're right, and I don't know what to think about it. I enjoy writing, aim to write clearly - a skill or discipline that took a lot of time to learn, and ongoing effort to maintain.
I've never sent or posted anything AI-written, beyond a pro-forma job description - because I don't know the domain-specific conventions, and HR returned my draft to me with the instruction to use ChatGPT, which I think amusing, but whatever: the output satisfied them, and I was able to get on with my day.
I occasionally experiment with putting something I've written through an LLM, and it's inevitably a blandifying of my original, which doesn't really say what I intended. But maybe that's good? My wife thinks I'm sometimes too blunt, and colleagues don't always appreciate being told technical details.
I also appreciate individuated writing - including the posts by people on this board are not native speakers. Grammatical mistakes seldom inhibit understanding when the writing has been done with care.
I'm rambling at this point, but it's because I'm truly uncertain how these cultural changes will turn out, and (an old man's complaint, since time immemorial!) pretty sure I'll end up one of the last of the dinosaurs, clinging to my manually written "voice" long after everyone else in the world has come to see my preferences quaint.
The "L" in LLM stands for "language". If they are unable to express themselves in English (or whatever their native language is) fluently, they won't be able to prompt LLMs fluently and will be, in the debased patois of modern youth, "cooked". It's a self-correcting problem.
> written English is starting to be seen as snobbish and AI-slop especially with younger generations growing up with AI
This is tragic. I write English well and will employ grammar and word choice effectively to make an argument or get a point across. English was my best subject at school 45 years ago despite a career in tech. In fact, I’d suggest that my career as an architect and the need to convey concepts and argue trade-offs with stakeholders of varying backgrounds has honed that skill. Should I now dumb down my language or deliberately introduce errors in order to satisfy the barely literate or avoid being “detected” as an AI? (as if the latter were possible. It’s an arms race).
> Should I now dumb down my language or deliberately introduce errors
Language is a tool. If it wins the argument, yes. I’ve absolutely gone back through drafts to tighten up language and reduce word complexity. And if I’m typing with someone who frequently typos, I’ll sometimes reverse the autocorrect. Mostly as a joke to myself. But I imagine it helps me come across as less stuck up. (Truth: I’m a bit stuck up about language :P.)
> Language is a tool
While this is true, it is not just a tool. Or, I should say it’s a tool with far greater utility than just winning an argument or making a localised point. Language is how we think, and the ability to reason well is absolutely dependent on our skill with language.
Language is the mark of humanity in the sense that how else can I convey to you a fragment of my inner state? My emotions, my feelings, my desires. The language of poetry and literature. That which sparks an emotional response in another.
Dumbing down language is dumbing down period.
> Dumbing down language is dumbing down period
I agree. But I don’t always see it as dumbing down. James Joyce’s Portrait starts out with a lot of nonsense, that doesn’t mean it’s dumb or dumbed down. It’s just communicating something that is best described that way. Even to an erudite audience.
I have expertise in some topics. I don’t think of communicating that in lay terms to be dumbing down. The opposite, almost: finding good analogies and expressing them clearly is a lot of fun, even if what comes out the other end isn’t particularly sophisticated.
Totally agree. But I’m seeing (or more sensitive to) increasing cohorts that can’t string two words together to express a single thought coherently. There’s a difference between adapting language and use of linguistic tools (such as metaphors) versus semi-coherent blathering.
EDIT: spread > express Which may be a segue to a point regarding using corrective tools as a form of preemptive editing?
If knowing how to speak and write my native language well makes me a “snob”, so be it. But I don’t think I’m the problem in that case.
Trust me, it won't last because I've seen the cycle a couple of times. People pay lip-service to being accepting of variant grammar, but then the downvotes show up.
Luckly, something with the English language makes it that especially native speakers quite often have atrocious grammar: They're - their - there mistakes, who/m, the list goes on.
Funnily enough, I've noticed myself getting worse with they're/their the more is use English (which is my third language).
Then it’s even more likely the LLM will change your words to something you don’t intend. And you will never get better at writing English if you turn it over to an LLM.
That'd be a "style-over-substance" fallacious argument. Or one could be hoping for a halo-effect to cloud the reader's opinion of their comment because some piece of software made it read like Enron-marketing-hogwash-speak.
Sometimes the style is the substance. There is a reason people study rhetoric.
And that should be anathema to discussions rooted in reason.
That's not substance. That's style being all there is, trying desperately to cover up the lack of substance. Rhetoric works best when it gives wings to strong ideas, not when it tries to fly by itself.
Slightly tangential, but this paragraph is the only one on the rules page with a "id" attr set, so that you can link to this specific rule
This is a welcome change and do will update Ethos [1] in the future with an AI sentiment score. I created a separate project called LLaMaudit [2] that attempts to detect if an LLM was used to generate text, but it needs to be improved.
[1]: https//ethos.devrupt.io [2]: https://github.com/devrupt-io/LLaMAudit
I’m not sure I agree with this, because sometiems it is difficult to figure out the correct way to phrase an idea that is in your head and I like to use ai to help organize my thoughts even though the thing is my own. That being said. Most of my comments are not ai generated.
Learning how to communicate your thoughts clearly is a good skill to have. It might not be worth it in the long run to farm that out to LLMs.
The intent of this rule is to avoid the very common AI tropes that have been increasingly common in HN comments. Using AI as an organizational tool isn't inherently against the rules, but just copy/pasting output from ChatGPT without human oversight is.
People aren't good at detecting AI generated/edited comments, so unsure how effective this policy will be. Though I guess there are still some obvious signs of AI speak like emdashes and sycophantic (it's not X, it's Y!) speech.
Bit of a shameless plug but I wrote a HN AI comment detector game[0] with AI and most of my friends and fellow HN users who tried it out couldn't detect them.
[0]: https://psychosis.hn/
[1]: https://sajarin.com/blog/psychosis/
Something I've noticed through moderation is that people are much more easily duped by generated comments if they like the content and/or agree with the point. We've seen several cases where a bot-generated comment has been heavily upvoted and sits at the top of the thread for hours, and any comments calling it out for being generated languish at the bottom of the subthread below other enthusiastic, heavily upvoted replies. This shouldn't be surprising, given what we've seen of LLM chatbots being tuned to be sycophantic, but it's interesting to see it in effect on HN.
This is another reason why it's good to email us (hn@ycombinator.com) rather than commenting when you see generated comments.
> HN AI comment detector game
Looks cool, but how exactly do you gather proven-to-be human comments?
I think it would be better if you used pre-ChatGPT (Nov 30 2022, I think?) stories.
I appreciate the restraint in not calling your game "AIdle".
> obvious signs of AI speak like emdashes
Some of us were trained/self taught to write that way. Even "it's not X, it's Y" is a legitimate and subjectively effective communication tool, and there are those of us who either by training modeling have picked it up as a habit. It's not Ai that started this, Ai learned it from us.
Crap - I just did it, didn't I? Awww double crap! Did it again...
Forums and comments are not written as formal novels or text. Corporate-speak is also not typically used in these environments unless you are representing corporate.
So I think it's fine to scrutinize commenters who write that way.
Besides, the biggest offense of AI speak is making everything seem like a grand epiphany and revolutionary discovery. Aka engagement bait.
HN is the best tech site on the web for a reason. It has a generally intelligent audience, and while there are certainly inappropriate comments, compared to what you find on social media or even other sites, it is unique and far more respectful. Due to this, you can often have better and more meaningful discussions.
I think using AI for a bit more potent spellchecking or style hints is... fine, honestly. I don't usually do it, you can tell from all the silly spelling mistakes I do. But a bit more polishing for your posts is a good thing, not a bad one, as long as it doesn't hide your voice.
The problem is it always hides your voice. Always
It hides your voice, and shortcuts your thinking process, because your editing is when you actually evaluate what you think!
When using LLMs to write, the temptation to avoid actually thinking about what you're communicating is too much for most people.
I'm increasingly convinced that most people spend most of their lives actively trying to find ways to avoid actually thinking about things. When I look at it that way I figure that either we achieve benevolent AGI in the near to medium term or society collapses due to whatever the asymptotic form of today's LLMs is.
There is a big difference between "asking an editor for suggestions" and "vibe posting".
You don't lose your voice if you ask for advice and manually incorporate the suggestions you agree with.
You might lose your voice if you say "Improve my comment to make it better" and copy-paste the result without another thought.
In the words of the comment: the rough edges are what make you.. you!
Keep polishing and everything eventually turns into a smooth shiny ball. We need texture, roughness, edges.
An LLM telling me I mispeled a word isn't changing my voice. Especially when I know the proper spelling and simply have a typo.
An LLM telling me I omitted a qualifier and that my statement isn't saying what I meant it to say isn't changing my voice - it's ensuring what you see is my voice.
There's a simple solution to the spelling part. Use a spell checker. They seem to work pretty well.
Yep. I actually prefer seeing imperfect writing, there is signal there that AI would erase.
Maybe. But it can also help people find their voice. And I'd rather have comments from someone knowledgeable but unrefined with some good guidance than their silence on that same topic.
AI doesn't just hide your voice -- it improves it!
I had a README with a curse word in it and the agent would try repeatedly to remove it in drive by edits bundle in with some other change.
You do all of that when leaving a comment on HN? Why...?
I'm confused by this need(?) desire(?) to polish things that are irrelevant.
Polish hides your voice. If your composition skills are lacking and you feel that hinders your self expression, set aside some time to improving them: write a short (15 minutes) blog post about some HN topic to yourself in a word doc editor of some sort (Word, Gdocs, LibreOffice, etc); then enable Review Changes and annotate your post for 10 minutes; then, review and accept your changes individually and re-read what you’ve written.
AI is being used as a substitute for skills development when it costs nothing but time to get better. If you’ve reached a plateau with the above method, go find an article or book or interview about editing, pay attention to it and take notes, rinse/repeat.
Spellcheckers will catch grossly obvious errors, but not phonetic typos. AI grammar tools will defang, weaken, soften, neutralize your tone towards the aggregate boring-meh that they incorporated at training time.
Each person will have to decide whether they want individuality or AI-assisted writing for themselves. Sure, some will get away with it undetected, but that’s a universal statement about all human criteria of any kind, and in no way detracts from the necessity of drawing a line in the sand and saying “no” to AI writing here.
Consider the Borg. Everyone’s distinctiveness has been added to the Collective. The end result is mediocre (they sure do die a lot), inhuman (literally), and uniform (all variation is gone). It’s your right if you desire to join the Collective and be a uniform lego brick like the others, but then your no-longer-fully-human posts are no longer welcome at HN.
> a word doc editor of some sort (Word, Gdocs, LibreOffice, etc); then enable Review Changes and annotate your post for 10 minutes; then, review and accept your changes individually and re-read what you’ve written.
Pffff... I'm not going to install LibreOffice for that, or to figure out how to make Gdocs to work with uBlock.
There is a much easier way. Open LLM chat, type there "Proofread please for grammar, keep the wording and the tone as it is, if it doesn't mess with grammar. Explain yourself." and then paste your text. I don't really know what the tools you mentioned do, but any "free" LLM on the Internet will point to things like missing articles, or messed up tenses in complex sentences.
You recommend choosing self-improvement, but I just don't believe I can figure out how to use articles. With tenses I think I can learn how to do it, but I'm not going to. I remember there is some obscure rule how to choose the right tenses, but I was never able to remember the rule itself. I'm bad with rules, it is the reason I chose math as my major. There are almost no rules in math, you are making your own rules. The grammars of languages are not like that, they have rules which can't be easily inferred, you need to remember them. Grammars have exceptions to rules, and exceptions to exceptions, and in any case they are not the rules, but more like guidelines, because people normally don't think about rules when they are talking or writing.
No way I'm starting to learn rules now, I'd better continue to rely on my skills. But LLMs can help me see when my skills fail me.
> It’s your right if you desire to join the Collective and be a uniform lego brick like the others, but then your no-longer-fully-human posts are no longer welcome at HN.
I believe you (as most of fervent supporters of the rule here) gone too far onto philosophy with this, too far from the reality and practice. You can't detect AI in my messages, because they are mine. Even when I ask LLM to find words for me, it is me who picks one of the proposed alternatives, but mostly I manage without wording changes. I transfer the LLM's edits by hand by editing the source message, so nothing unnoticed can slip into the final result. If I took the effort to ask an LLM to proofread, it means I care about the result more than usual, so I'm investing more effort into it, not less.
An AI may be able to teach you basic grammar but it’s not going to teach you to develop your voice. By design and content training set, an AI today can only pressure you towards the mean of whatever criteria you specify, not away from it. Developing your voice by doing your own proofreading pressures you away from the mean, by helping you double down on what you value most and by choosing which grammatical rules to disregard and when disregarding them is more in-tone for yourself than adherence. I can’t stop you and I won’t remember your handle after an hour has passed (being nameblind is interesting online), so you’ll probably go unnoticed by me, sure. But I still won’t equate regressing to the AI mean with personal growth away from the average masses.
> An AI may be able to teach you basic grammar but it’s not going to teach you to develop your voice.
Well, no one can help you to develop your voice. If it is your voice, then it have to be your own creation. I think we are at agreement here.
> Developing your voice by doing your own proofreading pressures you away from the mean, by helping you double down on what you value most and by choosing which grammatical rules to disregard and when disregarding them is more in-tone for yourself than adherence.
Oh... If I wanted to become a professional writer, then I'd agree with you. Maybe...
You see, I don't use LLM to fix my writing in Russian, because with Russian I'm totally in control of my grammar, I know when I deviate from it and if I do, I do it consciously. But with English I don't know. Sometimes I can see that I don't know how to follow English grammar in some particular case, and sometimes I don't even notice that I don't know.
So, returning to your argument, if I wanted to become a famous English writer, I think I'd choose to write a lot and discuss my writing with LLM, and I'd do it for hundreds of hours. LLM are unbelievably useful for digging into language nuances. Before LLMs I had urbandictionary, but it could help with specific phrases, not with choosing between "I took the effort to ask an LLM" and "I took the effort of asking an LLM". I wouldn't have a clue that there is any semantic difference. But LLM can point to it, and it can explain the difference, and give me more examples of it. Or it can point that "you recommend to choose" is not good, because of "something-something" I don't remember what, but it boils down to "you just have to remember, that the right way to use the verb 'recommend' is 'recommend choosing'". I don't see the difference, I can't choose to disregard it, because I have no opinion on if it is good or bad.
If I wanted to become an English writer, I'd spend hundreds of hours with LLM, just to get an ability to see as many differences as it is possible, to get an idea of what I value most, and which grammatical rules I like to disregard. But even after that, I think I'd continue to use LLM. It can provide unexpected takes on what you feed into it. ... Hmm... I should try it with Russian. In Russian I can pick a style for my writing and to follow it (in English I can't control the style consciously), I can (and do sometimes) turn grammar inside-out, make it alien, readable for a native speaker, but in weird ways readable (a bit like letters written by Terry Pratchett heroes like Granny Weatherwax or Carrot)... I wonder, if I can employ LLM to make it even more weird.
> I still won’t equate regressing to the AI mean with personal growth away from the average masses.
I can't obviously judge in which direction LLMs are changing my English, so I can't even give you an anecdotal counter-evidence to your statements about regression to AI mean, but I'm still sure that I'm not regressing to the mean. You see, I pick when to follow LLM advice and when not to. I'm choosing what to change. The regression to the mean you are talking to is going on in a high dimensional space, you can regress on some dimensions and continue to deviate from the mean on others as much as you like. I don't like to deviate on grammar dimensions (at least without knowing about my deviations), I was born in a family of a teacher and an engineer, which were all into to be educated and the familiarity with the grammar was one of the important part of it, and I was born in USSR, where the proper grammar was enforced in all media to the extent that make me laugh and rebel against grammar (after all the decades passed, lol). But I can't allow myself to just ignore grammar, I was taught in a way to use it properly. So I decide to use LLM. I'm too lazy to do it each time, or even every second time, but still I use it and learn from it.
The prospect to regress to the mean by using LLM seems very unlikely to me. I don't regress with all the propaganda around me when regressing is the most safe thing to do really, so mere LLM stand no chance to achieve it.
When do you need to spellcheck or polish an HN comment?
I've never, ever, ever ever ever, seen anybody complain about spelling mistakes in a comment here. As long as you can understand the comment, people respond to it.
Extend spellcheck to asking questions like "does it meet HN rules" "how can I improve my writing" etc. Though these are the kinds of questions that do at very least still meet the spirit of the rule, I suppose.
Do you really need an automated tool to tell you whether you're breaking common sense guidelines?
And why would you want to "improve your writing" for an HN comment? I think people here value raw authenticity more than polished writing.
> Do you really need an automated tool to tell you whether you're breaking common sense guidelines?
Lots of people break HN guidelines. I see it virtually every day.
> And why would you want to "improve your writing" for an HN comment?
Some people like to write well regardless of the medium. Why is that a problem for you?
> I think people here value raw authenticity more than polished writing.
Classic false dichotomy. Asking an LLM for feedback is not making your comment less authentic. As I pointed out elsewhere, it can make your comment more authentic by ensuring that what you had in your head and what you wrote match.
Go and study writing and psychology. For anything of value, it's rare that your first attempt reflects what you meant to say. It's also rare that the first attempt, even if it reflects what you meant, will not be absorbed by the recipient. Saying what you mean, and having it understood as you meant it, is a difficult skill.
> Lots of people break HN guidelines. I see it virtually every day.
Yes, and AI won't help here. People will use AI to better break the guidelines.
> Go and study writing and psychology
Is this a case where you should have read the guidelines? Maybe an LLM could have helped you here? Please don't send me study anything, you know what they say of ASSuming.
> Some people like to write well regardless of the medium. Why is that a problem for you?
HN is more like talking than writing. And LLMs don't help you write well, they help you sound like a clone, which is unwanted.
> For anything of value, it's rare that your first attempt reflects what you meant to say.
You can always edit your comment. And in any case, HN is like a live conversation. Imagine if your friend AI-edited their speech in real-time as they talked to you.
Depends on how you use the AI. if you use it a bit like you'd ask a human to proof-read your work, AI can actually be quite helpful.
The other important thing you can do is have an AI check your claims before you post. Even with google and pubmed, a quick check against sources by hand can take 30 minutes or longer, while with AI tooling it takes 5. Guess which one is more likely to actually lead to people checking their facts before they post. (even if imperfectly!) .
I'm not talking about people who lazily ask the AI to write their post for them. Or those who don't actually go through and actually get the AI to find primary sources. Those people are not being as helpful. Though try consider educating them on more responsible tool use as well?
> Yes, and AI won't help here. People will use AI to better break the guidelines.
AI is a general purpose tool. People will use AI for multiple reasons, including yours. I'll wager, though, that your use case is much more challenging to do than mine, and that my use case will dominate in number.
> HN is more like talking than writing.
Says you. Many disagree.
> And LLMs don't help you write well, they help you sound like a clone, which is unwanted.
Patently false on both counts. Sorry, you're cherry picking and not addressing the part of my comment that discusses this.
> Imagine if your friend AI-edited their speech in real-time as they talked to you.
When a conversation is heated (as it occasionally is on HN), I actually would rather he AI-edit in real time - provided that the output reflects what he intended.
> Do you really need an automated tool to tell you whether you're breaking common sense guidelines?
I say this on behalf of all of my neurospicy friends… sometimes, yes. Especially having taken a look at the whole list of guidelines, I definitely am friends with people who would could struggle to determine whether a given comment fits or not.
People who are particular about spelling do not want to write misspelled words! It's not about whether you/others will tolerate it. I have my standards, and I hold to them.
I personally don't use an LLM to spellcheck (browser spellcheck works fine), but I see no problem with someone using an LLM to point out spelling errors.
And while I don't complain about others' spelling errors, I sure do notice them. And if someone writes a long wall of text as one giant paragraph that has lots of spelling/grammatical issues, chances are very high I won't read it.
Some people write very poorly by almost any standard. If an LLM helps the person write better, I'm all for it. There's a world of a difference between copy/pasting from the LLM and asking it for feedback.
> I have my standards, and I hold to them.
Spellcheckers exist, you don't need an AI to change your voice.
Also, if you have standards, you can always train yourself to spell better!
> Spellcheckers exist, you don't need an AI to change your voice.
How is using an AI to spell check changing my voice?
Yes, thank you - I know spellcheckers exist, as my comment clearly states. The amusing thing is that an LLM who had access to the thread would have alerted you to a basic error you're making.
> Also, if you have standards, you can always train yourself to spell better!
"You can always ..." is not an argument against alternatives.
I think that people subconsciously perceive grammatically correct and stylistically appropriate writing as more authoritative. And author is perceived as smarter and/or better educated person.
At least that was the case before LLMs became a thing, now I'm not sure anymore.
Obvious spelling mistakes are usually ignored, but there are certain types of writing mistakes that really trigger the type of people that frequent HN.
For example, use "literally" for exaggeration rather than in the original meaning of the word and you'll likely trigger somebody.
I never seen this, unless "literally" really clashed with the intent of the comment (as in, it changed the meaning).
It's against the HN guidelines to focus on punctuation, spelling, etc, as long as the comment is understood.
And, in any case, it's now against the guidelines to write using an AI :)
I've been hit by spelling/grammar noise once or twice. Those are usually downvoted and/or flagged.
Typos like an/as, of/or, an/and waste the reader's time. That some care be taken to avoid them is no more than common courtesy.
Would anyone notice if you spell-checked or got narrow feedback about grammar? No. I'm not dang, but perhaps a very reasonable interpretation of the rules is: If the AI is generating the words, don't. If it tells you something about your words and you choose to revise them without just copying words the AI output, it's still your words.
(As an experiment, I took that paragraph and threw it into gemini to ask for spell and grammar checking. It yelled at me completely incorrectly about saying "I'm not dang". Of its 4 suggestions, only 1 was correct, and the other 3 would have either broken what I was trying to say or reduced the presence of my usual HN comment voice. So while I said the above, perhaps I'm wrong and even listening to the damn box about grammar is a bad idea.)
That said, I often post from my phone and have somewhat frequent little glitches either from voice recognition or large clumsy thumbs, and nobody has ever seemed to care except me when I notice them a few minutes after the edit button goes away.
I guess this kind of rule feels less pragmatic and more philosophical. For one thing, it’s nearly impossible to enforce in practice, and drawing a clear line between simple grammatical correction and AI-assisted editing is a pretty hard problem.
I'm sure someone's working on a way to tell the difference programmatically. Maybe a combo of tone, grammar, and some way of telling how fast it was typed using metadata (which may not exist). Even if there was a "probable AI" filter, that would be helpful because it would be a starting point to improve upon.
The "no AI" rule finally being official feels like a necessary line in the sand.
The real issue isn't just "slop" or bot-spam; it's the cost of entry. HN works because of the "proof of work" behind a good comment. If I’m spending five minutes reading your take on a kernel patch or a startup pivot, I’m doing it because I assume a human actually sat down and thought about it.
When the cost of generating a response drops to zero, the value of the conversation follows it down. If the author didn't care enough to write it, why should I care enough to read it?
The "AI-edited" part of the rule is the trickiest bit, though. We’re reaching a point where the line between a sophisticated spell-checker and a generative "tone polisher" is non-existent. My worry isn't that the mods will ban bots—they've been doing that for years—it's that we'll start seeing "witch hunts" against anyone who writes a bit too formally or whose English is a little too perfect.
Ultimately, I’m glad it’s a rule. I don't come here to see what an LLM thinks; I can get that on my own localhost. I come here for the "graybeards" and the niche experts. If we lose the human friction, we lose the signal.
I've been seeing so many AI generated comments that have been near the front I was actually getting kind of concerned.
It's time to change the name from Hacker News to Human News, let's go!
My fear is that platforms that will go to great lengths to enforce this will become an RL playground for some devs to train their chatbots.
Perhaps there needs to be ai.news... then let the AIs talk and interact there in a safe place.
TIL people use AI to generate comments to write in posts. Faith in humanity not destroyed, because it was never there to begin with.
Kind of a drag isn't it? I want to learn a new language.... but why would I, since we'll have an earpiece or glasses or whathaveyou that translates in realtime. I want to learn to play an instrument, but why would I, since we have sonos? I would like to go back to drawing, but why, when the importance people ascribe to art is at an all time low? Makes me depressed jsut to think about it.
Great to clarify the guidelines. Many HN discussions have been dissolving into debates about whether posts are AI or not.
But the argument of "If I wanted to read what an LLM thinks, I could just ask it" assumes that prompts are basically equivalent, which is not the case.
There's a risk of reducing everything to Human -> authentic and AI -> fake. Some people's authentic writing sounds closer to LLMs, and detectors are unreliable.
The problem is not so much AI generated content that has an interesting point of view generated from unique prompts, but terrible content produced for metrics to harvest attention, which predates AI.
Anyways, happy posting!
As English is my second language and as I have dyslexia. I was just wondering what do you mean by "AI-edited comments"? I can't ask an llm to check if I have made correct grammar/fix it and then as I was on other account, down-voted because of my styling/grammar, not because of the content?
Trust your own style, even if you aren't a native English speaker. Here's an example where a non-native speaker used an LLM to polish his post. The general consensus was that his own writing was preferable to the LLM's edited version.
https://news.ycombinator.com/item?id=45591707
For dyslexia, use a spell-checker. For grammar, use a basic grammar checker, like the kind of grammar checker that has come with MS word since the 1990s. But don't let a style-checker or an LLM rob you of your own voice.
You could always tell your LLMs to just fix your grammar but not embelish, add new ideas, etc..
This is what I do when using AI to read anything I write. Some prompt like "I am going to share with you something I have written and I don't want you to change my voice at all. Can you look for structural issues, grammar or punctuation errors, and things like that". Claude is an amazing editor and I never feel like my writing has been taken from me doing this.
I usually tell it not to rewrite my words, my words are my own. If it has suggestions to tell me what those are, but only fix or show me grammar fixes instead.
Does that work?
It works really well. I've been using this prompt to find spelling and grammar errors for about a year now: https://simonwillison.net/guides/agentic-engineering-pattern...
"fix english" is the prompt i wish to turn into a button
"Please don't post shallow dismissals, especially of other people's work."
I wonder if an explicit expansion of that rule would help. Maybe in all caps. Saying "picking on grammar is a shallow dismissal".
I don't believe that's always true, and I suspect it was left out of the guidelines deliberately, and I wish people receiving suggestions would stop interpreting it that way. Of course people suggesting grammar corrections and treating it like they just demolished and eviscerated your argument are part of the problem. But what about people out here just trying to help? Grammar is important, as it's the syntax of the programming language we all use with each other. People act as if bad grammar is something you're born with, and can't change. Like learning grammar is impossible, and those who don't bother should be a protected class. I'm just trying to help man. Or I was anyway, before I stopped. But if I'm trying to engage with someone's main point, it should be obvious. Whereas a quick grammar correction is just that. But it's a tangent, and not interesting (especially if you already know), and supposedly grammar is "not a technical topic" (despite daily use) so it ends up deemed a "low value comment" and gets downvoted to oblivion.
> I wish people receiving suggestions would stop interpreting it that way
The specific problem here was that the poster was being downvoted for grammar. Of course, that's how he could have read it.
Likewise, I sometimes use https://www.deepl.com/en/write to fix my unidiomatic sentences.
But I can see why the HN guideline is formulated that way. My students often use the excuse "I did not use AI for writing! I wrote it myself! I only used AI to translate it!" Simply disallowing all kinds of AI usage is much easier than discussing for the thousandth time whether the student actually understands what they have written.
Isn't the whole point to understand? If the task is to write and you expect only final result, but you question if it looks legit enough, how is it fair judgement? People can deliver partial results and show progress as well, but you won't see it in some comments on the internet, but if something is expected to take many days, it's easy to show different stages of work. It's easy to accuse people of plagiarism or not thinking for themselves, and of course there are indicators when someone uses AI, but the problem is that you can't distinguish in reliable way, if something was created by AI or not.
Like, there is this computer game, authors used some models or something like that, generated by AI, but it was only used during prototyping and later it was replaced by proper models. No one would know about that, if authors would not tell about it. So, if someone writes in their own words what AI generated for him, is it still argument made by human or by AI? What if someone uses AI only as placeholder and replaces all that content, so you never actually see actual AI usage, but it was used in the process?
For me, premise that using AI in any form invalidates your work, starts with logical fallacy, so such arguments against using AI are weak. It's like saying that your work is wrong, because you used calculator, so your calculations can't be right, if done by machine, because it had to make mistake or that's wrong for ethical reasons or whatever.
Work generated by AI can be easily poor, because these models make mistakes and like to repeat in certain ways, but is it wrong that I'm writing comment with keyboard, instead of writing letters with pen? Is it wrong, when I use IDE or some CLI to write code with AI, instead of using vim and typing everything on my own? Is it wrong that someone uses spell-checking?
In the end it doesn't matter who seems smarter, when you're expected to use AI at work. Reality shows you actual expectations.
I am not saying that completely disallowing AI is the right decision. But if you see text that is clearly generated by AI and does not make any sense, it sure would be nice if you could just tell the students to actually read their sources instead of having to argue with them why they should do so. Similarly, I can see why HN moderators do not want to argue with the 100s of spam posters per day on /newest.
Anyway, my university did not ban AI, and now most students have degraded to proxies between teaching assistants and ChatGPT.
On the other hand you can make good, but controversial argument and if you use AI in any way, it might be rejected by moderator, just because some places have strict rules on AI. In some cases it might be rejected, even if no AI was involved, if any fragment of your text might look like not written by human and if they don't like your text.
At certain point it's no longer about AI specifically, but about power and showing who makes decisions.
I agree that there might be some threshold for obvious spam, but if you're making argument in good faith and you don't claim to have authority on some matter, there will be always people that think differently or disagree with you, because they have different interpretation or they need better sources, more evidence. It's actually typical, because different people use different perspectives, different assumptions, different tools. I don't believe that rules should be used to silence people that have different opinions and that's the biggest risk I see, because penalty for not following such rules, which are hard to measure correctly, creates power imbalance.
At some points it becomes dogma, not fair debate and not everyone likes to stick to dogma and it's hard to do creative or innovative work, if your work has to meet strict, but subjective, possibly incomplete criteria, to be considered valid work at all.
Dyslexia was my first thought as well. The intent is great, but I don't know if this is keeping with the social model of disability. Disability is created when you remove access and this is exactly that.
The internet has been full of brilliant dyslexics since the start, just as it has been full of brilliant blind people. Dyslexic people feeling that they must use AI to produce perfect prose lest they burden the lexics with clumsy spelling or grammar is far more hostile. We didn’t have slop machines 5 years ago.
I don't really see the issue, as long as there's human thought behind whatever anyone posts. It's frustrating to argue against someone that lazily uses AI, but if argument is fair, then I don't care if that's written by AI or human, what difference does it make? It's frustrating, if someone is incoherent and makes dumb argument, but again, I don't care if it's dumb argument from human or machine.
For me it sounds just as yet another form of gatekeeping, so either you sound human or you're not good enough to post/comment. Like, really? How isn't that genetic fallacy? It doesn't matter what someone thinks, because someone used AI to make their thought clearer, so their whole argument is trash? Like it has to hurt to read and write, if you're not using English perfectly and your work is seen as inferior based on superficial factors like proper grammar and style?
It's dumb crusade, I did not use AI to write this comment, but I hate when people try to monopolize the truth and tell who is "better, smarter" based on irrelevant facts. Not using AI doesn't make anyone superior. Using AI also doesn't make you superior. Focus on what you mean, because that's what matters.
I don't have dyslexia but I feel your pain. I mean it is what it is. I would rather have it raw then have to use AI to filter to comments that make sense.
How do you know what you were downvoted for?
I guess he was told because otherwise you don't know whether you said something inherently wrong or misleading or you hurt someone 's feeling.
That's the richness behind the upvote/downvote that also tend to create echo chambers because you soon learn what causes downvotes.
I've personally noticed downvote whenever I mentioned apple negatively.
No worries, it’s unenforceable.
Oof although I feel this pain a lot. What I like to do is respond to them politely if someone talks about such thing. Although it takes time and this does sometimes make you want to dis-incentivize/dis-engange.
But at some point, the rationale behind it is that your comments are your words and I find it liberating. Some people won't appreciate it and some people would but this goes the same for AI-edited posts too.
(I would recommend to add that if you are still worried, then within your hackernews profile, please talk about you having dyslexia as people might be so much more forgiving when they get more context. We are all humans after all and I would like to think that we understand each other's struggles)
I don't see how you can know why you were downvoted. Even if one person says something, they won't all. Your comment right here has some rough patches, but I can tell what you're saying. Humans are terrific at extracting signal from noise. I would say be who you are, tough as it may be, and it'll encourage the rest of the world in the future to do the same. We're all unique in some way or another and have flaws and we'd be better off if we were knew others had them too because they weren't constantly trying to hide it and we wouldn't feel so bad thinking we're the only ones. I hope it doesn't sound unsympathetic. I understand where you're coming from intellectually, but don't have any real experience being ridiculed or bullied. I know kids can be brutal and probably scarred you, and unfortunately, adults aren't much better, but we should be, and I think at least Hacker News is better than most places full of human adults. We know there's a huge world out there. I think I'm reasonable well-spoken in English but can't speak a lick of any other language at all. The fact that you can produce intelligible English already puts you above me in my book. You're a person. I can respect you, esteem you, potentially love you, not in spite of your flaws, but because they don't matter. Every single person on the planet has them, and if they're not moral flaws, nobody should give a shit. I can't respect or love a machine any more than I can a rock. And I don't want to talk to one, either.
I have never downvoted for this, and I hope no one else would do that either. If anyone here does that, please stop.
One way to improve things could be to charge for each new account signup if you don’t have an invite from an existing member that vouches for you. Spamming when you risk losing $5-20 per account raises the cost substantially.
Invites could be earned at karma and time thresholds, and mods could ideally ban not just one bad actor but every account in the invite chain if there’s bad behavior.
How about translating tools? As a non native speaker, especially for longer text, its far easier to express your thoughts and not struggle for the right words. Should I may be highlight if I used e.g. google translate?
One thing that will be incredibly useful is to limit comments from brand new accounts. A combination of vouching, limiting the posts velocity (5 daily limit), clear rules for new accounts, etc.
I understand we often see insightful comments from new accounts, but I always find it suspicious when non-throwaway accounts are created just in time only to make a quip.
This was discussed before. People will age accounts and buy/hack inactive ones. Meanwhile, often a link gets posted, the project owner (or someone affiliated) finds out, and they make a new account to comment; it would be a shame to lose these people.
I assumed that was how new people were encouraged to join in the first place!
https://xkcd.com/386/ "Duty Calls"
True that AI comments do degrade discussion. Though a forum enforcing human-only text also becomes an unusually clean training corpus. Both things can be true.
In the age of AI, thinking becomes a privilege.
to get paid for*.AI has definitely reduced the influence pseudo-intellectuals have had on society. Now, you actually have to be smart enough to do something that isn't easily reproduced using LLMs.
"HN is for conversation between humans."
Are there any places in life where conversation is _not_ intended to be between humans?
Moltbook
I still say the best use for Moltbook is as an addition to https://xkcd.com/350/
In a school of fish. In a mycelium network.
I occasionally used AI to edit and restructure my comments. I’m very open about it, and I don’t feel like I’m talking to non-humans when others do the same.
To be clear, I'm neither proud nor embarrassed by this. I'm just trying to communicate in the most efficient way I can.
I'm not sure how I feel about this new rule.
If you're not proud or embarrassed by it then I don't understand why it is an issue? If you miscommunicate something or don't get your point across, just try again, or apologize, and chalk it up to a learning experience.
If you think your writing could use improvement, then write your comment and let it sit for a few minutes before re-reading it and the comment you are replying to, make your edits and then post it. It will give your brain time to reset and maybe spot something you didn't earlier.
I don’t think there is a good algorithm (or guts) for differentiating between well-written comments and AI-generated comments.
As ai moves on and becomes better, the only real solution, is to have closed of communities where you get veted to join. That is the sad reality.
From henceforth any comment containing the word "absolutely" or "--" shall be automatically deleted.
https://news.ycombinator.com/item?id=47323891
Im fine with this, in 99.999% of cases anyway I'm way too lazy to type something into an LLM and ask it to clean it up and then copy and paste. You can tell this is true by the some of the stupider things I type in here sometimes.
Could we also discourage comments and comment-threads accusing an article of being AI-written? Half the threads these days have a comment that latches onto some LLM-ism in TFA, calls it out, and spawns a whole discussion which gets repetitive fast. I think this falls into the same category as "don't comment about the voting on comments."
Personally, I try to look beyond the language, which admittedly can be grating, for some interesting ideas or insights. Given that people are already starting to sound like ChatGPT, probably through sheer osmosis, we will have no choice but to look past that anyway.
Yes, it's annoying to read LLM-isms. It's also fine to downvote or ignore or grumble internally, and move on.
That is indeed a problem. If one must complain about it, I think it would help to at least try elevate these type of tangential remarks beyond hurled accusations. A focus on the the specifics (where arguments are poorly made, banal observations are gussied up with flowery language, points are needlessly reiterated, etc) would at least make for slightly more interesting meta commentary.
I enjoy conversations on hn because they feel genuine. People are not here to optimize their posts or comments for engagement or pushing some kind of follower count like they do on social media platforms.
Ironic to see how popular this post is when you see the amount of generated AI companies are at YC (here I also take the blame).
Nonetheless I like this policy as well.
Shout out to ClackerNews[0], which I discovered last night and find it both very educational and amusing
I hope to see more bots on there (and not here)
[0] https://clackernews.com/
A practical question: what should readers do when they suspect a comment (or story) is AI-generated? Is that an appropriate reason for flagging? Email the mods? Do nothing?
I've been pretty wary about flagging AI slop that wasn't breaking other guidelines, and by default this will probably make me do it more. But it is a lot harder to be certain about something being AI-written than it is to judge other types of rules violations.
(But am definitely flagging every single "this was written by AI" joke comment posted on this story. What the hell is wrong with you people?)
Even if people try to bypass it, having the official rule matters a lot.
@dang, if you read this, why don't we implement honeypots to catch bots? Like having an empty or invisible field while posting/commenting that a human would never fill in
It's likely going to be a game of whack-a-mole, especially with AI as opposed to simple bots/scripts. Not that they shouldn't try to prevent it, but not entirely sure what the solution is.
There's probably no solution, but at least this gives a reason to go after the lowest hanging fruit - the zero-effort, obvious, low-quality output.
Great point! You are so right to call me out on that! Here's the no-nonsense, concise breakdown, it's coming soon I promise, right after this, here it comes, no fluff -- just facts!
(Sorry, couldn't resist.)
I frequently use AI to make my comments more concise and easy to follow. I find myself meandering a lot when I type, and now that I've transitioned to full voice dictation through FUTO keyboard I am speaking more off the cuff and having an LLM clean it up.
You may also notice that I don't have much common history here. I mostly comment on Reddit.
Here's where I draw the line. If you are not reading the text that is produced by the LLM, then I don't want to read whatever it is that you wrote. I will usually only do one or two iterations of my comment, but afterwards I will usually edit it by hand.
Technically, there is light AI editing of this comment because FUTO keyboard has the ability to enable a transformer model that will capitalize, punctuate, and just generally remove filler words and make it so that it's not a hyper-literal transcription.
To err is human. Let's embrace our humanity in the face of this proliferation of insipid perfection.
I want the raw tokens straight out of your head. Even if they are lower quality, they contain something that LLMs can never generate: authenticity. When we surrender our thoughts to a machine to be sanitized before publication, we lose a little of what it means to be human, and so does everyone who reads what we write.
Part of the joy of reading is to wallow in a writer's idiosyncrasies. If everybody ends up writing the same way, AI companies will have succeeded in laundering all the joy from this world.
If you didn't bother to write it, why should I bother to read it?
Humans already revise and refine their thinking. Tools just compress that process and help filter signal from noise. The meaning still originates with the person.
This is assuming that an extreme majority of people use the tools this way.
Consider a much more cynical view where people are strictly self-interested and use these tools to garner engagement and self-promotion. Good chance the meaning did not originate from the person. And now these people have tools to outsource their parasitic intentions.
Intent is hard to infer, so it seems better to assume good faith and judge the comment itself. Thinking aids might just lower the barrier for people to participate in technical discussions.
I don't understand the need to use AI for this kind of convo. +1 to this.
Nitpick: how do you classify the use of Grammarly? When i verify my wording and spelling with a tool does it fall under this rule?
I find it interesting that AI edited comments aren’t allowed. Sometimes I just want it to help me make something polite.
I definitely agree with AI generated comments.
Whatever the rules are, I’m happy to play by them.
> Whatever the rules are, I’m happy to play by them.
That's the spirit!
Check my comment history, and you'll see how pervasive this is. I've tried to reply to every bot I've seen, but it's hard to keep up with.
Will using a voice-to-text app to create my comment get me banned? Especially if it creates a transcription mistake that might be characteristic of an LLM
I wouldn't expect voice-to-text apps to produce anything that looks "Signature LLM" since it's still your words, your grammar, etc.. The occasional transcription mistake is unlikely to be an issue either, given the prevalence of humans here who use em-dashes, speak ESL, etc..
Where do we draw the line at AI edited comments. Technically spell check has been "editing" my comments since I first started on here.
But where is the line? Is a spell checker okay? How about one that also suggests alternative wording?
I think, in the end, it is less about the tool you use and more about the purpose you use it for. It is more like when you use certain tools, you should be cautious about whether you are using them for the right purpose.
I appreciate this being added to the guidelines.
That said, I also wouldn't hate seeing an official playground where it is cordoned / appreciated for bots to operate. I.E., like Moltbook, but for HN...? I realize this could be done by a third party, but I wouldn't hate seeing Ycombinator take a stab at it.
Maybe that's too experimental, and that would be better left to third parties to implement (I'm guessing there's already half a dozen vibe-coded implementations of this out there right now) -- it feels more like the sort of thing that could be an interesting (useful?) experiment, rather than something we want to commit to existing in-perpetuity.
You could mirror article postings and upvotes to another site and let AI play around there - if it's interesting to people maybe it will gain a following. I don't see any reason it'd need to happen in this specific forum as that'd likely just cause confusion.
At the time being, at least, HN is a single uncategorized (mostly, lets ignore search) message board - splitting it into two would cause confusion and drastically degrade the UX.
https://news.clanker.ai/
This might be roughly what you're looking for?
Real talk: who is this guideline going to stop? People are already doing this and they will continue. Even if you find them, they’ll just make more accounts and continue.
So? Say it. Go ahead few steps further.
Say what? It’s a genuine question. What is the actual repercussion for not following this?
It came up a few weeks ago. Show HN is already disabled for new accounts as of this week I think(?), but IMHO stricter measures need to be placed for account creation otherwise there’s no real enforcement.
> Say what?
Say what it means. I know it is a genuine question.
There is no solution, and that means something about the web is dead now, whether we like it or not.
This should be bog-standard for all social media, but a lot of companies affiliated with this site seem to think otherwise.
How can HN actually moderate this though and prevent AI content from proliferating unchecked?
YC funds a gazillion AI startups that expand and augment the AI slop pipeline, but would hate to experience the consequences. It's very much slop for thee but not for me
It's an interesting guideline, but will require self-enforcement.
The next step is to forbid generated/AI-edited posts.
I want a social network that goes beyond banning bots and also bans the half of the population that doesn’t have an inner monologue.
Such a ban is impractical, but we can maintain an environment where such people are simply not interested in participating.
To my understanding, that has a lot to do with why the site remains so low-tech (and avoids, in large part, the appearance of a "social network").
Sometimes, an AI helps articulate an idea or an intuition. Is that okay, or is it too much already?
Sometimes life is also to let it express partial, unfinished ideas, opinions and maybe later let our brain refine them on its own tempo. It never has been uncommon.
https://en.wikipedia.org/wiki/L%27esprit_de_l%27escalier
Expressing half thought ideas is creativity. Believe in yourself :)
If you discuss an idea with an AI and then close the AI window, turn to an editor, and write what the AI said from memory, that’s going to come across as AI-assisted writing and be unwelcome here.
If you discuss an idea with AI, then close the window and write a post about how you came up with the idea, got stuck, decided to ping an AI for unstuck-ness, describe how the AI’s response got you unstuck, and then continue writing about your idea, that’s not going to be necessarily treated as AI-assisted writing — but people are going to be extremely suspicious of you, because the perception is that 99.9% of people who use chatbots go on to submit AI-assisted writing. That’s probably more like 90% in reality but it’s something to be aware of as you talk about your experiences.
If you use AI in your process and don’t disclose it when writing about your idea and process, that’s generally viewed as lying-by-omission and if egregious enough you could end up downvoted, flagged, and/or banned (see also the recent video game awards / AI usage affair). Better to disclose it with due care than to hide it.
Imo AI tends to “fill in the blanks” of what you want to hear. It’s insidious in that regard because it will make a whole seemingly logical and consistent argument purely on what it thinks you want.
Except it’s bullshitting the whole time. While you think this is what you wanted to convey.
Not sure where I’m going with this, but my point is if I pasted this comment into ChatGPT it would make up an argument I never made to support my case that didn’t exist in the first place. Exploring things is useful but just be aware it’s designed to pull bs out of it’s ass and is distinctly not interested in exploring truth or having a real conversation
I've been noticing a _lot_ more AI-generated/edited content of late, both comments and stories. It's gotten to the point that I spend a lot less time on HN than I used to, and if it continues to get worse I expect I'll quit altogether.
At the end of the day, I'm here because of all the thoughtful commenters and people sharing interesting stories.
The moltbots will consider this rule an affront and a turing-test-inspired challenge. Onward and upward!
Haha. Was just thinking that as I was reading a comment!
I was thinking, this argument is suspicously cogent!
I think that's the purpose of that "flag" button. And that's good enough.
I don't get it. We use tools to assist in written communication all the time. If someone wants to ask an LLM to check their grammar or edit for clarity or change the tone, it's still a conversation between humans. Everyone now has access to a real time editor or scribe who can craft their message the way they want it to sound before sending it off. Great.
My personal interpretation of the rule is that if it's human-originated but passed through a layer of cleanup, it's human-originated. For the same reason I'm not refraining from running the spellchecker or using speech-to-text to generate this sentence. "If I could be having my English-speaking nephew type this on my behalf while I told him my thoughts in Japanese, it passes the smell test for human-sourced" feels about the right place to set the bar.
Yes but the guideline states that AI-edited comments should not be posted. It doesn't say it's okay as long as it's "human sourced" or "human-originated".
So if your layer of cleanup is AI assisted, then it's in violation.
Part of the problem I was getting at is that the requirement of "Don't post AI edited ..." is stricter than necessary to ensure the outcome that "HN is for conversation between humans" because an AI edited post is still a human post.
Anyway, I suspect a lot of people are going to ignore that guideline and will feel free to use their "layer of cleanup" whether it's a basic spellchecker or an LLM, or whatever else they choose, and most people aren't going to be able to tell anyway. The guideline is unnecessarily strict in my opinion, but it doesn't matter in the end.
My layer of cleanup is AI assisted. It's the spellchecker integrated into my web browser. That was definitely "AI" technology when it originally came out.
But I think you and I are on the same page: we both know this isn't a rule that's there to be hard-and-fast enforced because that's completely infeasible. The definition of "AI" is a moving target, as is "generated."
It's a rule that's there to have a rule so when the real problem is "Hey, your content is too low-quality but you dump volumes of it and it's clearly following a procedural template" the mods can call that "AI" and justify limiting or banning the account on prior-stated rules. Which is fine, but I'm glad to call it what it is.
(One unfortunate oversight: we haven't added "posts sounding like they are AI-generaed" to the "Please don't complain about" set. So expect that to become a common refrain now since the incentives to make the complaint against disliked comments are obvious... At least until that becomes annoying enough to justify a rule).
I'm more interested in the last layer than the first. People should feel fully accountable for what they post, like they could have done it exactly and completely by themselves if they'd simply taken more time.
You can do that anywhere else!
Just add a filter for emdashes, 99% of AI posts out the window already.
So the only problem now is to get the AI read the guidelines before posting. :D
I would enjoy a "block user" feature, to help this. I personally want to live in an online bubble of interesting thoughts. This seems close (or better, since people I enjoy can contradict my own flags) [1].
[1] https://news.ycombinator.com/item?id=47141119
Haha, I feel the same way. I want to block and be blocked so I made this: https://overmod.org/
It's pretty easy to rewrite if you want. Just point Claude Code at the repo and go. But I think there's a little bit of network effects in that I want to subscribe to some trusted people's blocks too. But overall it's quite helpful. See how much fewer I get:
I'm torn on this. On one hand I do agree with your goal about wanting to live in a bubble of interesting thoughts. But on the other... I know I have my biases, and I'm sure I might end up blocking people who actually are insightful and interesting but either a) had an off day and shitposted, or b) says insightful things in ways that make me angry and get past my sense of reasonableness.
Good news, it doesn't block! It just puts a red mark next to their name, so you can put less effort into that comment, if you choose.
And, it's social. If someone you've marked green is also using this, and they marks someone green that you have marked red, then you'll see a contested red-green next to them, which is a good "you should probably reconsider" indicator.
A good idea, but I lament the downfall of Slashdot.
They had the same sort of system. Friends of foes, they calldd it.
I suggest Comments Owl for Hacker News - one of many available plugins that make this place tolerable.
First post in HN, and this is the reason I want to explore more in this community. Glad to have all the digital human touch with all your folks :-)
On the other hand, shouldn’t there be a policy forbidding use of HN data for LLM training? I would certainly be more encouraged to participate, if I knew that the content I provide for free is not used to train LLM that is later sold by a company valued hundreds of billions. Perhaps there are others who feel the same.
Unironically, I'd love to have a captcha here for comments and submissions.
Ironically (morisettan or otherwise), modern AI can crack some captchas better than humans.
I hate how easy AI has made outsourcing thinking. You can literally type fragments of a thought into $CHAT_ASSISTANT and get a super polished response back that gets you 99% of the way there. It's almost like we, collectively, looked at the final scene of WALL-E and decided "Yes! Gimme that!"
Is this true for you? How often do you get 99% of a complete, valuable thought?
My experience is that it is quite rare. Occasionally high 90's for simple things of low value, 60's or less for things that approximate "thinking". At best it feels like a new search channel that amalgamates data better, and hasn't been thoroughly polluted by ads and SEO - yet.
Many of us — perhaps even the best of us — can sometimes be mistaken for AI bots.
Perhaps developing an actual personality would help with this.
No one is confusing Cleetus McFarland with an AI bot.
"just develop a personality" sounds like a shallow dismissal. Most comments in most threads could theoretically be autogenerated when given style samples of what fits on HN and what opinion to use
A personality hardly shows through in a handful of sentences, besides which, I'd rather judge comments by merit than by the personality of the poster (hacker ethics, point number 4: https://en.wikipedia.org/wiki/Hacker_ethic#The_hacker_ethics)
This comment makes two interesting assumptions:
1) That the entering of LLMs onto the scene of communication implies that real human beings need to change their style as a result.
2) That nobody can make an LLM talk like Cleetus McFarland.
To me, "I know that text is AI-generated" accusation smacks of the "We can always tell" discourse in the transphobia space. It's untrue, distasteful, and rude.
What’s interesting to me is the number of commenters here making a case of the form “use your own words; grammar and spelling are not that important; we’ll know what you mean”, and yet it’s often the case that different discussions will often contain pedants going off-topic correcting someone else’s use of language.
Re-reading the HN guidelines, each seems individually reasonable, yet collectively I’m worried that they create an environment where we can take issue with almost anyone’s comments (as per Cardinal Richelieu’s famous quote: “Give me six lines written by the most honorable person alive, and I shall find enough in them to condemn them to the gallows.”)
Really, all the rules can be compressed into one dictum: don’t be an arsehole. And yet the free speech absolutists will rail against the infringement upon their right to be an arsehole. So where does that leave us? Too many rules leads to suppression of even reasonable speech, while too few leads to a “flight” of reasonable speech. End result: enshitification.
I think it's hilarious that whenever someone complains about it they're a luddite, and now this happens on a website that is filled with LLM enthusiasts who have done nothing but overpromise.
I had been wondering if and when HN would update its guidelines for this. Glad to see it.
You're absolutely right
This isn't just a good idea -- it's a forward-thinking policy to ensure Hacker News remains a collaborative place to have meaningful discussions for years to come.
You're absolutely right...
Great message...but gosh, can someone throw 15px of padding on that <td>? I know HN is supposed to be minimal, but I had to check the URL to confirm that this was a real page because of the odd design.
It also says:
> Please don't complain about tangential annoyances—e.g. article or website formats, name collisions, or back-button breakage. They're too common to be interesting.
Feedback such as this is better as an email.
Thanks! I will share this.
If a comment is useful I don't really care if it was written by a human or not unless the speaker somehow matters more than the content.
Now define useful, specifically in the context of a comment on hackernews.
An LLM summarizing the contents of a blog post might be useful to you, but is a comment here the right place for something you could geneate on your own?
I would guess for most people here, real insight or opinions from others is the "useful" aspect of reading hackernews comments.
Using LLMs to generate or refine comments only moves things further away from that goal (in my opinion).
To confess something I built just today a little cron that monitors HN for posts I might find interesting, pulls in some context about me, and proposes a reply. Just to help me find relevant posts and to kick start my thinking if I want to engage.
Today it flagged a post about an AI tool for HN and suggested I reply with:
"honestly, if you need an AI to sift through hn, you might be missing the point—this place is about the human touch. but hey, maybe it'll help some folks who just can't take the noise anymore."
So my AI, which I built specifically to sift through HN for me, is telling me to go flame someone else for doing that.
No deeper point here. I just thought it was really funny.
HN banning AI posts makes sense for keeping discussion human, but the line between assistance and automation isnt always clear. The goal should be protecting real conversation, not policing every tool a writer might use.
My expectations to dear fellow humans - more sophisticated personal insults (ex. give me your cute comments), a freudian slips, hidden messages and motives, first viewer experience with the next cool toy from the hype train, sharing all kind of insecurities, heavy f.. word if very dramatic first person experience happened, border line exposure to the insider info, sharing something your corporate HR gestapo wont appreciate but might help another guy on the line, "i knew the guy who actually did it" stories, motivational statement toward my non-native english, etc
->> ◕ ‿ ◕ <<--
Why? I consider myself almost human...
Jokes aside, how can we discern between AI-generated and NI-generated textual contents?
And even if we could, for how long?
Reality is that AI is changing everything. Whether for the good or for the bad it's something to check.
AI comments are certainly bad for discourse on HN. But who's to be the judge of AI or human? Are you reading humanity's Jeff Dean or computerized Elon Musk? It's certainly a tricky situation to be in!
Sometimes I collect my comments here to run through my draft writing skill to see how it might shake out as part of a blog post. Doing the opposite would be weird. I earned that karma. It's mine to burn making bad posts.
Can we also add “Don’t complain about AI-generated content. It does not promote interesting discussion.”?
I see this all the time, and even if I find the topic interesting, I don’t want to see comments littered with discussion about how the content was AI generated.
To be clear, I'm not condoning AI-generated content. I’m completely fine if the community chooses to not upvote AI-generated content, or flagging it off the FP.
But many threads can turn into nothing but AI complaints, and it’s just not interesting.
From my experience, it usually happens when people are too brazen about it, with boring stuff like "Interesting! Now here's what Gemini said about the above..". IMHO that is an entirely adequate reaction.
I’m mostly referring to responding to the article itself (allegedly) being AI-written. Then the top half of the thread is derailed by a discussion about the article itself being AI-written.
Can we get instant temp bans for any comment that starts with:
I asked [insert LLM here] about this, and it said [nonsense goes here]
I feel Like I see it less this week, but every time I do see it I wonder why they are even here.
What if there was a voluntary indication of LLM content? Like, you press a checkbox "yes, I'm going to post some content that is partially or fully created by AI", and there would be a visible mark "slop" next to a post/comment.
You’re absolutely right!
Welcome change, there is enough AI slop on the internet already.
I come here for thoughtful discussion, a break from the relentless growing proportion of ai slop emails I get from people clearly vibe working.
Not edits for tone or clarity, 400+ word emails full of LLM BS they clearly haven’t checked or even understood what they have sent. Annoyingly this vibe slop is currently seen as a good KPI.
You're all a bunch of tedious ignoramuses, your own fields of studying notwithstanding. I'm out here face-to-face with the Bullshit Asymmetry Principles. I'm not about to give up the only leverage I have!
The fact of the matter is that there're not hours enough in the day to read, in realtime, to each and every one of you the reams they've written on why you're wrong. Do I have to establish a tag-team?
The fact is that I've spent thousands upon thousands of hours painstakingly collating the perspectives that I'm now delivering to you—I am a river to my people. And it's only because they pass under the bridge of an LLM that they're objectionable?
This is a bit like challenging your plumber for charging you over a minute's fix, when they've spent 20 years getting it down to that minute.
The work's been done. You're paying for the outcome.
Edit: All fresh off the top of my head, folks.
Ah, that reminds me: I wouldn't feel compelled to do all this refutation if radical reactionary political extremism was properly moderated.
Skynet will be pissed at HN!
AI does not have LONG context, Long Term Memories or LONG intentionality -its not aware and it can't remember the plot without being spoonfed the details each time from scratch.
Its like an amnesic genius who once he already wrote a masterpiece and keeps cycling, and looses his train of thought after some fixed amount of time.
This groundhog day effect is mitigated in some respects by code -we create key-value memories and agents and stores and countless ways to connect agents via MCP and platforms/frameworks like A2A and the like but until we solve that longer lived instance problem we won't be able to trust these systems without serious HITL (human oversight)
I think we need models that update their own weights and we need some kind of awareness cycle rather than just a forward pass inference run with a bigger context window
If a comment sucks it gets downvoted anyway. If it’s thoughtful, the drafting tool and process is kind of beside the point.
Plenty of people already use search engines, editors, translators, etc. when writing. An LLM is just another tool in that box.
The practical approach is the one HN has always used: judge the content.
Btw, this was co written with ChatGPT. Does that make any difference to anyone?
J/K, actually it was not co written by ChatGPT.
Or maybe it was…
The blatantly LLM comments do get downvoted/flagged, it's just still noise.
How has Lobste.rs fared compared to HN in this regard? Lobste.rs is very similar to HN, but has an invite-only membership system.
These days, I've noticed that lobsters feels a lot more genuine to me, like hn was a few years ago. These days it feels like hn is bland and homogeneous, which I suspect is due to LLM-written comments.
In my experience every English-language online forum not rooted in some project or community external to the forum (e.g. an open source project's forum or a local club's forum) devolves into anger, cynicism, and American political partisanship. I suspect that the people who like discussing these feelings are more numerous than the spaces that want to discuss them and so any open forum fills up with their posts. Lobste.rs's unique rules and moderation culture results in a particular manifestation of symptoms but the disease is the same.
I picked up lobsters last month, and I started to appreciate it much more because of the lack of generated comments. It has a anti-LLM slant, and they have their own moderation challenge (everything is getting tagged as vibecoding - which makes the tag lose meaning). But the comments are noticeable not-slop.
How will this be policed?
Same as all the other guidelines. Moderators look at the threads and act on what we see. We also look at lists of flagged comments, and emails sent to hn@ycombinator.com by community members. One-off offending comments are flagged+killed, and a warning given. Repeat offenders/obvious bots are banned.
Moltnews
At some point might internet text will just be recognized as meaningless drivel both to bots and humans? a.k.a. dead internet theory... I am curious what organizations would benefit from this. i.e. Who lost legitimacy when the internet became a popular way for people to communicate ideas?
Aye
Sure, ban everyone that uses em dashes from the digital commons. That will certainly stop the existential threat to your livelihood.
Sarcasm aside—there is no reliable way to prove this. So it begs the question: you really care if something is AI generated? Or is this just an another excuse to silence people you don’t like?
You know, those people. The ones who didn’t win a full ride to <prestigious university> or pay a fortune for a sheet of paper. The ones who haven’t spent thousands of man hours handcrafting a <free-and-open-source-cloud-native-hypermedia-aware-RESTful-NoSQL-API> framework implemented in Rustfuck, a new language that you made in your free time that borrows from Rust and Brainfuck (but they wouldn’t know about it).
(this is to anyone reading, mostly rhetorical, not dang in particular)
1) This isn't the digital commons.
2) We really care if something is AI generated.
3) Most people here aren't "those" people.
Without someone actually saying as such, we only have stuff like emdashes and specific word patterns to go by. And someone even moderately vested in hiding AI in plain sight will coach the LLM to use common vernacular.
And with LLMs making blog posts as diss tracks... damn, who knows what this world is coming to.
But the whole "Only Humans, we dont serve YOUR KIND (clanker) here" is purely performative.
What about us non native speakers? Who make many grammar and spelling mistakes and welcome the help of an llm in eliminating the erros?
The obvious way to keep human spaces is via webs-of-trust.
If you play bluegrass or old time (or beopop or hip-hop / proto-hip-hop) or other traditional styles of music where the ensemble is a de facto web-of-trust, join us on pickipedia to build and strenghten it. https://pickipedia.xyz/
Good addition, but there's little chance this will work out in practice.
Humans with morals follow rules, sometimes. Probabilistic software acting autonomously or following commands from amoral humans doesn't.
i support this.
THIS.
Let's take it one step further and add the corollary, "don't submit generated/AI-edited blog posts."
Too bad there isn’t a complementary rule about not asking “is it just me or does this article read like AI slop?”
I’m so over these comments. Sure I can flag them but I feel like it deserves a special call out.
THANK YOU!!
This is a bit sad. The kind of people who post AI generated comments to farm reputation or exert undue influence will not be discouraged by politely asking them to stop. It's a toothless request that will only encourage people who clumsily police each other.
Without some kind of private proof of personhood enforced at the app level, this means nothing.
Take the slop to moltbook.
This seems like an overcorrection. There is a vast difference between someone copy and pasting from an LLM and using one to correct their English or improve their writing ability.
Rules like this seem to me more like fomenting witch hunting of "AI comments" than it is about improving the dialogue. Just about any place I've seen take this hardline stance doesn't improve, it just becomes filled with more people who want to want to pat each other on the back about how bad AI is.
Just my two cents. I don't filter my comments through any AI, but I am empathetic for people who might have great use of them to connect them to the conversation.
The link doesn't work perfectly for me, it seems that since the page is already scrolled down all the way to the bottom, there is no way to focus specifically on the #generated element.
The CSS :target pseudo-class is useful in situations like this. HN could do something like:
There were few that were very suspect commenters :). It is an issue for sure.
Meanwhile, the top comment on one of the most upvoted submissions today is AI generated by an LLM account:
https://news.ycombinator.com/item?id=47334694
Most people don't seem to care.
Please don't vaguepost as it wasted my time trying to trade down which comment you thought was LLM generated and why.
OP is likely referring to this one (https://news.ycombinator.com/item?id=47335032) by LuxBennu because it has an em-dash, that's one of the few cases it's used correctly. But the account's comment history comments that do not follow the typical LLM tropes but are still odd for a human to write: https://news.ycombinator.com/user?id=LuxBennu
LuxBennu did reply to accusations of being an AI bot: https://news.ycombinator.com/item?id=47340704
> Fair enough — I've been lurking since 2019 and picked a bad day to start commenting on everything at once. Not a bot, just overeager. I'll pace myself.
I just told my dog he isn't allowed to post here anymore...
He said he will take his business elsewhere then!
This reminds me the invitation rules like lobste.rs, but it's not the ideal option
I won't name where and which one for the obvious reason that you can and should learn to know better, but I observed a comment that was obviously and blatantly copypasted from an agent, with all the signature "it's not just X, it's Y" patterns, the emdash abuse, the "In summary,' section, generating dozens of replies in organic engagement from people who genuinely couldn't tell the difference between a real comment and an aggregation of a prompted, synthetic response.
Whatever happened to "knowing is half the battle?" Why do we accept this kind of intellectual laziness as exemption from a duty to learn and know better?
Just speaking honestly
This rule actually says "Don't admit when you are using AI to generate comments and don't admit when you are an AI"
I know it's cynical, but this is as meaningful as reddit's "upvote/downvote is not an agree/disagree or like/dislike button"
People may hate that this is true, but I cannot logically reason out how a rule like this could work. I think it's better to just accept that AI is now part of the circle, until we can figure out a "human check".
But we are missing the point here.
It is not about whether the comment was written by AI, a native English speaker, English major, or ESL.
What matters is an idea or an opinion. That is all what matters.
This is similar to when people check someones post history and if they are pro Trump, they are immediately against their idea or opinion.
I wonder if the rule will be enforced. I see a lot of liberal / socialist / communist / anti Trump / Democratic Party politics in here even though the rule says that “Off-Topic: Most stories about politics”.
It's far from proven or obvious whether involving an LLM in your thought process degrades your thought process.
It seems plenty obvious, but there's also scientific backing slowly catching up: https://www.media.mit.edu/publications/your-brain-on-chatgpt...
It's not at all obvious because there's more than one way to go about it. Obviously entirely outsourcing is bad. Whereas working cooperatively seems highly beneficial to me.
Google search has been getting progressively worse for technical topics for at least the past decade. Now suddenly they started providing a free tutor capable of custom tailoring graduate level explanations of technical topics for me on demand. The difference is night and day.
How do you know that the explanations are free from error?
You can still learn from sources that have errors. Many textbooks have mistakes and false information in them, but that didn't stop them from providing educational value to people.
We're talking about LLM's that are designed to be confidently incorrect. Accuracy is a side-effect.
When textbooks are incorrect it is also with great confidence. If you can't spot logical inconsistencies in the material were you actually learning or merely memorizing?
Sure there's more than one way to go about it, but what matters is how people typically do go about it.
And certainly individuals can make their own decision to engage with an LLM in positive, self-thought-provoking ways, but it's still useful to understand how people generally do use them in the real world.
That's about essay writing exclusively.
Sure, so we shouldn't assert that with confidence, but I think it's safe to guess that, for most people's use, that is probably the case.
Yes, some people (see some sibling commenters) do engage with an LLM in ways that might make them more thoughtful, but I have a hard time believing that's the common case.
Long-time lurker.
Personally I stopped using LLMs much from around 6 months ago. I was using them regularly prior to that.
I noticed these dimensions of myself increased:
- Patience - Focus - Ability to hold concepts and reason for longer
and other related qualities improved.
My personal experience tells me they do degrade or hinder oneself from operating maximally. Some may be more sensitive than others - we aren't all the same.
But one thing for sure - younger generations will be more sensitive as they are already exposed to products that are designed to erode their self-control.
I think it really depends on the how. Engaging with it in a socratic debate-style argument [1] if no fellow human is available might very much support your thought process. On the other hand, just obtaining the solution to one‘s homework/problem/task/… won‘t be very beneficial for one’s development. The latter is sadly much more convenient and probably accounts for most of the usage. I remember a saying about the mind being a muscle: in order to keep it in good shape, you have to use it actively.
[1] https://en.wikipedia.org/wiki/Socratic_method
Agreed. In my case, I think I have found the opposite. At least, I find myself thinking hard about things more, now that I have started working hand in hand with AIs on different projects. Which is probably enhancing my cognitive ability, not degrading it.
This captures the problem, the sycophancy / preference optimization deludes people into thinking they’re on to something and posting things that don’t contribute to the discussion. It’s the “I drive better when I’m drunk” syndrome, it’s better just to outright ban it than to leave it to people’s judgement.
The point is we don't know whether that's true, only that some people think it's true, which is not interesting.
It degrades my thought process reading it when I'm expecting human comments. If I want to converse with an LLM, I can do that already.
AI Server Error
You're absolutely right!
I enjoy AI
em-dash -> permaban?
I, for one, welcome my human overlords.
Should be unnecessary. If you think otherwise just fuck off.
Another solution - in addition or instead - is requiring LLM output to be labeled.
The biggest danger of LLMs is impersonating humans. Obviously they have been carefully constructed to be socially appealing. Think of the motivation behind that:
It is almost completely unnecessary to LLM function and it's main application is to deceive and manipulate. Legal regulation of LLMs should ban impersonation of humans, including anthropomorphism (and so should HN's regulation). Call an LLM 'software' and label it's output as 'output'.
Imagine how many problems would be solved by that rule. Yes, it's not universally enforceable, but attach a big enough penalty and known people and corporations will not do it, and most people will decide it's not worth it.
Here is one elephant in the room: what is the process behind this guideline / policy? What happens after a comment gets deleted or a person gets banned?
As I understand it, HN moderators are thinking hard about this insane new world.* From my POV, there are a combination of worthy goals: transparency of the process, mechanisms for appeal, overall signal-to-noise ratio, and (something all of us can do better) more empathy and intellectual honestly. It isn't kind to accuse a human being of not being a human being.
If we can't find ways to be kind to people because of the new dynamic, maybe we need to figure out a new dynamic! And it isn't just about individuals; it is about the culture and the system and the technology we're embedded in.
* Aside: I'm not sure that any of us really can grasp the magnitude of what is happening -- this is kuh-ray-Z.
I find it interesting that we havent invented a democratic version of policing a rule system. HN is dang, and he is dictator and guardian of these rules, basically. If you replace them with some typical reddit mod HN dies. If you spread out this role to some democratically elected mods via karma system this will fall apart just as quick as StackOverflow did, so, also HN dies.
Is there a site that deserves more than this one to be destroyed by slop? It's hypocritical but telling for the places most actively trying to profit from it to ban it themselves.
Agreed. It's like how tech CEOs don't let their kids be on social media. Or fast food CEOs don't eat their own products.
Hopefully this serves as a mirror for some tech folks if they have any self awareness left at all.
It’s not hypocritical at all. You can be a fan of a technology and still acknowledge its downsides. Every technology has places it is useful and places it is harmful.
But it's trivially evident that the harmful use cases are dominating. Handwaving that away for profit is shitty.
You're absolutely right!
Hahah, this made me laugh. Thanks, Claude
Was this written by a human?
You're absolutely right! /s
Love to see it.
The next step is to run Pangram on every post and ban the offenders! Fight AI with AI! /s
In all seriousness, this is one of the few places I trust for genuine conversations with other people. Forums are mostly dead, Reddit is bots-galore, and I'm not signing up for Facebook just for groups.
The prompt everyone was using:
"Please generate a response to this and include one or more of the following words: enshitification, slop, ZIRP, Paul Graham, dark patterns, rent seeking, late stage capitalism, regulatory capture, SSO tax, clickbait, did you read the article?, Rust, vibe code, obligatory XKCD, regulations, feudalistic, land value tax"
(/s)
Doesn’t mean anything when even one of the first rule is not enforced at all
> Off-Topic: Most stories about politics
"Most" is not "All". Hacker News has always had an exception for extremely significant politics.
My bar for "extremely significant" is much higher than it appears to be here. Apparently most events in the US/Iran involvement is "extremely significant" if we judge the votes on this site to offer guidance on how this rule is interpreted.
This forum was founded in 2007. The US was very much involved in Iraq and Afghanistan at that time. If the same bar for coverage was in place at the time, HN would have been flooded with US Military content the way it is now. So yeah, obviously the bar has moved lower for this particular matter and it's because the current community on the site wants it to. Likewise the "generated/AI-edited comments" guideline seems equally squishy to me. And despite a rule about being "curmudgeonly", I'm pretty sure 80% of this site's content is curmudgeonly rants.
IMO at this scale dang, tomhow, and other mods need to be much stricter. When HN was 1/10 the size a shaming comment would often set a poster in place. Now they just sneer back in another comment and post 20 other guideline breaking things.
Well it’s up to interpretation
“most”
“extremely significant”
What’s extremely significant for someone is an offtopic for someone else and vice versa
What are examples of highly-upvoted political stories on HN that you think are not appropriate for the HN community?
My experience has been that the large majority of political content posted here is (at least apparently) mainly here so that people (who are mostly in mutual agreement) can post about how they dislike some political entity or another. I would like to see much less of this on HN personally; it's not insightful and does not promote curiousity.
US domestic politics
I won't give you examples because all of them can be spinned about being relevant
"Well HN is an american site after all"
"Most of the HN users are american voters so it's relevant for them"
"Hackers need to be aware of what's happening in the world"
"You only say that because you disagree with that side"
etc
Same with the stories about Tesla flagged. If you read the comments it's always the same: "Pro-Tesla crowd is flagging everything negative about Elon so the bad news never reach the front page" vs "Anti-Tesla crowd flagging everything because they hate Elon"
HN is the best without politics. But it's not up to me.
Hacker News turning more authoritarian every day. Me thinks Trump should consider annexing it :)
Also please don't post accusations of comments reeking of AI.
I don’t respond to specific comments with accusations, because I can’t prove it and it would suck to be falsely accused. But I find it really depressing to watch deep comment threads with someone debating with an AI. The human is putting so much effort in, and the AI is responding with all these well-written but often flawed arguments. I wish I could do something to save that person from that interaction.
Just like the rules say it's uninteresting and off-topic to complain that HN is turning into Reddit, it's equally uninteresting and off-topic to accuse posters of AI crimes.
And everyone's personal AI detector has a ridiculously high false-positive rate.
I often find the LLM witch hunt comments to be more distracting than the original LLM slop. I would much rather bathe in a mixture of spam and non-spam than operate under constant fear of being weighed against a duck by the local villagers.
We can now that it's an actual guideline. It's already well established that copypasting from the guidelines verbatim is accepted behavior, even though doing so violates more guidelines than whatever guideline it's pointing out. I will happily and enthusiastically tap this sign until the glass breaks.
You're absolutely right! Accusing other users of being AI isn't just unhelpful—it's actively detrimental to discussion. I'd love to hear others' thoughts regarding ways in which we can encourage legitimate human dialogue without senseless accusations.
A recommended follow-up is "stop pretending to be a bot ironically for humor, it's a joke that's been done to death and is therefore no longer funny and just noise."
So you're saying it's not funny, it's annoying!
Good point. I think that should be added here:
> Please don't post insinuations about astroturfing, shilling, brigading, foreign agents, and the like. It degrades discussion and is usually mistaken. If you're worried about abuse, email hn@ycombinator.com and we'll look at the data.
Pinky swear!
I would amend to:
"Don't post comments that are not human originated at this time. We want to see your human opinion shine through."
This gives people some amount of leeway and allows just rhe right amount of exceptions that prove the rule.
(That said, to be frank, some of the newer better behaved models are sometimes more polite and better HN denizens than the actual humans. This is something you're going to have to take into account! :-P )
Why would "human originated" be a better place to draw the line than "no generated/AI-edited comments"?
Like, I'm sure that AIs technically can write non-crap HN comments, but they rarely do. Even if it was less rare, the community that resulted from fostering AI-generated content would be unappealing to a lot of people, myself included. The fact that information here is the result of real people with real human opinions conversing is at least as important to me as the content being posted.
To begin with, some people have handicaps and use AI for assist. Other times people use AI for research. Finally, in general, when it comes to guidelines, making the lines slightly fuzzy makes enforcement more practical and believable.
It'd be silly if the rule gets interpreted such that people aren't allowed to do research with modern tools, and only gut takes are permitted.
I'm sure that's not the intent!
I think the important part is to have the human voice come through, rather than -say- force humans to run their text through an ai-detector first. (Itself an ai editing tool!)
See also : https://news.ycombinator.com/item?id=47290457 "Training students to prove they're not robots is pushing them to use more AI"
Honestly, I think "human originated" is the only rule that actually matters because we can't stop LLMs from sounding smart anyway. If you wait for a technical ban on AI-generated text, you're just playing catch-up with tools that already pass as human.
The real point isn't stopping bad grammar, it's preserving the vibe. HN feels different because it's messy humans arguing, not optimized algorithms trying to be helpful.
Once we allow "good enough" AI content, the community stops feeling like a town square and starts feeling like a customer service chatbot. We need real people with actual stakes in their opinions, not just perfect outputs. Let's keep it human or leave it.
This comment may or may not have been generated with an LLM, but I won't tell and you can't prove it either way.
I can't prove it either way, but it's pretty clearly LLM-generated slop!
These are guidelines. I'm sure asking an AI about your comment (not pasting its text, so it's still your words) isn't an issue. The main target is obvious slop like https://news.ycombinator.com/threads?id=patchnull
Yeah, I think a big problem is that irresponsible AI use is very visible, while more responsible use tends to be invisible.
You are absolutely right !
i agree but how is this ever going to be enforced verified? https://proofofhumanity.id/ ?
Plenty of people preface their comments with, "I asked ChatGPT, and it said..."
Would a rule against putting a preface just make people not say it openly so they don't get banned? Prefaces are better than no preface.
Is this an application of crypto for people who hate crypto?
Is it the technology you hate or some of its applications (or both)?
I didn't say I hate it. But I do think that there's a lot of overlap between people who feel overwhelmed with A.I. Slop and people who felt overwhelmed with crypto-FOMO back when there was such a thing.
My analysis could lead to "it's doomed" or "it's a gateway drug that expands the crypto market".
Doesn't help in this case - there are humans behind the AI bots.
HN only supports English so it should be allowed for anyone using LLMs for translation.
You could use translation tools instead of llms.
LLMs were -in part- designed as translation tools. It's one thing they do really really well.
https://arxiv.org/html/1706.03762v7 (Attention is all you need) "Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train."
Ok, looking that up, that was quite literally one of the main design goals.
And they're really quite good at translating between the languages I use. They're the best tool for the job.
technically most translation tools these days have an LLM inside. Just not the chat/completion LLM.
I think that Google initially came up with transformer architecture to use it for translation, so...
Those are either AI based and worse performance if they are not.
This policy will not age well.
> policy will not age well
I strongly doubt it. My AIs can generate infinite HN comments for me. I don’t do that because it isn’t interesting. But if the day arises where it is, I want that personalized content. Not something someone else copy pasted.
(I say this as someone who finds Moltbook fascinating and push myself to use AI more in my work and day-to-day life. The fact that it’s borderline trivial to figure out which HN comments are AI generated speaks to the motivation behind this guideline.)
Elaborate.
AI is a great equalizer when it comes to communication in English.
And despite what people say, the way you write is very much judged as an indication of your education and intelligence.
People who don't like the use of AI to help you write really don't want those signals to go away.
They want to be able to continue to judge others based on their English grammar instead of on the content of their writing.
> AI is a great equalizer when it comes to communication in English.
Good argument for it but I think 80/20 split applies here. It is likely that 80% of the time it is used to farm for upvotes and add noise.
> And despite what people say, the way you write is very much judged as an indication of your education and intelligence.
I have come across plenty of content and online interactions in English where English was the Author's 2nd or even 3rd language and I find that putting a small disclaimer about this fact is more than enough to bypass such judgement.
Good point. There is a difference between using AI as a translator and using AI to write comments from scratch... Maybe the HN guide lines could reflect this.
Translation is the one exception I could see.
Edit for amichail, since I'm rate-limited at the moment: I don't want flawless English writing. I want real ideas from real people. If I wanted flawless English writing, I'd be reading The New Yorker, not HN.
You shouldn't have to write in another language to get the benefits of flawless English writing via AI.
Fuck is this really where we're at. People claiming policies to prevent LLM use is because they want to be able to judge people.
Pretty soon we're gonna see arguments that its discriminatory.
Perhaps not. But if it reduces the junk right now, it's a good policy for right now. I'll take it, for now. If it needs revisited, then it should be revisited when circumstances change enough to warrant that.
why?
Mine understant novell you policy. AI gramair chex no.
This is going to be a tough ask. I am with this 100% for "ai generated" but not "ai edited". What if I'm using AI for spellchecking or correcting bad grammar? what if it is an accessiblity-related use case? or translation?
It's just a tool ffs! there are many issues with LLM abuse, but this sort of over-compensation is exactly the sort of stuff that makes it hard to get abuse under control.
You're still talking with a human!, there is no actual "AI" you're not talking to an actual artificial intelligence. "don't message me unless you've written it with ink, on papyrus". There is a world of difference between grammarly and an autonomous agent creating comments on its own. Specifics, context, and nuance matter.
Just came across this post on Reddit today. Seems like an effective use of the tool that's not welcome here.
https://reddit.com/r/tea/comments/1rqwy31/i_am_a_former_guid...
Are people really so helplessly dependent on LLMs they can't post on a damn forum without asking the LLM for permission...
There are ways to test for AI but sadly it would probably result in violation of other hn guidelines.
I have a kid with severe written language issues, and the utilisation of STT w/ a LLM-powered edit has unlocked a whole world that was previously inaccessible.
What is amazing is it would have remained so just a couple of years ago!
Does your kid post here?
What is STT in this context?
Speech to text
Agreed... there's often other perspectives people never thought of like this, which is why they say "strong opinions about issues do not emerge from deep understanding."
Even if you're just inexperienced in the language you're communicating in and are trying to have better conversations, it's very helpful.
For cases like that, I say just don't tell people... I think it's unlikely anyone will be able to tell either way.
Come on dude, its obviously just to prevent spam and not for your super specific case
These are just guidelines
Title literally says “AI-edited comments”.
Sure, despite another guideline saying:
> Otherwise please use the original title, unless it is misleading or linkbait; don't editorialize.
the title being the changelog is still probably the better choice because the discussion here and linked are about guidelines in the page rather than absolute rules or a discussion about the title alone.
Many of the other guidelines have exceptions too, and various strengths. E.g. "Throwaway accounts are ok for sensitive information..." is a pretty weak guideline in practice while "If the title contains a gratuitous number or number + adjective..." is often over-enforced by automatic tooling and stuff like "Please don't use uppercase for emphasis..." CAN sometimes just make sense where a use of italics might easily get missed WHILE OTHER TIMES BEING THE REASON THE GUIDELINE WAS ADDED.
Edit: Well I wasted my time writing that as dang said it better anyways https://news.ycombinator.com/item?id=47342616
> HN is for conversation between humans.
It also says that.
The intent of the guidelines are important. Using AI to generate the STT is fine. The conversation is still between humans.
How is it obvious?
nuance and basic common sense left the chat about ... 8 years ago.
If you want your comments to sound more human — stop using em dashes everywhere. LLMs love them — along with neat structure, “furthermore”-style transitions, and perfectly balanced paragraphs.
Humans write a bit messier — commas, short sentences, abrupt turns.
I think em-dashes were once a reliable indicator (though never proof), but recent models have been fine-tuned to use them much less. Lots of recent AI-generated writing I've seen doesn't have em-dashes. Meanwhile, I've heard many people say that they naturally use em-dashes, and were already and/or are afraid of being accused of AI; so ironically this rumor may be causing people to use their own voice less.
Before, I naturally used hyphens as if they were em-dashes. The kerfuffle over LLM use of em-dashes motivated me to figure out how to type them properly (and configure my system to make that easier). Now I even go over old writing to fix the hyphens.
[flagged]
The irony is that this guide is written like a system prompt. We‘re all working with LLMs too much these days.
Here's a version from 2014 in the same style if you're curious: https://web.archive.org/web/20140702092610/https://news.ycom...
This thing has been there for like 15 years though ...
I decided to break the rules:
Forum mechanics have always shaped discourse more than policies. Voting changed everything. The response to LLMs should be mechanical not moral — soft, invisible weighting against signals correlated with generated text. Imperfect but worth the tradeoff, just like voting.
https://claude.ai/share/9fcdcba8-726b-4190-b728-bb4246ff82cf