Basically a deer with a human face. Despite probably being some sort of magical nature spirit, his interests are primarily in technology and politics and science fiction.

Spent many years on Reddit before joining the Threadiverse as well.

  • 0 Posts
  • 1.91K Comments
Joined 2 years ago
cake
Cake day: March 3rd, 2024

help-circle





  • It’s a common phenomenon. Look at how stunned so many people were when Trump won, for example. How could he have won? Everyone they knew voted against him! Everyone they talked to agreed that he was terrible! Except of course they were only talking to like-minded people, so they had no idea what “everyone” actually thought. Same thing applies to Trump supporters when he lost in 2020, they couldn’t believe it.

    Places like the Fediverse are practically designed to become echo chambers. Look at the upvote/downvote mechanism, how’s the balance look in here for comments critical of AI versus comments that aren’t critical of it? I know karma is meaningless, even moreso here than on places like Reddit, but it’s still a psychological and social pressure declaring “you’re not one of us, your opinion is bad and you should feel bad.” Naturally the people who would post such unpopular opinions will tend to stop posting over time.

    Heck, I’ve got my own personal stalker following me around lately posting about how much of a “troll” I am because I don’t toe the “AI bad” line. Disagreement with the local consensus is bad and wrong and I’m supposed to just shut up and go away I suppose. Doesn’t bother me, but people are social beings so I’m not surprised it bothers others.












  • FaceDeer@fedia.iotoTechnology@beehaw.org*Permanently Deleted*
    link
    fedilink
    arrow-up
    1
    arrow-down
    1
    ·
    13 days ago

    Alright, so instead of simply saying “include external data in your training run”, extend that to “and also filter the data to exclude erroneous stuff.” That’s a routine part of curating training data in real-world AI training as well, I was already writing a lot so I didn’t feel like adding more detail there would have enhanced it.

    The basic point remains the same, that real world training accounts for the things that were necessary to force model collapse to happen in that old paper I linked. It’s a solved problem. We can see that it’s solved by the fact that AI models continue to get better, despite an increasing amount of AI-generated data being present in the world that training data is being drawn from. Indeed, most models these days use synthetic training data that is intentionally AI-generated.

    A lot of people really want to believe that AI is going to just “go away” somehow, and this notion of model collapse is a convenient way to support that belief. So it’s very persistent and makes for great clickbait. But it’s just not so. If nothing else, the exact same training data that was used to create those earlier models is still around. AI models are never going to get worse than they are now because if they did get worse we’d just throw them out and go back to the earlier ones that worked better, perhaps re-training with the same data but better training techniques or model architectures.


  • FaceDeer@fedia.iotoTechnology@beehaw.org*Permanently Deleted*
    link
    fedilink
    arrow-up
    1
    arrow-down
    1
    ·
    13 days ago

    Model collapse comes from using only training data generated by previous generations.

    All that’s needed to avoid it is to add training data that isn’t directly from the previous “generation” of the LLM in question. The thing that causes model collapse is the loss of data from generation to generation, so you just need to keep the training data “fresh” with stuff that wasn’t directly generated by the earlier generation of your model.

    You could do that with archived material you used for previous training runs. For more recent events you could do that with social media feeds. The Fediverse, for example, would probably be a perfectly fine source of new stuff. Sure, there’s some AI-generated stuff mixed in, but that’s not “poison.”

    As I mentioned, the article that demonstrated model collapse did it using a very artificial set of circumstances. It’s not how real AI training is done.


  • FaceDeer@fedia.iotoTechnology@beehaw.org*Permanently Deleted*
    link
    fedilink
    arrow-up
    3
    arrow-down
    2
    ·
    13 days ago

    The main mechanism leading to model collapse in that paper, as I understand it, are the loss of “rare” elements in the training data as each generation of model omits things that just don’t happen to be asked of it. Like, if the original training data has just one single line somewhere that says “birds are nice”, but the first generation of model never happens to be asked what it thinks of birds then this bit of information won’t be present in the second generation. Over time the training data becomes homogenized. It probably also picks up an increasing load of false or idiosyncratic bits of information that were hallucinated and get reinforced due to random happenstance, it’s been a long time since I read the article and details slip my mind.

    I’m really not seeing how human filtering would mimic this process, so I think it’s safe. The filtering is being done with intent in that case, not due to random drift as is done with a purely automated generation like was done in the paper.


  • FaceDeer@fedia.iotoTechnology@beehaw.org*Permanently Deleted*
    link
    fedilink
    arrow-up
    11
    arrow-down
    2
    ·
    13 days ago

    Semantic quibbling is one of the least interesting kinds of internet debate, so replace the word “understanding” with whatever word makes you happy. I continued with “and talking about” right afterwards so you can just delete the word entirely and the sentence still works fine. You could have just kept reading.

    Since you didn’t read the rest of my comment, I should note that the rest of it after that sentence is about the other issue that OP raised and not even about model collapse at all.

    Anyway. The article about model collapse that I see still crop up every once in a while is this one. It’s not that it has “methodological errors”, though, it’s just that it uses a very artificial training protocol to illustrate model collapse that doesn’t align with how LLMs are actually trained in real life. It’s like demonstrating the effects of inbreeding in animals by crossing brothers and sisters for twenty generations straight - you’ll almost certainly see some strong evidence, but it’s not a pattern of breeding that you are actually going to see in the wild.