cross-posted from: https://lemmy.world/post/76533

One of the arguments made for Reddit’s API changes is that they are now the go to place for LLM training data (e.g. for ChatGPT).

https://www.reddit.com/r/reddit/comments/145bram/addressing_the_community_about_changes_to_our_api/jnk9izp/?context=3

I haven’t seen a whole lot of discussion around this and would like to hear people’s opinions. Are you concerned about your posts being used for LLM training? Do you not care? Do you prefer that your comments are available to train open source LLMs?

(I will post my personal opinion in a comment so it can be up/down voted separately)

  • HairHeel@programming.dev
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    Reddit has every right to charge for their API, but the amount they wanted to charge was too high.

    Other use cases aren’t relevant here either. They could have come to an agreement with Apollo etc that would have charged them reasonable rates while charging more to data scrapers. They could have done ads and dev share on the mobile apps. Most people wouldn’t have objected to that.

    That part’s not a Reddit-specific problem though. I’ve seen a similar pattern play out at several companies I work for:

    • charge extra for a new premium feature
    • a new client with deep pockets comes along and wants part of that feature, but doesn’t want all of it, so doesn’t want to pay for it
    • sales really wants to catch this big fish
    • sales promises to build a new feature that does the same thing as the existing feature
    • the company loses more money than they would have by just giving the feature away for free, since now they’re also paying engineers to build the free version.