OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling’s Harry Potter series::A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.

  • Hildegarde@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    1 year ago

    A copyright holder has the right to control who has the right to create derivative works based on their copyright. If you want to take someone’s copyright and use it to create something else, you need permission from the copyright holder.

    The one major exception is Fair Use. It is unlikely that AI training is a fair use. However this point has not been adjudicated in a court as far as I am aware.

    • LordShrek@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      this is so fucking stupid though. almost everyone reads books and/or watches movies, and their speech is developed from that. the way we speak is modeled after characters and dialogue in books. the way we think is often from books. do we track down what percentage of each sentence comes from what book every time we think or talk?

    • FatCat@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      It is not a derivative it is transformative work. Just like human artists “synthesise” art they see around them and make new art, so do LLMs.