Pynchon works among set to train AI systems
Joseph Tracy
brook7 at sover.net
Fri Sep 29 16:03:53 UTC 2023
Wow, can you say something about where this data set was stored. Do you think it likely that a similar set exists of visual images/artworks. I hope this leads to some serious restraint on AI tech, and legal accountability for copyright infringement.
> On Sep 29, 2023, at 11:17 AM, rich <richard.romeo at gmail.com> wrote:
>
> FYI
>
> https://www.theatlantic.com/technology/archive/2023/09/books3-database-generative-ai-training-copyright-infringement/675363/
>
> This summer, I acquired a data set of more than 191,000 books that were
> used without permission to train generative-AI systems by Meta, Bloomberg,
> and others. I wrote in *The Atlantic *about
> <https://www.theatlantic.com/technology/archive/2023/08/books3-ai-meta-llama-pirated-books/675063/>
> how
> the data set, known as “Books3,” was based on a collection of pirated
> ebooks, most of them published in the past 20 years.
>
>
> - Against the Day
> - Al Límite (spanish Edition)
> - Bleeding Edge: A Novel
> - Inherent Vice
> - La subasta del lote 49 (Andanzas) (Spanish Edition)
> - Mason & Dixon
> - Mason & Dixon
> - Slow Learner
> - The Crying of Lot 49
> - Vente à la criée du lot 49
> - Vicio propio
> - Vineland
> --
> Pynchon-L: https://waste.org/mailman/listinfo/pynchon-l
More information about the Pynchon-l
mailing list