In the second of two “fair use” court cases involving language model training this week, a court has ruled that Meta did not infringe copyright when it used copyrighted book material to train its models. The latest decision, though, does turn on procedural grounds, not the core issue of whether copyrighted material is fair use under all circumstances, perhaps most notably when permission to use the materials for this purpose is absent.
The prior decision upholding fair use of copyrighted material for model training was won by Anthropic.
That copyright case also tested fair use law as it applies to reading books, which is essentially what model training involves. For humans, reading books is considered fair use, and not a copyright violation. It has been unclear whether an artificial intelligence model, reading books, also is fair use, or not.
The case partially validates the idea of fair use for training purposes, without compensation or permission, even if in this case the court upheld Anthropic’s use of purchased books for training purposes.
In other words, training AI on lawfully acquired copyrighted works was deemed fair use.