1 of 19

Fair Use and Licensing

with AI

Rachael Samberg

CCDO

21 June 2024

2 of 19

Leveling the pitch: How is AI being used?

  • Non-generative AI (in use for years)
  • Generative AI (new)

3 of 19

  • Reproduction
  • Distribution
  • Display
  • Performance
  • Derivative works

Why does fair use matter in computational research?

4 of 19

FAIR USE IS

ESSENTIAL SUPPORT

5 of 19

Computa-

tional research is fair use

6 of 19

In(put)-n-Out(put)

7 of 19

For scientific research by cultural heritage institutions & research organizations:

  • TDM Permitted: Can conduct TDM & retain copies of mined works for scientific research and verification.
  • No AI Training Opt-Outs: Copyright owners may not opt out of allowing works to be used for AI training for scientific research
  • No contractual Override: License agreements cannot abrogate either of these rights.
  • Appropriate security measures

8 of 19

CONTRACTUAL

OVERRIDE

Even if a use is fair, or if the content is not protected by copyright at all, there may be a contract that restricts scraping, TDM, AI, and/or breaking DRM to do TDM or use AI

9 of 19

New agreements are banning AI

…and fair use savings clauses

aren’t enough to preserve AI rights

10 of 19

Known publisher concerns

  1. Security
  2. Competing or commercial product
  3. Training public version of third-party tool
  4. Charging separately to make more money

11 of 19

Sample Language

Restrictions on Use of Subscribed Products: Licensee and Authorized Users may not:

  1. use the Subscribed Products in combination with an artificial intelligence tool (including to train an algorithm, test, process, analyse, generate output and/or develop any form of artificial intelligence tool);

12 of 19

Fixing Sample Language

Restrictions on Use of Subscribed Products: Licensee and Authorized Users may not:

  1. use the Subscribed Products in combination with an artificial intelligence tool (including to train an algorithm, test, process, analyse, generate output and/or develop any form of artificial intelligence tool) to the extent doing so would: create a competing or commercial product or service for use by third parties; unreasonably disrupt the functionality of the Subscribed Products; or reproduce or redistribute the original Subscribed Products to third parties. Further, artificial intelligence tools may not be used without commercially reasonable information security standards to undertake, mount, load, or integrate the Subscribed Products on Licensee’s or Authorized Users’ servers or equipment.

13 of 19

Concern #3

AI in the wild

Solve this by addressing

USING AI VS. TRAINING AI

14 of 19

THIRD PARTY MOST RESTRICTIVE

Licensee and Authorized Users may not: use the Subscribed Products in combination with an artificial intelligence tool to the extent doing so would: create a competing or commercial product or service for use by third parties; unreasonably disrupt the functionality of the Subscribed Products; or reproduce or redistribute the original Subscribed Products to third parties. Further, artificial intelligence tools may not be used without commercially reasonable information security standards to undertake, mount, load, or integrate the Subscribed Products on Licensee’s or Authorized Users’ servers or equipment.

In addition, Licensee and Authorized Users are explicitly prohibited from using the Subscribed Products in combination with any third-party generative AI tool except pursuant to an enterprise or API license under which the use of Subscribed Products does not train the AI tool or improve the third party’s services. Further, any such use of the third-party generative AI tool under these circumstances is limited to use in a controlled computing environment (e.g. ring fenced) operating under the University’s control; no data or Subscribed Products are shared with third parties; and all Subscribed Products are removed from the secure computing environment at the termination of this Agreement.

15 of 19

THIRD PARTY LEAST RESTRICTIVE

Licensee and Authorized Users may not: use the Subscribed Products in combination with third-party generative artificial intelligence tool …except where such third-party generative artificial intelligence tool:

  1. is used locally in a self-hosted environment or closed hosted environment solely for use by Subscriber or Authorized Users;
  2. is not trained or fine-tuned using the Subscribed Products or any part thereof, unless pursuant to a license entered into by Subscriber that imposes commercially reasonable security measures and precludes public release or exchange of the trained tool or its data with a third party; and
  3. does not share the Subscribed Product or any part thereof with a third party.

16 of 19

Home-grown non-generative;

Home-grown generative;

Third-party Non-Generative AI

Can be used, provided:

  • Not used to make competing or commercial product for third parties
  • Doesn’t unreasonably disrupt functioning of licensed products
  • Doesn’t reproduce/redistribute licensed products to third parties
  • Commercially reasonable security measures undertaken

Third-party Generative AI

Can be used, provided:

  • No training of the third-party tool occurs, unless such training is pursuant to a license that precludes public release or exchange of the trained tool or its data with a third party, and also imposes commercially reasonable security measures
  • No licensed products are shared with any third party

17 of 19

We need a

united front

Consistency, expertise, and labor required

18 of 19

https://osc.universityofcalifornia.edu/2024/03/fair-use-tdm-ai-restrictive-agreements/

19 of 19

See also: