Copyright in AI Code — What's Protected and What Isn't

Software copyright thresholds, AI-assisted authorship, and the Oracle v Google aftermath

Ip law for ai builders — Copyright in AI Code — What's Protected and What Isn't
Key takeaways
  • Copyright in software protects the original creative expression in code: specific variable names chosen for non-obvious reasons, the structure of a particular implementation, documentation that goes beyond mere functional description. It does not protect: algorithms, mathematical operations, programming conventions, or the idea of what the code does. The threshold for originality in code is low — but it exists, and purely mechanically generated output that involves no human creative choice may not meet it.
  • AI-generated code authorship: the US Copyright Office (2023) has stated that copyright requires human authorship and will not register works where a human did not meaningfully shape the output. For code where a developer provided detailed prompts, reviewed the output, edited it, and integrated it into a larger system — the developer has a plausible authorship claim for the final integrated work. The AI-generated component alone, without human selection and arrangement, sits in an ambiguous zone.
  • Oracle v Google (2021 SCOTUS): the Supreme Court held that Google's use of the Java SE API declarations in Android was fair use under the four-factor test. The Court did not decide whether APIs are copyrightable — that question remains open. The ruling narrows but does not eliminate the possibility that API structure and method signatures carry copyright protection.
  • Derivative works: code that incorporates LGPL v2.1 libraries may or may not be a derivative work depending on how the linking occurs. AGPL code run as a network service triggers source disclosure obligations. The analysis is license-specific and jurisdiction-specific.
  • Practical steps for collaborative AI development: maintain an authorship log documenting which developer directed, reviewed, and integrated each major module; use copyright notices in all source files; document AI tool usage in code review notes as evidence of human creative involvement.
Risk signals
  • Assuming that because an AI generated code, nobody owns it — this leaves IP in an ambiguous state where competitors can copy without consequence while you cannot enforce.
  • Copying code verbatim from AI output into a commercial product without reviewing training data provenance — if the model was trained on GPL-licensed code, there is a GPL contamination risk.
  • Failing to execute IP assignment agreements with contractors — work for hire applies automatically to employees but not to independent contractors under most jurisdictions' copyright law.
Action items
  • Implement a code provenance policy: document whether each major module was written by a human, generated by AI and reviewed by a human, or generated by AI with minimal human review. The level of human involvement determines the copyright position.
  • Add copyright notices to all source files and ensure employment and contractor agreements include express IP assignment clauses that cover AI-assisted output.
  • For AI-generated code that forms a core competitive component, consider having it reviewed and substantially re-implemented by a human developer to establish clear human authorship for copyright purposes.

Software copyright protects original expression — not algorithms, interfaces, or ideas. The line between what is protected and what is not is genuinely unclear for AI-assisted code, and the 2023–2025 US Copyright Office guidance has not resolved everything.

Key Analysis

Copyright in software protects the original creative expression in code: specific variable names chosen for non-obvious reasons, the structure of a particular implementation, documentation that goes beyond mere functional description. It does not protect: algorithms, mathematical operations, programming conventions, or the idea of what the code does. The threshold for originality in code is low — but it exists, and purely mechanically generated output that involves no human creative choice may not meet it.
AI-generated code authorship: the US Copyright Office (2023) has stated that copyright requires human authorship and will not register works where a human did not meaningfully shape the output. For code where a developer provided detailed prompts, reviewed the output, edited it, and integrated it into a larger system — the developer has a plausible authorship claim for the final integrated work. The AI-generated component alone, without human selection and arrangement, sits in an ambiguous zone.
Oracle v Google (2021 SCOTUS): the Supreme Court held that Google's use of the Java SE API declarations in Android was fair use under the four-factor test. The Court did not decide whether APIs are copyrightable — that question remains open. The ruling narrows but does not eliminate the possibility that API structure and method signatures carry copyright protection.
Derivative works: code that incorporates LGPL v2.1 libraries may or may not be a derivative work depending on how the linking occurs. AGPL code run as a network service triggers source disclosure obligations. The analysis is license-specific and jurisdiction-specific.
Practical steps for collaborative AI development: maintain an authorship log documenting which developer directed, reviewed, and integrated each major module; use copyright notices in all source files; document AI tool usage in code review notes as evidence of human creative involvement.

Risk Signals

Assuming that because an AI generated code, nobody owns it — this leaves IP in an ambiguous state where competitors can copy without consequence while you cannot enforce.
Copying code verbatim from AI output into a commercial product without reviewing training data provenance — if the model was trained on GPL-licensed code, there is a GPL contamination risk.
Failing to execute IP assignment agreements with contractors — work for hire applies automatically to employees but not to independent contractors under most jurisdictions' copyright law.

Action Items

Implement a code provenance policy: document whether each major module was written by a human, generated by AI and reviewed by a human, or generated by AI with minimal human review. The level of human involvement determines the copyright position.
Add copyright notices to all source files and ensure employment and contractor agreements include express IP assignment clauses that cover AI-assisted output.
For AI-generated code that forms a core competitive component, consider having it reviewed and substantially re-implemented by a human developer to establish clear human authorship for copyright purposes.

LinkedIn

Technical Deep Dive

Read the technical deep dive

See the implementation walkthrough on govindpreetsingh.com

Read on govindpreetsingh.com →

Request a consultation

This is a lightweight intake endpoint for now. It is structured so the practice management system can later take over scheduling, conflict checks and matter creation.

Submitting this form does not create an advocate-client relationship. Please avoid sending confidential details until engagement is confirmed.