{"id":14904,"date":"2025-10-22T08:58:13","date_gmt":"2025-10-22T06:58:13","guid":{"rendered":"https:\/\/www.gulliksson.se\/?p=14904"},"modified":"2025-11-03T11:20:32","modified_gmt":"2025-11-03T09:20:32","slug":"can-copyright-protected-material-be-used-to-train-ai-models","status":"publish","type":"post","link":"https:\/\/www.gulliksson.se\/en\/can-copyright-protected-material-be-used-to-train-ai-models\/","title":{"rendered":"Can copyright-protected material be used to train AI models?"},"content":{"rendered":"<div class=\"wpb-content-wrapper\"><p>[vc_row][vc_column][vc_column_text css=&#8221;&#8221;]<\/p>\n<h2>Can copyright-protected material be used to train AI models?\u00a0The European TDM exception in the spotlight<\/h2>\n<p><strong>Introduction: The Copyright Wars<br \/>\n<\/strong>If you have glanced at tech headlines lately, you will know that the battle lines are drawn. On one side: rightsholders, publishers like <em>The New York Times<\/em>, and creative minds such as Sarah Silverman. On the other: AI companies, including OpenAI, eager to train their models with as much data as possible. The burning question? <em>Can copyright-protected material be used to train an AI model?<\/em> The answer is, perhaps not surprisingly: \u201cIt depends.\u201d Let us dive into the European legal jungle, with a special focus on the <em>text and data mining<\/em> (TDM) exception in the EU\u2019s Directive (EU) 2019\/790 (the Copyright Directive), its Swedish implementation, and the legal uncertainties that follow.<\/p>\n<p><strong>The TDM exception: A legal loophole or a license to train?<br \/>\n<\/strong>Directive (EU) 2019\/790 (the Copyright Directive) is implemented in the Swedish Copyright Act. 
As a result, the Copyright Act now contains an exception that allows works protected by copyright to be used for TDM purposes under certain conditions, as long as the creator has not reserved such use in an appropriate manner.<\/p>\n<p>Does this mean that copyright-protected works can be used to train generative AI models? Well, this is a hotly debated topic. Some say the exception was not devised with generative AI in mind, but rather for more traditional machine learning models. Moreover, some consider that the exception restricts copyright disproportionately.<\/p>\n<p>Others argue that the law is flexible enough to cover generative AI models. The Swedish Copyright Act\u2019s definition of TDM is broad: <em>\u201can automated technique used to analyze text and data in digital form for the purpose of generating information.\u201d<\/em> Comparing this wording with the definition in Article 2(2) of the Copyright Directive, it is clear that the examples given in the Copyright Directive (<em>patterns, trends, or correlations<\/em>) are not present in the Copyright Act. From a systematic perspective, this could mean that the Swedish implementation covers a wider array of actions. Does this mean that the training of generative AI models falls within the definition? As we will see below, there are several arguments to support such a conclusion. But first, how does the TDM exception work?<\/p>\n<p><strong>How does the exception work in practice?<br \/>\n<\/strong>Let us break down the four key conditions for using the TDM exception in Sweden:<\/p>\n<ol>\n<li><strong>TDM purposes:<\/strong> Perhaps self-explanatory, but the purpose of the actions must be to use an <em>automated technique to analyze text and data in digital form for the purpose of generating information<\/em>.<\/li>\n<li><strong>Lawful Access:<\/strong> You must have legally obtained the material used for the TDM purpose. 
This means no scraping behind paywalls or downloading from pirate sites. Open-access or purchased content only.<\/li>\n<li><strong>Temporary Copies:<\/strong> Any copies of the works made during the process must be deleted once the mining is done. No hoarding for future use!<\/li>\n<li><strong>Opt-Out Mechanism:<\/strong> The author can reserve their rights, for example, via machine-readable means or website terms. If they do, their works are out of bounds unless you get a license.<\/li>\n<\/ol>\n<p>If you tick all these boxes, you can use copyright-protected works for TDM purposes according to the Swedish Copyright Act.<\/p>\n<p><strong>What counts as \u201cdata mining\u201d?<br \/>\n<\/strong>Now, to the most challenging part: does the training of generative AI fit within the definition of TDM?<\/p>\n<p>As you may be aware, the EU recently adopted the AI Act, which regulates the provision of generative AI models. This Act contains references to the Copyright Directive that can be used to support the conclusion that the training of generative AI is indeed permissible on the basis of the TDM exception.<\/p>\n<p>Firstly, in Recital 105 of the AI Act, it is acknowledged that the training of generative AI models requires access to vast amounts of information, which may interfere with copyright. In the same recital, reference is made to the TDM exception, emphasizing that under certain conditions, authorization from rightsholders is not necessary, unless the rightsholders have opted out of the TDM exception.<\/p>\n<p>Secondly, in Article 53 of the AI Act, which contains obligations for providers of general-purpose AI models, another explicit reference is made to the TDM exception. The article states that such providers shall adopt a policy for complying with EU copyright law, including ensuring compliance with rightsholders\u2019 opt-outs from the TDM exception. 
In other words, providers must ensure that opt-outs from rightsholders can be detected when scanning for useful material.<\/p>\n<p>Beyond the hints in the AI Act, and regardless of whether generative AI models were foreseen by the legislator when the TDM exception was drafted in 2019, the European Court of Justice has expressed the view that originally narrow exceptions can be interpreted more broadly to adapt to new technologies.[1]<\/p>\n<p>Seen from a rightsholder\u2019s perspective, an argument can be made that the TDM exception strikes a fair balance between the interests of AI operators and creators. The growing need for material to train AI models means that creators can leverage the opt-out mechanism to obtain lucrative license deals.<\/p>\n<p>Considering the arguments above, as well as the broad definition of TDM, there is good reason to regard the training of generative AI models as falling within the exception.<\/p>\n<p><strong>Conclusion: The debate continues, but the door is ajar<br \/>\n<\/strong>From the trenches of legal opinions and legislative texts, it seems possible to use copyright-protected material for AI training in Sweden, as long as all the conditions of the TDM exception are fulfilled. However, this question is not one to take lightly, and no guiding judgments on the subject have yet been delivered. Thus, the answer is far from certain, and interpretations may shift. If you have any questions or would like to discuss this further, please don\u2019t hesitate to reach out to someone on the <a href=\"https:\/\/www.gulliksson.se\/en\/services\/emerging-tech\/\" target=\"_blank\" rel=\"noopener\">team<\/a>.<\/p>\n<p><strong>Gulliksson celebrates its 50th anniversary<\/strong><br \/>\n<a href=\"https:\/\/www.gulliksson.se\/50-years\/\" target=\"_blank\" rel=\"noopener\">Take part in glimpses and highlights from our journey from the start in 1975 to today!<\/a><\/p>\n<p>[1] See the judgment in the joined cases C\u2011403\/08 and C\u2011429\/08, para. 
164.[\/vc_column_text][\/vc_column][\/vc_row]<\/p>\n<\/div>","protected":false},"excerpt":{"rendered":"<p>[vc_row][vc_column][vc_column_text css=&#8221;&#8221;] Can copyright-protected material be used to train AI models?\u00a0The European TDM exception in the spotlight Introduction: The Copyright Wars If you have glanced at tech headlines lately, you will know that the battle lines are drawn. On one side: rightsholders, publishers like The New York Times, and creative minds such as Sarah Silverman&#8230;.  <a class=\"excerpt-read-more\" href=\"https:\/\/www.gulliksson.se\/en\/can-copyright-protected-material-be-used-to-train-ai-models\/\" title=\"Read Can copyright-protected material be used to train AI models?\">Read more &raquo;<\/a><\/p>\n","protected":false},"author":7,"featured_media":14896,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[157],"tags":[],"specialomraden":[],"class_list":["post-14904","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-front-news-en"],"acf":[],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/posts\/14904","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/comments?post=14904"}],"version-history":[{"count":1,"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/posts\/14904\/revisions"}],"predecessor-version":[{"id":14905,"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/posts\/14904\/revisions\/14905"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\
/v2\/media\/14896"}],"wp:attachment":[{"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/media?parent=14904"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/categories?post=14904"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/tags?post=14904"},{"taxonomy":"specialomraden","embeddable":true,"href":"https:\/\/www.gulliksson.se\/en\/wp-json\/wp\/v2\/specialomraden?post=14904"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}