diff --git a/Meetings/AIsessions/index.md b/Meetings/AIsessions/index.md index 53d693b..e4a3441 100644 --- a/Meetings/AIsessions/index.md +++ b/Meetings/AIsessions/index.md @@ -5,7 +5,7 @@ title: AI sessions During 2025, a set of meeting presented and interrogated the use of Artificial Intelligence (AI) in reading systems (RS). -* July 17th, [Daniel Weck, AI use cases and technical considerations in Thorium Reader, an open source reading system] +* July 17th, [Daniel Weck, AI use cases and technical considerations in Thorium Reader, an open source reading system](https://w3c.github.io/publishingcg/Meetings/Minutes/2025-07-17-publishingcg.html) * June 19th, [Lars Wallin, Colibrio approach of AI](https://www.w3.org/2025/06/11-publishingcg-minutes.html) * May 15th, [Senthil Nathan, CEO of Ailaysa, about Steps towards Responsible Digital Publishing: Content Exclusion and AI Training, and present the Chaï reader AI capacities.](https://www.w3.org/2025/05/15-publishingcg-minutes.html) * March 20th [Nick Brown (VP Product Vitalsource): From Principles to Practice - Responsible AI for Enhanced Student Engagement in Reading Systems](https://www.w3.org/2025/03/20-publishingcg-minutes.html) diff --git a/Meetings/Minutes/2024-02-15-publishingcg.html b/Meetings/Minutes/2024-02-15-publishingcg.html new file mode 100644 index 0000000..01b3682 --- /dev/null +++ b/Meetings/Minutes/2024-02-15-publishingcg.html @@ -0,0 +1,230 @@ + + + + + +Publishing Community Group: Plenary Session – 15 February 2024 + + + + + + + + + + +
+

W3C

+ +

+Publishing Community Group: Plenary Session

+

15 February 2024

+ + +
+ +
+
+

Attendees

+
+
Present
ashley, AvneeshSingh, CharlesL, duga, George, gpellegrino, graham, Hadrien, ivan, jeffrey_griggs, jonas_lillqvist, ken_jones, LauraB, laurent_le_meur, liisamk, miia_kirsi, mike_baker, paul_belfanti, paul_gilius, rickj, sagiv, tzviya, vincent_nicotina, wendyreid, wolfgang
+
Regrets
-
+
Chair
wolfgang
+
Scribe
duga, wendyreid
+
+
+ + +
+ +
+

Meeting minutes

+

<wolfgang> date: 2024-02-15

+

<tzviya> scribe?

+

wolfgang: Welcome to the plenary, starting with an update from the accessibility task force

+

<AvneeshSingh> https://w3c.github.io/publ-a11y/UX-Guide-Metadata/draft/principles/?updated

+
+ +
+

Accessibility task force

+

AvneeshSingh: Focus is on the guide for retailers and distributors for understanding the metadata
+… this is an update to an existing doc
+… we have found that one approach doesn't work for everyone
+… so now we have different types of metadata targeted to different groups
+… VitalSource and ??? have committed to implement
+… We are no longer using such fixed recs, instead being a little broader

+

<AvneeshSingh> https://w3c.github.io/publ-a11y/UX-Guide-Metadata/draft/principles/?updated

+

AvneeshSingh: There are some technique documents explaining how to extract metadata
+… There are more technique docs coming for other types of metadata

+

<AvneeshSingh> https://www.w3.org/publishing/a11y/schema-a11y-summary/

+

<AvneeshSingh> https://www.w3.org/publishing/a11y/audio-playback/

+

AvneeshSingh: See the two links to the docs

+

George: Trying to get feedback for the schema document.
+… There are some English strings in there, we are trying to make sure they are good in English, but we plan to add a localization method for them
+… VitalSource will localize some of these

+

AvneeshSingh: We realized there are some things on the edge of accessibility, but have broader implications
+… We want to bring these back to the group so we only work on things people are interested in

+

George: #70 - get citation
+… It's hard to get a page number for citations

+

<wendyreid> w3c/publishingcg#70

+

George: we want to see if the CG is interested in the feature

+

<wendyreid> w3c/publishingcg#71

+

George: Next bookmarks, annotations and export of such
+… There is partial support in some reading systems
+… hard to compare. Purely RS, but we would test and evaluate
+… #72 - read aloud

+

<wendyreid> w3c/publishingcg#72

+

George: There are a lot of distracting things (footnotes, citation refs, etc)
+… DAISY has skippability that can be toggled
+… Finally virtual pages, discussed but never resolved
+… VitalSource has implemented something, as has Lars

+

<ivan> w3c/publishingcg#73

+

George: Companies like Ebsco (???) could insert real page numbers, but they want it to be a generally used algorithm
+… So other versions of the book would have the same page breaks
+… Would like to hear what people think
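For illustration only, one shape a "generally used algorithm" for virtual pages could take is to derive synthetic page breaks purely from character counts, so that any reading system applying the same rule to the same content computes identical breaks. The TypeScript sketch below is hypothetical; the 1024-character page size and the function names are assumptions, not anything the group has agreed.

```typescript
// Deterministic "virtual page" sketch: synthetic pages come only from character
// counts, so different reading systems using the same chunk size agree on breaks.
// The 1024-character page size is an illustrative assumption, not a spec value.
const CHARS_PER_VIRTUAL_PAGE = 1024;

interface VirtualPageMap {
  totalPages: number;
  pageAt(globalCharOffset: number): number; // 1-based page for an offset into the whole book
}

function buildVirtualPageMap(spineTextLengths: number[]): VirtualPageMap {
  const totalChars = spineTextLengths.reduce((sum, n) => sum + n, 0);
  const totalPages = Math.max(1, Math.ceil(totalChars / CHARS_PER_VIRTUAL_PAGE));
  return {
    totalPages,
    pageAt(globalCharOffset: number): number {
      const clamped = Math.min(Math.max(globalCharOffset, 0), Math.max(totalChars - 1, 0));
      return Math.floor(clamped / CHARS_PER_VIRTUAL_PAGE) + 1;
    },
  };
}

// Example: three spine items with 3000, 12000 and 800 characters of text content.
const map = buildVirtualPageMap([3000, 12000, 800]);
console.log(map.totalPages);   // 16
console.log(map.pageAt(3100)); // 4
```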

+

ivan: For 71, EDRlab may have a project going here - can we work with them?
+… Are 70 and 73 related?

+

<wendyreid> https://docs.google.com/document/d/11GypOjE9xOTaINATl5bxVIA3Mc9jzNBGCr6GT_KNaQ4/edit?pli=1

+

ivan: The page numbers for 70 seem to be very related to 73

+

George: Unless there is an alternative to page numbers

+

ivan: Also related to annotations (want to refer to the text)
+… need to reference the page somehow
+… Need to anchor somehow

+

George: These aren't shared annotations

+

Hadrien: Not from EDRlab technically, but involved with them
+… the EPUB itself contains the annotations; when you open the EPUB you get the annotations
+… the idea is that it is self-contained
+… tricky part is not just anchoring, but also context
+… often need to embed a lot of information
+… Also know percentage into the book, DOM ranges, etc
+… need both anchors and context
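To make the "anchors and context" point concrete, the sketch below shows the kind of record a self-contained, exportable annotation might carry. The selector shapes follow the W3C Web Annotation model; the progression field, file path and other specifics are illustrative assumptions rather than an agreed format.

```typescript
// Hypothetical portable annotation: an anchor (where in the markup) plus context
// (surrounding text) so it can be re-attached even if the markup shifts slightly.
interface TextQuoteSelector {
  type: "TextQuoteSelector";
  exact: string;    // the highlighted text itself
  prefix?: string;  // a little text before it, for disambiguation
  suffix?: string;  // a little text after it
}

interface TextPositionSelector {
  type: "TextPositionSelector";
  start: number;    // character offsets into the resource's text content
  end: number;
}

interface PortableAnnotation {
  source: string;                                          // spine item the annotation lives in
  selectors: (TextQuoteSelector | TextPositionSelector)[];
  progression?: number;                                    // 0..1 position in the book (assumption)
  body?: string;                                           // the reader's note, if any
  created: string;                                         // ISO timestamp
}

const example: PortableAnnotation = {
  source: "OEBPS/chapter-03.xhtml",
  selectors: [
    { type: "TextQuoteSelector", exact: "the whale", prefix: "harpoon at ", suffix: ", and" },
    { type: "TextPositionSelector", start: 1042, end: 1051 },
  ],
  progression: 0.37,
  body: "Compare with chapter 1.",
  created: "2024-02-15T16:00:00Z",
};

console.log(JSON.stringify(example, null, 2));
```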

+

duga: Just wanted to say 70/73 are likely the same
+… the TF determined that the only way people understand progress is through page numbers; no one likes percentages, which makes 70 and 73 the same
+… also the impression of the indexers

+
+ +
+

Anti counterfeit Task Force

+

liisamk: Anti counterfeit TF
+… not much discussion since the start of the year
+… We are socializing the ISCC in the hopes of bringing in more people
+… We are kicking back up on Friday
+… The next piece is digging into how ISCC and ??? work together
+… how do we start socializing the next piece of the trust chain

+
+ +
+

Fixed Layout Accessibility TF

+

wendyreid: FL a11y TF

+

https://w3c.github.io/epub-specs/epub33/fxl-a11y/

+

wendyreid: Working on guidance doc for FL a11y - it is now in complete draft state
+… please read and give feedback
+… Mostly for publishers/authors, but there are also recs for RS at the end

+


+

wendyreid: discuss media overlay, tables, etc
+… will produce some samples

+

w3c/publishingcg#69

+

<vince> can someone repost the fixed layout guidelines here? Sorry, I just joined the IRC now, and can't see any links above

+

George: I heard that Hadrien recommended that if you could get correct reading order, we could claim accessibility for FL documents
+… that is the biggest issue, getting things in the correct order

+

wendyreid: Have discussed a reflowable mode
+… It's still experimental, not in the main document
+… would like to explore possibility of this and how to specify this for reading systems

+

George: Would there be a validator?

+

wendyreid: Still way too early

+

Hadrien: Context - this is looking at the EU directive and what people can actually do
+… For instance you need to be able to do ??? and you can't, but we might be able to make a system that could
+… We need to document how TTS actually works, very few people know
+… Out of that work we can make a best practice document
+… documenting would be the first step

+

CircularKen: The document we have been working on is the start of what we will eventually be able to do
+… already in the doc is reading order and image descriptions
+… Then we can add an additional way to read it once we have this groundwork laid

+

w3c/publishingcg#69

+
+ +
+

Generate reflowable fixed layout books

+

wolfgang: One aspect is a11y and the other is adapting to different viewports

+


+

CircularKen: Fundamentally need to start with well made and designed docs
+… we should discard placement, etc and just have a simple replacement of CSS that disregards positioning and styling
+… Then we have the order with descriptions, etc with page markers. Should plan on designed version and a stripped back reflow version

+

wendyreid: Ken described it as we have it
+… A lot of what we say now is text should be live (actual text), visual order and programmatic should match, we should have image descriptions
+… Also some recs on what to do when you cross the fold (spread over 2 FL pages)
+… That is the current emphasis. Follow best practices and the advanced stuff will follow

+

AvneeshSingh: Have we looked at the problem from the other side?
+… That is can we start with something that has all the proper structure, then create a FL doc entirely from CSS

+

wendyreid: You mean convert reflow to FL?

+

<Zakim> tzviya, you wanted to respond

+

AvneeshSingh: Yes, basically start with a proper flowing doc, then just apply CSS to make it look right

+

tzviya: This isn't really considering how publishers work
+… doing something like that is probably not feasible

+

liisamk: There is a moment here to socialize good use of FL
+… there are a lot of people who use FL when they just don't want to make a flowing doc
+… because it is easier
+… This may be a good opportunity to push people to flowing text since they are thinking about what it really means to make something FL
+… very little needs to be FL

+

wendyreid: We need to get people to question whether content needs to be FL
+… Sometimes positioning helps with a11y (having images adjacent to text may help some readers)
+… so sometimes FL can help a11y

+

liisamk: This gets me back to mixed formats
+… A single FL page in a reflowable book would be really nice

+

<AvneeshSingh> +1 Liisa

+

Also +1 to Liisa

+
+ +
+

Extracting textual content

+

Hadrien: Goes way beyond extracting text
+… eg language used
+… Need a separate structure for TTS and creating reader mode
+… though could use the same structure for both

+

George: We are getting feedback from students with dyslexia that read aloud is inadequate for their needs
+… I agree we need to improve the description of how it is done (highlighting, speed, etc)

+


+

George: Need a lot of control in the TTS

+

wolfgang: Next, how do we render in reflowable mode

+

AvneeshSingh: This kind of thing is done by the screen readers
+… Is there a need to tie this to FL?
+… isn't it more of a generic thing, how to extract and read the text?
+… Seems like a big topic, and screen readers have researched it for years

+

Hadrien: Agree, this is beyond FL
+… some specific FL things do exist (e.g. small content chunks)
+… but in general it should be for all epub

+

wendyreid: ARIA wg is also interested in the same topic
+… may even become an all-web topic

+

George: Do they join us or do we join them?

+

wendyreid: Good question. May even need its own CG. Hard to tell at this point

+
+ +
+

Webtoons

+

wendyreid: PMWG is discussing a potential change for this
+… current proposal is to expand FLOW-CONTINUOUS to FL.

+

laurent_: We did not discuss pronunciation for TTS purposes

+
+
+ + +
Minutes manually created (not a transcript), formatted by scribe.perl version 221 (Fri Jul 21 14:01:30 2023 UTC).
+ + + diff --git a/Meetings/Minutes/2025-03-20-publishingcg.html b/Meetings/Minutes/2025-03-20-publishingcg.html new file mode 100644 index 0000000..3cf5915 --- /dev/null +++ b/Meetings/Minutes/2025-03-20-publishingcg.html @@ -0,0 +1,129 @@ + + + + + +W3C Publishing Community Group Plenary: Nick Brown (VP Product Vitalsource): From Principles to Practice - Responsible AI for Enhanced Student Engagement in Reading Systems + + + + + + + + + + +
+

W3C

+ +

W3C Publishing Community Group Plenary: Nick Brown (VP Product Vitalsource): From Principles to Practice - Responsible AI for Enhanced Student Engagement in Reading Systems

+

20 March 2025

+ + +
+ +
+
+

Attendees

+
+
Present
gautierchomel, liisamk, rickj, wolfgang
+
Regrets
-
+
Chair
wolfgang
+
Scribe
rickj
+
+
+ + +
+ +
+

Meeting minutes

+

<wolfgang> Agenda: Nick Brown (VP Product Vitalsource): From Principles to Practice - Responsible AI for Enhanced Student Engagement in Reading Systems

+

wolfgang: welcome to all

+

Nick: From Principles to Practice overview
+… VitalSource overview... talking about 'the engagement challenge'
+… who are we? 28 m units delivered last year, 19+m users served, users around the world, localized in 37 languages
+… we act as a 'learning delivery network' sitting between learning providers and institutions/students/bookstores
+… How do we do this? 1. Driving day 1 affordable access for millions. 2. Helping students stay engaged on Day 2 and beyond
+https://research.vitalsource.com/research researching what works
+… more than 30 published papers about learning science
+… engagement is step 1

+

https://dl.acm.org/doi/10.1145/3576050.3576086 referenced
+… shows students not reading

+

14% - the average share of assigned textbook pages read by students
+… one solution: VitalSource CoachMe.

+

feature inside Bookshelf reader, adds AI generated low-stakes formative practice questions inside the textbook reading experience

+

generated from the textbook, not a general LLM model

+

instructors can assign engagement with these questions as a part of their grade

+

uses VitalSource proprietary AI for automatic question generation. Based on the Doer Effect from Carnegie Mellon OLI work

+

'practice while reading causes 6x more learning gain than reading alone'

+

bring active learning to a 'passive medium'

+

(deeper dive into how it works)

+

timeline of deployment and achievement

+

21 million AI generated questions answered in a learning context

+

published 35 peer reviewed papers

+

3 best paper awards

+

a study of an Iowa State University course, using an A/B test with the same book and looking at how many times students opened their book, showing a dramatic improvement in engagement when CoachMe questions are assigned as part of the grade

+

multiple school study showing increase in engagement by students
+… 'but what about Gen AI?'

+

https://www.aacu.org/research/leading-through-disruption discussed

+

explosion of GenAI usage

+

Key opportunities: things we can do for students and faculty that could not be done before

+

Pros and cons of using AI

+

The importance of responsible AI use: https://get.vitalsource.com/ai-principles

+

referenced 1EdTech rubric https://www.1edtech.org/standards/ai-rubric

+

Why do these principles matter?

+

avoid 'AI for AI's sake', focus on real learning gains not hype, maintain strong publisher and institutional partnerships

+

where we are heading next...

+

High quality AI answers aligned with textbook content, no model training, no IP leakage, SOC2 compliant use with LLMs, DRM protected

+

Q&A

+

gautierchomel: when content/questions are generated, we need an evaluation methodology and a way to advise the user this is AI generated. Have you been thinking about this? Is it possible?

+

uptownnickbrown: that concern drove a lot of our development and decisions. 'generation' is a misnomer, as nothing is 'generated'. There is zero risk, as the sentence for the 'fill in the blank' is from the book.
+… there are feedback mechanisms inside the book to give positive/negative feedback on the questions
+… recommend: have clear disclaimers that this is AI generated content. It's the right ethical thing to do.
+… measuring quality is also critical, and hard.
+… automated judges that use LLMs to judge LLM responses, trained to do different tasks in different ways. Judge a few different things at the same time (factually accurate, ...)
+… reduce hallucinations by having the concrete source material
+… comfortable with a 'the book does not address that question' response
+… also trying to evaluate the underlying pedagogy behind the model
+… 'can't answer that... best I can do is recommend you read page 62 of the book' type of answers
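As a rough sketch of the "automated judges" idea mentioned above (a second LLM call grading a generated question against the source passage on several criteria at once), something like the following could be used. It is not VitalSource's pipeline; it assumes the OpenAI Node SDK, an API key in the environment, and illustrative criteria and model names.

```typescript
// Hypothetical "LLM as judge" check for an auto-generated practice question.
import OpenAI from "openai";

const client = new OpenAI(); // reads OPENAI_API_KEY from the environment

interface JudgeVerdict {
  factually_grounded: boolean;   // answerable from the passage alone
  unambiguous: boolean;          // exactly one defensible answer
  pedagogically_useful: boolean;
  comments: string;
}

async function judgeQuestion(passage: string, question: string): Promise<JudgeVerdict> {
  const response = await client.chat.completions.create({
    model: "gpt-4o-mini",
    response_format: { type: "json_object" },
    messages: [
      {
        role: "system",
        content:
          "You grade auto-generated practice questions. Reply with JSON only, shaped as " +
          '{"factually_grounded": bool, "unambiguous": bool, "pedagogically_useful": bool, "comments": string}.',
      },
      { role: "user", content: `Passage:\n${passage}\n\nQuestion:\n${question}` },
    ],
  });
  return JSON.parse(response.choices[0].message.content ?? "{}") as JudgeVerdict;
}

// judgeQuestion(chapterText, "Photosynthesis converts ____ into glucose.").then(console.log);
```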

+

liisamk: have you thought about using AI for 'other things' with your reading system?

+

uptownnickbrown: a few ways to tie this into the reading system. Things like making good flashcards (hard to do now).
+… also looking at ways to evolve search beyond a 'find' function
+… search with mixed languages...
+… alt-text for screen readers
+… lots of places to pervade the reading system

+

Michalis0: are questions created in real time, or pre-loaded?

+

uptownnickbrown: yes, the CoachMe questions are pre-generated
+… also aligned with where to ask the question in the book flow
+… may evolve over time as AI improvements come
+… over 1,000,000 questions in production at scale

+

wolfgang: am I right that you mainly use NLP and not LLMs?

+

uptownnickbrown: Yes. That's exactly how we built CoachMe

+

wolfgang: did you take into account the AI legislation from the EU?

+

uptownnickbrown: we are starting to think more about that now, as it was not in place at the time.

+


+
+
+ + +
Minutes manually created (not a transcript), formatted by scribe.perl version 244 (Thu Feb 27 01:23:09 2025 UTC).
+ +
+

Diagnostics

+

Found 'Agenda:' not followed by a URL: 'Nick Brown (VP Product Vitalsource): From Principles to Practice - Responsible AI for Enhanced Student Engagement in Reading Systems'.

+

Maybe present: Michalis0, Nick, uptownnickbrown

+

All speakers: gautierchomel, liisamk, Michalis0, Nick, uptownnickbrown, wolfgang

+

Active on IRC: gautierchomel, liisamk, Michalis0, rickj, wolfgang

+
+ + diff --git a/Meetings/Minutes/2025-05-15-publishingcg.html b/Meetings/Minutes/2025-05-15-publishingcg.html new file mode 100644 index 0000000..3c141e5 --- /dev/null +++ b/Meetings/Minutes/2025-05-15-publishingcg.html @@ -0,0 +1,98 @@ + + + + + +Publishing CG plenary: Senthil Nathan (Ailaysa, Chennai) discuss steps towards Responsible Digital Publishing: Content Exclusion and AI Training + + + + + + + + + + +
+

W3C

+ +

Publishing CG plenary: Senthil Nathan (Ailaysa, Chennai) discusses steps towards Responsible Digital Publishing: Content Exclusion and AI Training

+

15 May 2025

+ + +
+ +
+
+

Attendees

+
+
Present
miia
+
Regrets
-
+
Chair
-
+
Scribe
wolfgang
+
+
+ + +
+ +
+

Meeting minutes

+

gautier: Talk of Senthil (Ailaysa, Chennai) - we are taking notes

+


+

senthil: speak about the concept, then provide a demo, then Q & A
+… Senthil Nathan from Ailaysa - AI company - content translation based on AI - taking content in different languages - international book fair in Chennai - introduced products into publishing - before mainly translation/localization - automatic translations using AI
+… concepts: how to develop responsible content in an AI context - we cannot have walled gardens - great data rush for training AI systems without knowledge and permission of owners - awareness that quality content is very important for AI - quality data should come from publishers, media companies, research institutes - shifting to being active negotiators

+

+… Content exclusion of content as training data - in case of use, responsible usage and permission needed - in 2024 people are actively discussing - should be a fair deal with proper compensation - illegal scraping was a big problem - is coming to an end - much reduced now
+… terms of permission are set by both parties - technical barriers can now be easily implemented - clear legal terms prohibiting use without limits - content watermarking and provenance tracking tools
+… to include: fair licensing terms - mandatory source citations in AI output - quality control: selective participation with responsible AI companies - usage tracking: monitoring how content influences AI responses - consent frameworks: granular control over AI uses
+… factors: technical, business, regulatory and market dynamics
+… AI-specific exclusion protocols (better than robots.txt) - rise of new AI-crawlers (require new blocking mechanisms) - dynamic paywalls and anti-scraping tech - emergence of content-tracking tools
+… blockers (NYT, Guardian) vs. partners (Axel Springer with OpenAI) vs. open access (But seeking attribution) vs. wait-and-see
+… EU: AI Act - US: considering legal framework - courses of copyright offices
+… market: growing need for high-quality content - AI is not thinking, algorithmic, not creative - publishers see new revenue streams via partnerships - data brokers like literary agency - syndication rights
+… principle of fair monetization - important to track extent of usage and kinds of usage
+… from authoring to reading: AI environment is set - book discovery enhanced through LLM recommendation and search systems - going beyond metadata and keywords: asking questions on the contents of the book (e.g. ChaiReader)
+… future options: read the book in another language such as Tamil thanks to automatic translation, or as an audiobook - in libraries, bookstores, schools the use of books may be changed -
+… HarperCollins works with MS, also Sage, CUP,
+… have to find common ground between publishers and AI companies
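As an illustration of the "new blocking mechanisms" point above, a server can refuse requests from known AI training crawlers by user agent. The sketch below is hypothetical (it assumes Express, and the crawler list is deliberately short and non-exhaustive); a real deployment would combine this with robots.txt and TDM reservation signals rather than rely on it alone.

```typescript
// Minimal user-agent gate against known AI training crawlers (illustrative only).
import express from "express";

const AI_CRAWLER_TOKENS = ["GPTBot", "CCBot", "ClaudeBot"]; // example tokens, not exhaustive

const app = express();

app.use((req, res, next) => {
  const ua = req.get("user-agent") ?? "";
  if (AI_CRAWLER_TOKENS.some((token) => ua.includes(token))) {
    // 403 rather than 404: the content exists, but scraping for training is not permitted.
    res.status(403).send("AI training crawlers are not permitted on this site.");
    return;
  }
  next();
});

app.get("/", (_req, res) => res.send("Catalogue page"));

app.listen(3000);
```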

+

Demo Chai Reader: Reading, Chatting and Buying in one portal - multilingual Q&A - buy routine integrated - in future: book recommendations based on search terms - translation of a book into a target language

+

gautier: when I'm chatting with a book, answers only from book content - LLM only used to prepare a nice answer - not training each book in LLM -

+

Senthil: completely separated

+

michalis: concerned that access to content should be fair use - esp. in the US -next months will be critical in legal aspects

+

senthil: big publishers have great interest - different for small publishers or even authors -

+

michalis: in education or academic this would be quite useful

+

senthil: exactly - useful to explore several books in parallel to formulate an answer - we work with EDRLab to improve on it - ChaiReader still in Beta - working with publishers - can chat with a collection of books, not only one at the same time - impact of "AI on economics" - reasoning capacity - more important than just referring back - great thing for book discovery

+

ivan: aren't you forced to make some sort of ranking between books consumed - need a local ranking for books you have

+

senthil: possible to rank or categorize dependent on prompting

+

vishal: the more correct the prompt, the more precise the answer will be - if 3 books have an answer - semantic ranking combined with keyword level ranking - still experimental feature - as Google and Amazon do

+

ivan: in some cases this is not the best answer - in scholarly usage - ranking by systems outside your bookshop - based on reputation of answers - you use LLM only for niceties of input and output

+

vishal: reinforcement learning - librarian knows the authors - deepseek uses this feature - integrate human expertise into machine

+

senthil: good question

+


+
+
+ + +
Minutes manually created (not a transcript), formatted by scribe.perl version 244 (Thu Feb 27 01:23:09 2025 UTC).
+ +
+

Diagnostics

+

Maybe present: gautier, ivan, michalis, senthil, vishal

+

All speakers: gautier, ivan, michalis, senthil, vishal

+

Active on IRC: gautierchomel, Michalis, wolfgang

+
+ + diff --git a/Meetings/Minutes/2025-06-11-publishingcg.html b/Meetings/Minutes/2025-06-11-publishingcg.html new file mode 100644 index 0000000..efe710a --- /dev/null +++ b/Meetings/Minutes/2025-06-11-publishingcg.html @@ -0,0 +1,100 @@ + + + + + +W3C Publishing Community Group Plenary: "Advanced Features in Colibrio Reader" – 11 June 2025 + + + + + + + + + + +
+

W3C

+ +

– DRAFT –
+W3C Publishing Community Group Plenary: "Advanced Features in Colibrio Reader"

+

11 June 2025

+ + +
+ +
+
+

Attendees

+
+
Present
gautierchomel, jimsaya, jonas, Lars, wolfgang
+
Regrets
-
+
Chair
wolfgang
+
Scribe
gautierchomel
+
+
+ + +
+ +
+

Meeting minutes

+

Lars: I have no formal presentation. I have been experimenting with AI in Colibrio for a long time now. I am particularly interested in having conversations with a book, because I am a fan of fiction.

+

Lars: I have been using OpenAI; experimenting with their API is easy, it runs in the browser and it is client side. Last year, they released an assistant API that takes care of the boring stuff.

+

Lars: to use LLMs we need tools; those are the APIs. Without tools, the LLM is a huge but unintelligent base of knowledge. To be more precise we feed it very contextual information. You can give the context in a prompt: prompt engineering is about being the tool yourself. The more precise you are, the more accurate the answer you get. To go further in using LLMs we use context inputs from other databases. That's the role of the API.

+

Lars: adding context is costly and time consuming. It needs to be structured and expressed in a way understandable by the LLM. This complexity needs to be managed.

+

Lars: showing screen. This is the vanilla reader, available online. You need an OpenAI key to make it work. It can be costly when you work with images, less when it is about text. At the opening of the book, I strip away unnecessary markup and keep only the HTML semantics. I clean down to the bare minimum of code. That is pushed to the LLM as embeddings. Think about it as a computer edition of the book, an edition made for computers, a numerical representation of the book. It feeds a vector database. You can then ask questions, queries, of the database.

+

Lars: A note, this embedded version, in my opinion, should be built and sold by the publisher.

+

Lars: Anyway that's an important part because this is the step allowing to get contextualised answers.

+

Lars: next, I open the dialog, a chat box built in the app, and start to ask.

+

Lars: the question goes to the vector database, which performs semantic search and provides chunks of 500 characters to the LLM, which formulates the answers displayed to me.

+

Lars: To make sure the responses are from the book, the app performs a search and provides link references for each part of the answer. So you can activate the link and go to the part of the book stating that.
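The pipeline described here (clean the markup, embed 500-character chunks into a vector store, retrieve the best matches for a question, and have the LLM answer only from those chunks while keeping track of which ones were used) can be sketched roughly as below. This is not Colibrio's code; it assumes the OpenAI Node SDK, and an in-memory array with cosine similarity stands in for the vector database.

```typescript
import OpenAI from "openai";

const client = new OpenAI();
const CHUNK_SIZE = 500; // characters, as described in the session

function chunk(text: string): string[] {
  const out: string[] = [];
  for (let i = 0; i < text.length; i += CHUNK_SIZE) out.push(text.slice(i, i + CHUNK_SIZE));
  return out;
}

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) { dot += a[i] * b[i]; na += a[i] * a[i]; nb += b[i] * b[i]; }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

async function embed(texts: string[]): Promise<number[][]> {
  const res = await client.embeddings.create({ model: "text-embedding-3-small", input: texts });
  return res.data.map((d) => d.embedding);
}

async function askBook(bookText: string, question: string) {
  const chunks = chunk(bookText);
  const [questionVec, ...chunkVecs] = await embed([question, ...chunks]);
  const topMatches = chunkVecs
    .map((vec, i) => ({ i, score: cosine(questionVec, vec) }))
    .sort((a, b) => b.score - a.score)
    .slice(0, 4);

  const context = topMatches.map((m) => `[chunk ${m.i}] ${chunks[m.i]}`).join("\n\n");
  const completion = await client.chat.completions.create({
    model: "gpt-4o-mini",
    messages: [
      { role: "system", content: "Answer only from the provided book excerpts. Cite chunk numbers." },
      { role: "user", content: `${context}\n\nQuestion: ${question}` },
    ],
  });
  // The chunk indices let the UI link each part of the answer back into the book.
  return { answer: completion.choices[0].message.content, sources: topMatches.map((m) => m.i) };
}
```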

+

Lars: the models are not smart; it is the context and the apparatus deployed by the app developers that make them useful. As a consequence, the better the quality of the book, the better the answers. Metadata are important too; we extract and use them to feed the database.

+

Lars: Metadata, semantics, table of contents - all the ebook apparatus is used here. It is our best chance to get good results.

+

<wolfgang> Gautier: publishers rely on AI systems - risks involved for customers

+

Gautier: there is a risk of loop. AI analysing data created by AI.

+

Lars: yes, that's a major problem actually, with all digital content

+

Gautier: so probably it is of use to have a refine property to indicate that "this metadata or content was AI produced".

+

Lars: For sure! So we could alert the user, give a proportion of risk.

+

Lars: the LLM hype is too much, but still, the results are good. Let's see with images. Here I send the image plus context, including the visible content alongside the image (we call it the visible range), and always in the context of the book thanks to the embedded version stored in a database. I get good results. Trying with a contemporary art photo and a world map with data represented on it. This is complex to achieve in the production pipeline. It is easier in the reading system because we have the complete numerical representation of the book stored in a database.

+

jonas: what is included in the visible range?

+

Lars: text that is available on the visual page. It is risky to expand too much; it could interpolate topics from other parts of the book. We could experiment with adding title structure, for example.

+

wolfgang: I feel, for science content, the chapter level can be the context.

+

Lars: this is to experiment, there are many different books fortunately! The solution will differ largely depending on this diversity. The more granular the information (semantics, metadata, structure) you give, the better the result you'll get. A schema attribute would be a strong help, for example. Be smart when you build your ebook and you'll get strong feedback.

+

Lars: I am also adding semantic search and translation. All we add is meant for non visual readers, they have a stronger need.

+

Lars: it also works with local models so you are not obliged to send your content to feed the LLM. It is slower but it works.

+

jonas: what happens with copyrighted material?

+

Lars: never use free services. I pay for OpenAI, and the contract says they don't use my content for training. That's why we just provide a way to give your API key; then you are responsible. I don't want to take that responsibility.

+

Lars: also, publishers should build and sell rights on an embedded version - meaning licensing your content, but ready for machine usage.

+

jonas: for libraries it's tricky, we usually don't own copyright.

+

Lars: you would need to buy two licences, one for public reading and one for machine usage.

+

wolfgang: in fact all the knowledge used in your system comes from the book. The LLM is only a vehicle here.

+

Lars: yes, the LLM is a conversational interface, good at language, but we need to give it the knowledge by running other code alongside.

+

Lars: and adding control checks to make the answer accurate and verifiable. That's part of the agreement with computers: we want to be able to check, because they don't always tell the truth.

+
+
+ + +
Minutes manually created (not a transcript), formatted by scribe.perl version 244 (Thu Feb 27 01:23:09 2025 UTC).
+ +
+

Diagnostics

+

Succeeded: s/LLm/LLM/

+

Succeeded: s/by/buy/

+

Maybe present: Gautier

+

All speakers: Gautier, jonas, Lars, wolfgang

+

Active on IRC: gautierchomel, wolfgang

+
+ + diff --git a/Meetings/Minutes/2025-07-17-publishingcg.html b/Meetings/Minutes/2025-07-17-publishingcg.html new file mode 100644 index 0000000..379229e --- /dev/null +++ b/Meetings/Minutes/2025-07-17-publishingcg.html @@ -0,0 +1,97 @@ + + + + + +Publishing CG plenary: "AI use cases and technical considerations in Thorium Reader, an open source reading system" + + + + + + + + + + +
+

W3C

+ +

Publishing CG plenary: "AI use cases and technical considerations in Thorium Reader, an open source reading system"

+

17 July 2025

+ + +
+ +
+
+

Attendees

+
+
Present
DanielWeck, gautierchomel, George, james, ori, vladimir, wolfgang
+
Regrets
-
+
Chair
gautierchomel, wolfgang
+
Scribe
wolfgang
+
+
+ + +
+ +
+

Meeting minutes

+

Gautier: This session is part of a series on AI in RS.
+… today Daniel Weck, lead developer of Thorium (reading system from EDRLab)

+

DanielWeck: AI-generated content descriptions in Thorium - unreleased experiment, thus work in progress - lays foundations for user experience we want to follow - demo two page spread with image of Jules Verne with description - inspect the HTML
+… image has empty @alt and empty title - link takes me to an appendix where the image is displayed - but here the @alt is not empty - it says "linked image"
+… would be great if we had some help from AI - Thorium has a zoom feature - leads to room for textual description - can choose an LLM to generate description -
+… decide between "short" or "extended" description - I can edit the system prompt, but there is a default system prompt - Gemini very good at discovering people in images - extended description would have two paragraphs - advanced view of system prompt in JSON format - additional information in prompt in this format
+… select text from answer - run a search on the Internet to get more information
+… new work from W3C WG - complex image (bar chart) - link to extended description - rich text that is not part of a short description - plan to create a modal interface where you might consult AI

+

(1) user sees descriptions (2) chat with AI (3) do further research on the web - familiar chat UI - modal overlay - default system prompt which sets useful boundaries - we also feed in metadata
+… request short or extended descriptions easily - just "one shot" - we need to inform the user that an AI will hallucinate
+… MCP Model Context Protocol for tool calls out of scope - RAG also not implemented - beyond basic embedding - also not local LLMs - response times OK, but not the quality - Gemini better for image descriptions
+… you may give metadata as embedded context for the prompt - advanced users may edit the system prompt and might remove blatantly irrelevant metadata
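A rough sketch of the kind of request described here, with a default but editable system prompt, publication metadata folded in as context, and a short/extended switch, is given below. It is not Thorium's implementation; it assumes the OpenAI Node SDK's image-input message shape, whereas Thorium targets several providers through an abstraction layer.

```typescript
import OpenAI from "openai";

const client = new OpenAI();

const DEFAULT_SYSTEM_PROMPT =
  "You describe images from an ebook for a reader who cannot see them. " +
  "Use only what is visible in the image and the publication context provided. " +
  "Say plainly when something cannot be determined.";

interface DescriptionRequest {
  imageDataUrl: string;             // e.g. "data:image/jpeg;base64,..."
  metadata: Record<string, string>; // title, author, language, ...
  length: "short" | "extended";
  systemPrompt?: string;            // advanced users may override the default
}

async function describeImage(req: DescriptionRequest): Promise<string> {
  const instruction =
    req.length === "short"
      ? "Give a one-sentence description."
      : "Give a two-paragraph extended description.";

  const response = await client.chat.completions.create({
    model: "gpt-4o-mini",
    messages: [
      {
        role: "system",
        content:
          (req.systemPrompt ?? DEFAULT_SYSTEM_PROMPT) +
          "\nPublication context: " + JSON.stringify(req.metadata),
      },
      {
        role: "user",
        content: [
          { type: "text", text: instruction },
          { type: "image_url", image_url: { url: req.imageDataUrl } },
        ],
      },
    ],
  });
  return response.choices[0].message.content ?? "";
}
```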

+

George: publishers are not happy with AI getting trained with their copyrighted materials. Any protections?

+

Daniel: All conversations in the chat with AI are used for training if I don't pay for using the LLM - If I were to pay for the service, the data remain private - always depends on the terms and conditions of a particular model - for publishers the TDM reservation protocol allows opting in or out - Thorium would respect this
+… any ideas how that could be solved?

+

George: if image is not used for training, publishers are OK with that.

+

Daniel: Thorium would have to police the use of data by an LLM - Would Thorium have to blacklist some models?

+

James: Publishers are very twitchy about copyrighted material - with an EPUB you can mark the TDM or place a couple of metatags - 6 or 7 different ways to signal that training is not accepted - training is an issue -
+… on-device LLMs would be helpful

+

Daniel: Publishers don't want RS to create friction - with image copies and text scanning, it's so easily done (e.g. on a Mac) - we have to send the image to the AI, but can't control what the LLM will be doing with it

+

James: could a publisher embed a token ?

+

Daniel: agreement with Mistral - access token for EDRLab - could run on a Thorium server - Thorium doesn't transport the key itself, but uses it in accessing the LLM to answer users' requests

+

Ori: if you are using the user's API key, you can't know what the AI does with it - Gemini says they don't use it for training, no idea what OpenAI does - using another key is problematic
+… Gemini doesn't feed requests for image descriptions to humans

+

Daniel: main stumbling block: potential legal issues - we could enable it in nightly builds, but not in production builds

+

George: JPEG has metadata in it - is that transmitted?

+

Daniel: in FB Messenger or Signal I check that GPS data is erased before I share pictures - with AI once the image payload is transmitted - it will be readable for AI

+

Ori: guess it will not ingest geographical data

+

Daniel: most LLMs have restrictions - in Thorium we don't create requests for LLMs manually - we feed image data into an abstraction interface
+… abstraction layer is fully client-side - it allows us to speak Javascript -

+

Ori: had to reduce size of image - don't send EXIF or geographical data

+

Daniel: images processed before sending them on the wire - reduction in size before sending
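A minimal sketch of that client-side preprocessing step: downscaling and re-encoding through a canvas shrinks the payload, and because the image is re-encoded, the resulting JPEG does not carry over the original EXIF/GPS metadata. Illustrative browser-side TypeScript, not Thorium's actual pipeline.

```typescript
// Downscale and re-encode an image before it leaves the device.
async function preprocessImage(original: Blob, maxDimension = 1024): Promise<Blob> {
  const bitmap = await createImageBitmap(original);
  const scale = Math.min(1, maxDimension / Math.max(bitmap.width, bitmap.height));

  const canvas = document.createElement("canvas");
  canvas.width = Math.round(bitmap.width * scale);
  canvas.height = Math.round(bitmap.height * scale);

  const ctx = canvas.getContext("2d");
  if (!ctx) throw new Error("2D canvas not available");
  ctx.drawImage(bitmap, 0, 0, canvas.width, canvas.height);
  bitmap.close();

  return new Promise<Blob>((resolve, reject) =>
    canvas.toBlob(
      (blob) => (blob ? resolve(blob) : reject(new Error("JPEG encoding failed"))),
      "image/jpeg",
      0.85 // quality trade-off: smaller payload on the wire
    )
  );
}
```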

+

gautier: WCAG criteria; description must offer same service as the image - a way to fulfil this - focus on authored description (if available) - real success for WCAG requirement

+
+
+ + +
Minutes manually created (not a transcript), formatted by scribe.perl version 244 (Thu Feb 27 01:23:09 2025 UTC).
+ +
+

Diagnostics

+

No scribenick or scribe found. Guessed: wolfgang

+

Maybe present: Daniel, Gautier

+

All speakers: Daniel, DanielWeck, Gautier, George, James, Ori

+

Active on IRC: DanielWeck, gautierchomel, George, wolfgang

+
+ + diff --git a/Meetings/Minutes/index.md b/Meetings/Minutes/index.md index db800d7..fb0ddc2 100644 --- a/Meetings/Minutes/index.md +++ b/Meetings/Minutes/index.md @@ -5,6 +5,13 @@ title: minutes # Meeting Minutes -[Search for minutes in the mailing list](https://www.w3.org/Search/Mail/Public/search?lists=public-publishingcg&keywords=minutes) +* [2025-07-17](2025-07-17-publishingcg.html) +* [2025-06-11](2025-06-11-publishingcg.html) +* [2025-05-15](2025-05-15-publishingcg.html) +* [2025-03-20](2025-03-20-publishingcg.html) +* [2024-02-15](2024-02-15-publishingcg.html) +* [2022-08-10](2022-08-10-publishingcg.html) +* [2020-10-21](2020-10-21-publishingcg.html) -March 20th,2025 . \ No newline at end of file +Not all minutes have been posted to this GitHub repo; to find more, you can +[Search for minutes in the mailing list](https://www.w3.org/Search/Mail/Public/search?lists=public-publishingcg&keywords=minutes)