Nothing to do with me guv. My changes were only to the jsi profile which calls docx to tei conversion, and to the Makefile to invoke same. On 11/11/17 21:56, Martin Holmes wrote:
There is a recently-active ticket on docx:
https://github.com/TEIC/Stylesheets/issues/262
where Lou made changes. Could that be connected?
On 2017-11-11 01:50 PM, Syd Bauman wrote:
[s.bauman] Update from deprecated name of attribute to current name
So I fixed the easy one. There is still an expected results problem.
I believe that when testing the Stylesheets, the build process
* converts Test/test-indexes.docx to Test/test-indexes.xml * compares that file to Test/expected-results/test-indexes.xml
and that comparison is failing. I suspect (but don't know) that the difference may be just the new Saxon, as a lot of the difference is just whitespace. But I also don't grok why a lot of the content of <p> elements is now surrounded by a <hi style="font-size:12pt">.
Has anyone changed docx -> TEI lately?
Relevant bits of console:
../bin/docxtotei test-indexes.docx temp.xml Convert test-indexes.docx to http://teijenkins.hcmc.uvic.ca/job/Stylesheets-dev/ws/Test/temp.xml (docx to tei) using profile default [echo] using http://teijenkins.hcmc.uvic.ca/job/Stylesheets-dev/ws/profiles/default/docx/... BUILD SUCCESSFUL Total time: 4 seconds
xmllint --format temp.xml | perl cleanup.pl > test-indexes.xml && rm temp.xml diff test-indexes.xml expected-results/test-indexes.xml; fi 44,93c44,45 < <p> < <anchor xml:id="foo"/> < <hi style="font-size:12pt">Here we are talking about the Web</hi> < <index indexName="XE"> < <term>Web</term> < </index> < <hi style="font-size:12pt">, about Web services</hi> < <index indexName="XE"> < <term>Web</term> < <index> < <term>Web service</term> < </index> < </index> < <hi style="font-size:12pt">, and Web apps</hi> < <index indexName="XE"> < <term>Web</term> < <index> < <term>Web app</term> < </index> < </index> < <hi style="font-size:12pt">, but also about bugs</hi> < <index indexName="XE"> < <term>bugs</term> < <index> < <term>horrible bugs<ref type="xr">See bugs</ref></term> < </index> < </index> < <hi style="font-size:12pt">and errors</hi> < <index indexName="XE"> < <term>error</term> < </index> < <hi style="font-size:12pt">. Note that the index terms can be in Word also formatted</hi> < <index indexName="XE"> < <term>formatting</term> < </index> < <hi style="font-size:12pt">, which is lost in the TEI. We can have ranges though, like this</hi> < <index indexName="XE" spanTo="#foo"> < <term>Range</term> < <index> < <term>Subrange</term> < </index> < </index> < <hi style="font-size:12pt">.</hi> < </p> < <p> < <hi rend="underline" style="font-size:12pt" xml:space="preserve">This is a simpler paragraph with a word in </hi> < <hi rend="italic underline" style="font-size:12pt">italic</hi> < <hi rend="underline" style="font-size:12pt" xml:space="preserve"> in the middle, though the whole paragraph is underlined</hi> < <hi style="font-size:12pt">.</hi> < </p> ---
<p><anchor xml:id="foo"/>Here we are talking about the Web<index indexName="XE"><term>Web</term></index>, about Web services<index indexName="XE"><term>Web</term><index><term>Web service</term></index></index>, and Web apps<index indexName="XE"><term>Web</term><index><term>Web app</term></index></index>, but also about bugs<index indexName="XE"><term>bugs</term><index><term>horrible bugs<ref type="xr">See bugs</ref></term></index></index> and errors<index indexName="XE"><term>error</term></index>. Note that the index terms can be in Word also formatted<index indexName="XE"><term>formatting</term></index>, which is lost in the TEI. We can have ranges though, like this<index indexName="XE" spanTo="#foo"><term>Range</term><index><term>Subrange</term></index></index>.</p> <p><hi rend="underline" xml:space="preserve">This is a simpler paragraph with a word in </hi><hi rend="italic underline">italic</hi><hi rend="underline" xml:space="preserve"> in the middle, though the whole paragraph is underlined</hi>.</p> 98,124c50,57 < <p rend="index 1"> < <hi style="font-size:12pt" xml:space="preserve">bug. </hi> < <hi rend="italic" style="font-size:12pt">See</hi> < <hi style="font-size:12pt" xml:space="preserve"> error</hi> < </p> < <p rend="index 1"> < <hi style="font-size:12pt">error, 1</hi> < </p> < <p rend="index 1"> < <hi rend="bold" style="font-size:12pt">formatting</hi> < <hi style="font-size:12pt">, 1</hi> < </p> < <p rend="index 1"> < <hi style="font-size:12pt">Range</hi> < </p> < <p rend="index 2"> < <hi style="font-size:12pt">Subrange, 1</hi> < </p> < <p rend="index 1"> < <hi style="font-size:12pt">Web, 1</hi> < </p> < <p rend="index 2"> < <hi style="font-size:12pt">Web app, 1</hi> < </p> < <p rend="index 2"> < <hi style="font-size:12pt">Web service, 1</hi> < </p>
<p rend="index 1">bug. <hi rend="italic">See</hi> error</p> <p rend="index 1">error, 1</p> <p rend="index 1"><hi rend="bold">formatting</hi>, 1</p> <p rend="index 1">Range</p> <p rend="index 2">Subrange, 1</p> <p rend="index 1">Web, 1</p> <p rend="index 2">Web app, 1</p> <p rend="index 2">Web service, 1</p>