Evaluating the "Copia" version of the Guidelines
Dear Council, After I had to exit early from this morning's Stylesheets meeting I was eager to see how Syd's "Copia" output of the Guidelines turned out. (For those who missed it, Syd found a quick way to output the Guidelines in a way that would prevent suppression of examples in other languages, and this seems mainly to affect the way the Guidelines Spec pages turn out (not the Guidelines chapter examples). I saw in the minutes that we're asked to review Syd's interesting output, named "Copia" (for more copious examples) and report back to the Council list. So started exploring element spec pages a little this evening. Here's what I'm finding: The bibl element is much improved by the Copia output: the examples are far more diverse, but not bewildering in number. I think it is fascinating to see non-English variety: - Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-bibl.h... - Current: https://tei-c.org/release/doc/tei-p5-doc/en/html/ref-bibl.html The byline element is similarly a lot more interesting now: - Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-byline... - Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-byline.html The gap element seems similarly improved, though there's a little problem here: - Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-gap.ht... - Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-gap.html What I noticed right away was, the number of examples did not seem overwhelming, and the variety of languages evident before you had to click "Show All" reflects a good diversity with Asian languages rising to the fore. However(!) the duplicate example jumps out--duplicates in English. Turns out the French exemplum is identical to the English here, and I"m not really sure why. I think I've seen that in other specs as well. Meanwhile, the element p looks a bit more interesting now, and I think it improves on the current output: - Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-p.html - Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-p.html How about ab? Completely boring: no change at all, and it's sadly evident that we don't have enough examples in the first place. (If that isn't a ticket yet, it should be!) - Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-ab.htm... - Current: https://tei-c.org/release/doc/tei-p5-doc/en/html/ref-ab.html Okay-that's a start: a review of 5 out of 582 element specs (less than one percent...sigh). But so far I like the Copia version for what it illuminates about our range of exempla, both where it is diverse and where it is lacking. Looking forward to hearing what others think! Elisa -- Elisa Beshero-Bondar, PhD Program Chair of Digital Media, Arts, and Technology | Professor of Digital Humanities | Director of the Digital Humanities Lab at Penn State Erie, The Behrend College Development site: https://newtfire.org
Dear Elisa,
I very much agree with this assessment!
N
On 4 Nov 2020, at 03:03, Elisa Beshero-Bondar
I wasn't in the discussion that led to it, but I am not sure that I see the point of this. It reminds us that (a) we have translated examples for French and Chinese many times much more than for other languages (b) they are of rather varied quality -- many of the Chinese ones, I believe, simply translate the English ones, rather than seeking out comparable natively Chinese examples. And other than that, it enables little beyond linguistic tourism (for which surely there are easier interfaces). And checking up on the quality and quantity of examples is not hard to do with a simple xPath of the source is it? Or don't we do xpath any more? I do, so here are some numbers for my copy of the last release number of exemplum elements : 1913 number in languages other than English : 1152 (60%) or 1015 (53%) if you discount the non-linguistic or multi-language ones of which: 547 (nearly half) are in french; 433 in zh-TW (nearly the other half); 88 in 'und' (i.e. not in any language) ; 49 in 'mul' (i.e. multiple languages) ; 31 are in 'de', and 2 each for la and no so, if you want more diversity, just look at the French version! On 04/11/2020 03:03, Elisa Beshero-Bondar wrote:
Dear Council, After I had to exit early from this morning's Stylesheets meeting I was eager to see how Syd's "Copia" output of the Guidelines turned out. (For those who missed it, Syd found a quick way to output the Guidelines in a way that would prevent suppression of examples in other languages, and this seems mainly to affect the way the Guidelines Spec pages turn out (not the Guidelines chapter examples). I saw in the minutes that we're asked to review Syd's interesting output, named "Copia" (for more copious examples) and report back to the Council list. So started exploring element spec pages a little this evening. Here's what I'm finding:
The bibl element is much improved by the Copia output: the examples are far more diverse, but not bewildering in number. I think it is fascinating to see non-English variety:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-bibl.h... * Current: https://tei-c.org/release/doc/tei-p5-doc/en/html/ref-bibl.html
The byline element is similarly a lot more interesting now:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-byline... * Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-byline.html
The gap element seems similarly improved, though there's a little problem here:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-gap.ht... * Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-gap.html
What I noticed right away was, the number of examples did not seem overwhelming, and the variety of languages evident before you had to click "Show All" reflects a good diversity with Asian languages rising to the fore. However(!) the duplicate example jumps out--duplicates in English. Turns out the French exemplum is identical to the English here, and I"m not really sure why. I think I've seen that in other specs as well.
Meanwhile, the element p looks a bit more interesting now, and I think it improves on the current output:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-p.html
* Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-p.html
How about ab? Completely boring: no change at all, and it's sadly evident that we don't have enough examples in the first place. (If that isn't a ticket yet, it should be!)
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-ab.htm... * Current: https://tei-c.org/release/doc/tei-p5-doc/en/html/ref-ab.html
Okay-that's a start: a review of 5 out of 582 element specs (less than one percent...sigh). But so far I like the Copia version for what it illuminates about our range of exempla, both where it is diverse and where it is lacking.
Looking forward to hearing what others think! Elisa
-- Elisa Beshero-Bondar, PhD Program Chair of Digital Media, Arts, and Technology | Professor of Digital Humanities | Director of the Digital Humanities Lab at Penn State Erie, The Behrend College Development site: https://newtfire.org https://newtfire.org/
_______________________________________________ Tei-council mailing list Tei-council@lists.tei-c.org http://lists.lists.tei-c.org/mailman/listinfo/tei-council
As you might imagine, Lou, the point of this exercise was not a review of the available exempla in the specs. The point was to test an alternative way to output the Guidelines when we do not suppress non-English examples, and we are investigating the XSLT code that guides the filtering process. We simply wanted to see how the Guidelines look with an alternative approach to processing. As someone who teaches XPath, I’m certainly glad that you don’t find it passé. Of course it helps to inspect the Guidelines if only to see where we have disproportionate exempla. I think the internationalization group may be interested in applying it to mark off which elements are lacking a range of languages. Our exempla are under development with new contributions forthcoming from Japanese, Spanish, and German, and we are generally exploring questions of how to prepare better options for multilingual customizations of the Guidelines. So, just for the moment, we are reviewing the output of some modified XSLT here to get a sense of what happens when we alter something, and actually wanted to see whether the Specs come out looking cluttered and repetitive, or more diverse and interesting. I hope that provides some better context for our activity. Elisa Elisa Beshero-Bondar, PhD Program Chair of Digital Media, Arts, and Technology | Professor of Digital Humanities | Director of the Digital Humanities Lab at Penn State Erie, The Behrend College Typeset by hand on my iPhone
On Nov 4, 2020, at 6:47 AM, Lou Burnard
wrote: I wasn't in the discussion that led to it, but I am not sure that I see the point of this. It reminds us that (a) we have translated examples for French and Chinese many times much more than for other languages (b) they are of rather varied quality -- many of the Chinese ones, I believe, simply translate the English ones, rather than seeking out comparable natively Chinese examples. And other than that, it enables little beyond linguistic tourism (for which surely there are easier interfaces). And checking up on the quality and quantity of examples is not hard to do with a simple xPath of the source is it? Or don't we do xpath any more? I do, so here are some numbers for my copy of the last release
number of exemplum elements : 1913
number in languages other than English : 1152 (60%) or 1015 (53%) if you discount the non-linguistic or multi-language ones
of which: 547 (nearly half) are in french; 433 in zh-TW (nearly the other half); 88 in 'und' (i.e. not in any language) ; 49 in 'mul' (i.e. multiple languages) ; 31 are in 'de', and 2 each for la and no
so, if you want more diversity, just look at the French version!
On 04/11/2020 03:03, Elisa Beshero-Bondar wrote:
Dear Council, After I had to exit early from this morning's Stylesheets meeting I was eager to see how Syd's "Copia" output of the Guidelines turned out. (For those who missed it, Syd found a quick way to output the Guidelines in a way that would prevent suppression of examples in other languages, and this seems mainly to affect the way the Guidelines Spec pages turn out (not the Guidelines chapter examples). I saw in the minutes that we're asked to review Syd's interesting output, named "Copia" (for more copious examples) and report back to the Council list. So started exploring element spec pages a little this evening. Here's what I'm finding:
The bibl element is much improved by the Copia output: the examples are far more diverse, but not bewildering in number. I think it is fascinating to see non-English variety:
Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-bibl.h... Current: https://tei-c.org/release/doc/tei-p5-doc/en/html/ref-bibl.html
The byline element is similarly a lot more interesting now:
Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-byline... Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-byline.html
The gap element seems similarly improved, though there's a little problem here: Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-gap.ht... Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-gap.html
What I noticed right away was, the number of examples did not seem overwhelming, and the variety of languages evident before you had to click "Show All" reflects a good diversity with Asian languages rising to the fore. However(!) the duplicate example jumps out--duplicates in English. Turns out the French exemplum is identical to the English here, and I"m not really sure why. I think I've seen that in other specs as well.
Meanwhile, the element p looks a bit more interesting now, and I think it improves on the current output: Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-p.html Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-p.html
How about ab? Completely boring: no change at all, and it's sadly evident that we don't have enough examples in the first place. (If that isn't a ticket yet, it should be!)
Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-ab.htm... Current: https://tei-c.org/release/doc/tei-p5-doc/en/html/ref-ab.html
Okay-that's a start: a review of 5 out of 582 element specs (less than one percent...sigh). But so far I like the Copia version for what it illuminates about our range of exempla, both where it is diverse and where it is lacking.
Looking forward to hearing what others think! Elisa
-- Elisa Beshero-Bondar, PhD Program Chair of Digital Media, Arts, and Technology | Professor of Digital Humanities | Director of the Digital Humanities Lab at Penn State Erie, The Behrend College Development site: https://newtfire.org
_______________________________________________ Tei-council mailing list Tei-council@lists.tei-c.org http://lists.lists.tei-c.org/mailman/listinfo/tei-council
Tei-council mailing list Tei-council@lists.tei-c.org http://lists.lists.tei-c.org/mailman/listinfo/tei-council
OK thanks for the contextualisation Elisa, but if the point was just to evaluate the way the Guidelines look when not filtering out by language, I am still somewhat at a loss, since the output you get from the "show all" button already does precisely that, and arguably rather well. But in any case I look forward to the arrival of new exempla in other languages... Lou On 04/11/2020 13:18, Elisa Beshero-Bondar wrote:
As you might imagine, Lou, the point of this exercise was not a review of the available exempla in the specs. The point was to test an alternative way to output the Guidelines when we do not suppress non-English examples, and we are investigating the XSLT code that guides the filtering process. We simply wanted to see how the Guidelines look with an alternative approach to processing.
As someone who teaches XPath, I’m certainly glad that you don’t find it passé. Of course it helps to inspect the Guidelines if only to see where we have disproportionate exempla. I think the internationalization group may be interested in applying it to mark off which elements are lacking a range of languages. Our exempla are under development with new contributions forthcoming from Japanese, Spanish, and German, and we are generally exploring questions of how to prepare better options for multilingual customizations of the Guidelines. So, just for the moment, we are reviewing the output of some modified XSLT here to get a sense of what happens when we alter something, and actually wanted to see whether the Specs come out looking cluttered and repetitive, or more diverse and interesting.
I hope that provides some better context for our activity.
Elisa
Elisa Beshero-Bondar, PhD Program Chair of Digital Media, Arts, and Technology | Professor of Digital Humanities | Director of the Digital Humanities Lab at Penn State Erie, The Behrend College
Typeset by hand on my iPhone
On Nov 4, 2020, at 6:47 AM, Lou Burnard
wrote:
I wasn't in the discussion that led to it, but I am not sure that I see the point of this. It reminds us that (a) we have translated examples for French and Chinese many times much more than for other languages (b) they are of rather varied quality -- many of the Chinese ones, I believe, simply translate the English ones, rather than seeking out comparable natively Chinese examples. And other than that, it enables little beyond linguistic tourism (for which surely there are easier interfaces). And checking up on the quality and quantity of examples is not hard to do with a simple xPath of the source is it? Or don't we do xpath any more? I do, so here are some numbers for my copy of the last release
number of exemplum elements : 1913
number in languages other than English : 1152 (60%) or 1015 (53%) if you discount the non-linguistic or multi-language ones
of which: 547 (nearly half) are in french; 433 in zh-TW (nearly the other half); 88 in 'und' (i.e. not in any language) ; 49 in 'mul' (i.e. multiple languages) ; 31 are in 'de', and 2 each for la and no
so, if you want more diversity, just look at the French version!
On 04/11/2020 03:03, Elisa Beshero-Bondar wrote:
Dear Council, After I had to exit early from this morning's Stylesheets meeting I was eager to see how Syd's "Copia" output of the Guidelines turned out. (For those who missed it, Syd found a quick way to output the Guidelines in a way that would prevent suppression of examples in other languages, and this seems mainly to affect the way the Guidelines Spec pages turn out (not the Guidelines chapter examples). I saw in the minutes that we're asked to review Syd's interesting output, named "Copia" (for more copious examples) and report back to the Council list. So started exploring element spec pages a little this evening. Here's what I'm finding:
The bibl element is much improved by the Copia output: the examples are far more diverse, but not bewildering in number. I think it is fascinating to see non-English variety:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-bibl.h... * Current: https://tei-c.org/release/doc/tei-p5-doc/en/html/ref-bibl.html
The byline element is similarly a lot more interesting now:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-byline... * Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-byline.html
The gap element seems similarly improved, though there's a little problem here:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-gap.ht... * Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-gap.html
What I noticed right away was, the number of examples did not seem overwhelming, and the variety of languages evident before you had to click "Show All" reflects a good diversity with Asian languages rising to the fore. However(!) the duplicate example jumps out--duplicates in English. Turns out the French exemplum is identical to the English here, and I"m not really sure why. I think I've seen that in other specs as well.
Meanwhile, the element p looks a bit more interesting now, and I think it improves on the current output:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-p.html
* Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-p.html
How about ab? Completely boring: no change at all, and it's sadly evident that we don't have enough examples in the first place. (If that isn't a ticket yet, it should be!)
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-ab.htm... * Current: https://tei-c.org/release/doc/tei-p5-doc/en/html/ref-ab.html
Okay-that's a start: a review of 5 out of 582 element specs (less than one percent...sigh). But so far I like the Copia version for what it illuminates about our range of exempla, both where it is diverse and where it is lacking.
Looking forward to hearing what others think! Elisa
-- Elisa Beshero-Bondar, PhD Program Chair of Digital Media, Arts, and Technology | Professor of Digital Humanities | Director of the Digital Humanities Lab at Penn State Erie, The Behrend College Development site: https://newtfire.org https://newtfire.org/
_______________________________________________ Tei-council mailing list Tei-council@lists.tei-c.org http://lists.lists.tei-c.org/mailman/listinfo/tei-council
Tei-council mailing list Tei-council@lists.tei-c.org http://lists.lists.tei-c.org/mailman/listinfo/tei-council
Dear Lou,
The “show all” button shows examples from the Specs and the Guidelines. We only wanted to have a look at the examples in the Specs.
Best,
Martina
Von: Tei-council
Hi all, This does reveal some examples that might need a bit of fixing, that have presumably been unexamined for a good while: <byline>Par le docteur<docAuthor>Martin Becanus,</docAuthor> au tome troisiesme de ses opuscules</byline> I would say the comma should be outside the docAuthor tag, and there should be a space after docteur. So I think this is a useful output for Council to look at from that point of view too. Cheers, Martin On 2020-11-03 7:03 p.m., Elisa Beshero-Bondar wrote:
Dear Council, After I had to exit early from this morning's Stylesheets meeting I was eager to see how Syd's "Copia" output of the Guidelines turned out. (For those who missed it, Syd found a quick way to output the Guidelines in a way that would prevent suppression of examples in other languages, and this seems mainly to affect the way the Guidelines Spec pages turn out (not the Guidelines chapter examples). I saw in the minutes that we're asked to review Syd's interesting output, named "Copia" (for more copious examples) and report back to the Council list. So started exploring element spec pages a little this evening. Here's what I'm finding:
The bibl element is much improved by the Copia output: the examples are far more diverse, but not bewildering in number. I think it is fascinating to see non-English variety:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-bibl.h... * Current: https://tei-c.org/release/doc/tei-p5-doc/en/html/ref-bibl.html
The byline element is similarly a lot more interesting now:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-byline... * Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-byline.html
The gap element seems similarly improved, though there's a little problem here:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-gap.ht... * Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-gap.html
What I noticed right away was, the number of examples did not seem overwhelming, and the variety of languages evident before you had to click "Show All" reflects a good diversity with Asian languages rising to the fore. However(!) the duplicate example jumps out--duplicates in English. Turns out the French exemplum is identical to the English here, and I"m not really sure why. I think I've seen that in other specs as well.
Meanwhile, the element p looks a bit more interesting now, and I think it improves on the current output:
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-p.html
* Current: https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-p.html
How about ab? Completely boring: no change at all, and it's sadly evident that we don't have enough examples in the first place. (If that isn't a ticket yet, it should be!)
* Copia: https://bauman.zapto.org/~syd/temp/4TEICouncil/html_GLs_all_eg_en/ref-ab.htm... * Current: https://tei-c.org/release/doc/tei-p5-doc/en/html/ref-ab.html
Okay-that's a start: a review of 5 out of 582 element specs (less than one percent...sigh). But so far I like the Copia version for what it illuminates about our range of exempla, both where it is diverse and where it is lacking.
Looking forward to hearing what others think! Elisa
-- Elisa Beshero-Bondar, PhD Program Chair of Digital Media, Arts, and Technology | Professor of Digital Humanities | Director of the Digital Humanities Lab at Penn State Erie, The Behrend College Development site: https://newtfire.org https://newtfire.org/
_______________________________________________ Tei-council mailing list Tei-council@lists.tei-c.org http://lists.lists.tei-c.org/mailman/listinfo/tei-council
-- ------------------------------------- Humanities Computing and Media Centre University of Victoria mholmes@uvic.ca
participants (5)
-
Elisa Beshero-Bondar
-
Lou Burnard
-
Martin Holmes
-
Nicholas Cole
-
Scholger, Martina (martina.scholger@uni-graz.at)