-
Notifications
You must be signed in to change notification settings - Fork 717
[css-text] Providing alternative breaking behaviours for Ethiopic #4765
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Yes. Requiring a minimum of X letters on a line is not addressed, in css-text-3, but it expected to be addressed in css-text-4. See https://drafts.csswg.org/css-text-4/#last-line-limits
This bit of text, to be found under the definition of the
I think this is clear enough, but if you have a suggestion for improvement, or a concern about some of it, feedback is very much welcome. On the other hand, an example about using |
Thanks @frivoal. The 'around punctuation' text is a little vague for me. Note also that none of the major browser engines does what you'd expect here. Try changing the width of the bounding box in this test. You'll see that the wordspace wraps to the next line alone. It shouldn't do that, and note in particular that none of them wrap the wordspace to the next line by default (try this test), so i'm assuming that the browser implementers all misunderstood the point here too, since they made a change that does the wrong thing. Here's some suggested text (inline markup showing here, but just for C&P convenience): As a final example, in modern use of the Ethiopic script words are surrounded by spaces and usually wrap, unbroken, to the next line. Sometimes, however, Ethiopic may be written with I can provide a screen shot of some Amharic text, if you like. |
A bichromatic scan of the work that @r12a referenced is here: https://drive.google.com/open?id=1wJm53QevGzAZGBiMHBnGB6oPfhP-sA8B , the copyright declaration is no longer applicable. The work also presents an example of justification in presence of the wordspace. |
Clarification on this point: it's not about the last line in a para, but rather about the last word on a (any) line, ie. no word is broken such that only the last character in the word appears at the start of a line. (I vaguely remember hearing about a similar rule recently, but i can't remember which script/language was the context. So this might possibly be a rule that affects other languages than those that use Ethiopic script.) |
I wonder if there's a way we could tie this in to the |
@r12a Breaking before Ethiopic Word Space in that test case looks like a mistake. For example, UAs don't break before commas and periods and colons. I think this might just need a WPT testcase and some bugs filed. We can also put an example in the spec, given some sample text. @astearns Good point, though I think this is a little different than hyphenation in that you can break between any two characters, not a particular lexically-allowed points in the word. It would be nice to re-use the same controls, though. |
Alright, edited in Ethiopic as one of the writing samples for word-break. @r12a Let me know if the text looks correct. I suggest we file break limits as a separate issue. |
Thanks @fantasai. I have a small suggestion: add a link to https://www.w3.org/TR/css-text-3/#word-separator on the following text.
That will clarify that we're talking about both spaces and ethiopic word space characters. |
done: 1feb256 |
@r12a, if you agree to file the break limits as a separate issue (against level 4), I think we're done here and can close. Can you confirm? |
… a=testonly Automatic update from web-platform-tests Add tests for word breaking in Ethiopic See w3c/csswg-drafts#4765 -- wpt-commits: 6555517ac5179b32a26705f2e27f0c8daadc59ec wpt-pr: 37697
… a=testonly Automatic update from web-platform-tests Add tests for word breaking in Ethiopic See w3c/csswg-drafts#4765 -- wpt-commits: 6555517ac5179b32a26705f2e27f0c8daadc59ec wpt-pr: 37697
https://www.w3.org/TR/css-text-3/#word-break-property
https://www.w3.org/TR/css-text-3/#line-break-property
Modern Ethiopic text is generally wrapped word by word. If wordspace separators are used, they are wrapped with the word, and should not appear alone at the beginning of a line.
However, older Ethiopic text is generally wrapped wherever it hits the right margin, whether wordspace or space are used to separate words, and no hyphenation occurs.
Observation: It's possible that a rule is sometimes applied to letter-based wrapping that requires a minimum of 2 letters at the end of a line for printed text (as opposed to handwritten manuscripts). This was observed by Daniel Yacob in the book, "ዜናዊ ፓርልማ" from 1953 (1946EC).
Whatever style of wrapping is used, however, punctuation wrapping rules apply, which means that a wordspace separator should not appear at the start of a line, nor various other punctuation, even when letter-by-letter wrapping occurs.
So my question is: how can an Ethiopic author can apply the different wrapping styles to Ethiopic?
My best guess is that
line-break:anywhere
is not appropriate, since it doesn't respect punctuation rules. However,word-break:break-all
may be the right thing, although it doesn't specifically mention punctuation-specific rules. Am i correct? It wasn't abundantly clear from reading the spec.The text was updated successfully, but these errors were encountered: