Orthography and phonology
The phonology and orthography of Ŋarâþ Crîþ can be divided into eight layers in two modes (writing and speaking):
- Layer 0 is the underlying morphographemic representation. In this grammar, text in this layer is written in double square brackets: ⟦tanc-a⟧.
- Layer 1 is the graphemic representation. This representation is subsequently exported to the spoken and written modes. Text in this layer is written with angle brackets: ⟨tanca⟩.
- Layer 2w is the surface glyphic representation. This represents the sequence of Cenvos glyphs that is written, observing required ligatures and final forms. Text in this layer is written in angle brackets prefixed with a superscript 2: ²⟨tanca⟩; for a more interesting example, ⟨mencoc⟩ becomes ²⟨mencoc$⟩.
- Layer 2w* is an intermediate layer between 2w and 3w, in which discretionary ligatures are introduced to 2w text. For instance, ²⟨#flirora⟩ can be realized as ²*⟨#fliro ra⟩.
- Layer 3w is the topological representation, showing optional ligatures as well as stroke order variations. Text in this layer is written in angle brackets prefixed with a superscript 3: ³⟨t1α a1γ n1α c1α a1α ⟩. More interestingly, ²⟨mencoc$⟩ could become ³⟨me1α n1α c1α o1α c$1α ⟩.
- Layer 4w is the presentational representation, adding to 3w variations in the strokes themselves and how strokes within a glyph are joined. Text in this layer is written in angle brackets prefixed with a superscript 4: ⁴⟨t1α a1γ n1α c1α a1α ⟩.
- Layer 2s is the phonemic representation. We use slashes for this, as usual: /tanka/.
- Layer 3s is the phonetic representation, or what is pronounced. We use square brackets for this, as usual: [tʰa⁴ɲcʰa²].
The conversions from 0 to 1, 1 to 2w, and 2s to 3s are functional: each valid input corresponds to exactly one output. The conversion from 1 to 2s is almost so, except when a ⟨&⟩ is present. In the opposite direction, the conversions from 4w to 3w, from 3w to 2w*, and from 2w* to 2w are functional. Furthermore, for any conversion, it can be determined whether a given input can be converted into a given output without external information.
In addition, the conversion between 1 and 2w is bijective: valid layer-1 and layer-2w representations can be paired with each other.
Layers 0, 1, and 2w: Cenvos and its romanization
Cenvos, the native script of Ŋarâþ Crîþ, is written from right to left. This script can be analyzed on two levels: graphemes, which constitute the abstract level, and glyphs, which are the characters being written. For instance, Cenvos has one grapheme romanized as ⟨c⟩ that corresponds to two different glyphs: the non-final form 𐲀𐲢 (denoted as ²⟨c⟩) and the final form 𐲀 (²⟨c$⟩). As another example, the sequence 𐲌𐲁 (⟨me⟩ = ²⟨me⟩) consists of one glyph but two graphemes.
In this grammar, we primarily use the romanization, whose symbols largely map one-to-one with Cenvos graphemes. Cenvos has four kinds of graphemes:
- True letters are graphemes that represent sounds.
- Markers, while considered letters, do not represent sounds. Instead, they indicate that the words affected are treated specially. They occur on the level of a word and do not actively participate in morphology.
- Punctuation includes the clause-end punctuation ⟨.⟩, ⟨;⟩, ⟨?⟩, and ⟨!⟩; the clitic boundary mark ⟨’⟩; the lenition mark ⟨·⟩; the grouping brackets ⟨{}⟩; and the quotation marks ⟨«»⟩.
- Digits can be used to write short numerals.
Of course, there is also the space. Layer 0 also contains the morpheme boundary, ⟦-⟧.
Cen | Name | Rom | Cen | Name | Rom | Cen | Name | Rom |
---|---|---|---|---|---|---|---|---|
True letters | ||||||||
𐲀𐲢 | ca | c | 𐲌 | ma | m | 𐲘 | ar | h |
𐲁 | e | e | 𐲍 | a | a | 𐲙 | ħo | ħ |
𐲂 | na | n | 𐲎 | fa | f | 𐲚 | ên | ê |
𐲃𐲢 | ŋa | ŋ | 𐲏 | ga | g | 𐲛 | ôn | ô |
𐲄 | va | v | 𐲐 | pa | p | 𐲜 | ân | â |
𐲅 | o | o | 𐲑 | ta | t | 𐲝 | uħo | u |
𐲆 | sa | s | 𐲒 | ča | č | 𐳀 | cełaŋa | w |
𐲇 | þa | þ | 𐲓 | în | î | 𐳁 | avarte | x |
𐲈 | ša | š | 𐲔 | ja | j | 𐳂 | priþnos | y |
𐲉 | ra | r | 𐲕 | i | i | 𐳃 | telrigjon | z |
𐲊 | la | l | 𐲖 | da | d | |||
𐲋 | ła | ł | 𐲗 | ða | ð | |||
Final forms and ligatures (layer 2w) | ||||||||
𐲀 | c$ | 𐲌𐲁 | me | 𐳀𐳀 | ww | |||
𐲃 | ŋ$ | 𐲌𐲌 | mm | 𐳁𐳁 | xx | |||
𐲁𐲁 | ee | 𐲔𐲜 | jâ | 𐳂𐳂 | yy | |||
𐲁𐲌 | em | 𐲜𐲔 | âj | 𐳃𐳃 | zz | |||
Markers | ||||||||
𐲤 | carþ | # | 𐲦 | njor | +* | 𐲨 | nef | * |
𐲥 | tor | + | 𐲧 | es | @ | 𐲯 | sen | & |
Punctuation | ||||||||
𐲞 | gen | . | 𐲩 | ŋos | ’ | 𐲭 | fos | « |
𐲟 | tja | ; | 𐲪 | łil | · | 𐲮 | þos | » |
𐲠 | šac | ? | 𐲫 | rin | { | 𐳄 | jedva | / |
𐲡 | cjar | ! | 𐲬 | cin | } | 𐲣 | mivaf·ome | - |
The letters ⟨w⟩, ⟨x⟩, ⟨y⟩, and ⟨z⟩ are USR letters. These are used in foreign languages written in Cenvos to represent phonemes that are not approximated by the phonology of Ŋarâþ Crîþ. Each foreign orthography is free to assign them as it pleases.
Cenvos has two graphemes that change form at the end of the word: ⟨c⟩ and ⟨ŋ⟩, as well as several ligatures. We do not distinguish these forms in the romanization.
The marker ⟨*⟩ is used for foreign words, such as loanwords and foreign names. ⟨#⟩ is used to prefix given names. ⟨+⟩ is used to prefix surnames passed by native conventions (i.e. from parent to child within the same gender); ⟨+*⟩ marks a surname passed using non-native conventions. Place names are prefixed with ⟨@⟩. ⟨#⟩, ⟨+⟩, ⟨+*⟩, and ⟨@⟩ can all be used with ⟨*⟩, in which case ⟨*⟩ occurs first. Note that ⟨+*⟩ is a single letter of its own and not a ligature.
At the start of a word, ⟨&⟩ indicates reduplication of an unspecified prefix of the rest of the word. For instance, ⟨&cên⟩ can be pronounced as if it were ⟨cêcên⟩ or ⟨cêncên⟩. (⟨&⟩ occurs after all other markers in this case.) This usage is not productive in standard Ŋarâþ Crîþ, but it appears in a few words, as well as in some idiosyncratic cases. In the middle or at the end of a word, or alone, it indicates ellipsis of part or all of the word, most often to abbreviate or censor a word. Lastly, ⟨&{}⟩ is used similarly to the ellipsis in Western punctuation.
Markers can be applied to multi-word strings by surrounding the string with the delimiters ⟨{}⟩. In legal language, ⟨{}⟩ are also used around phrases to resolve ambiguities.
The sentence punctuation ⟨.⟩, ⟨?⟩, and ⟨!⟩ are used as expected. ⟨;⟩ is used to separate two independent clause phrases within the same sentence. The quotation marks, ⟨«»⟩, are used around quotations, direct or indirect. A ⟨.⟩ at the end of a quotation embedded within another sentence is omitted.
⟨’⟩ is used to separate clitics from the rest of the word to which they are attached. ⟨·⟩ indicates lenition; it could be described as a “letter modifier”. It is also used as a decimal point: officially, it is used after the most significant digit of an inexact numeral when written with digits, but it is also used unofficially to write non-integers.
⟨/⟩, as its derivation from ⟨i⟩ suggests, is used to separate the number of mjari from the number of edva when writing currency amounts.
Spaces are placed in the following places:
- between orthographic words, but not between a clitic and the word to which it is attached
- after (but not before) ⟨.⟩, ⟨;⟩, ⟨?⟩, and ⟨!⟩
- before ⟨«⟩ and after ⟨»⟩ (but not on the other sides)
- around ⟨&{}⟩
[TODO: cover mentions of letters within the language, corresponding to v7 p17 “When letters or markers are referred to, … but the effects on other glyphs are not standardized”]
Digits are interchangeable with short-form numerals, but not with long-form numerals. They are also written right-to-left in Cenvos, with the most significant digit first: 𐲲 is 0x2A3 = 675.
Cen | # | Cen | # | Cen | # | Cen | # |
---|---|---|---|---|---|---|---|
𐲰 | 0 | 𐲱 | 1 | 𐲲 | 2 | | 3 |
| 4 | | 5 | | 6 | | 7 |
| 8 | | 9 | | A | | B |
| C | | D | | E | | F |
Phonotactics
We express the phonotactic rules of Ŋarâþ Crîþ in terms of layer 0.
A manifested grapheme phrase is either a true letter not followed by a lenition mark (plain letter), any of ⟦p t d č c g m f v ð⟧ followed by a lenition mark (lenited letter), or, word-initially, one of the digraphs ⟦mp vp dt nd gc ŋg vf ðþ lł⟧ (eclipsed letter). All other graphemes are ignored for the purposes of phonotactics.
A manifested grapheme phrase has a base letter. The base letter of a plain letter is itself. The base letter of a lenited letter is the letter without the lenition mark. The base letter of an eclipsed letter is the second letter of the digraph.
A vowel is any of ⟦e o a î i ê ô â u⟧. ⟦j⟧ is a semivowel. All other manifested grapheme phrases are consonants.
An effective plosive is a manifested grapheme phrase whose base letter is any of ⟦p t d c g⟧. An effective fricative is a manifested grapheme phrase whose base letter is any of ⟦f v þ ð s š h ħ⟧.
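The definitions above can be sketched as a small tokenizer. This is an illustrative sketch only: the function and constant names are hypothetical, the input is assumed to be a single romanized word in precomposed Unicode, and an eclipsed digraph is recognized only when it is the first manifested grapheme phrase of the word (markers between the two letters of a digraph are not handled).

```python
# Illustrative sketch; all names here are hypothetical, not part of the grammar.
TRUE_LETTERS = set("cenŋvosþšrlłmafgptčîjidðhħêôâuwxyz")
LENITABLE = set("ptdčcgmfvð")
ECLIPSED = ("mp", "vp", "dt", "nd", "gc", "ŋg", "vf", "ðþ", "lł")
VOWELS = set("eoaîiêôâu")
PLOSIVE_BASES = set("ptdcg")
FRICATIVE_BASES = set("fvþðsšhħ")
LENITION_MARK = "·"


def manifested_grapheme_phrases(word):
    """Split one word into (text, kind, base_letter) triples."""
    phrases = []
    i = 0
    first = True  # True until the first phrase of the word has been consumed
    while i < len(word):
        ch = word[i]
        if ch not in TRUE_LETTERS:
            i += 1  # markers, punctuation and '-' are ignored
            continue
        if first and word[i:i + 2] in ECLIPSED:
            # Eclipsed letter: the base letter is the second letter.
            phrases.append((word[i:i + 2], "eclipsed", word[i + 1]))
            i += 2
        elif ch in LENITABLE and word[i + 1:i + 2] == LENITION_MARK:
            # Lenited letter: the base letter is the letter without the mark.
            phrases.append((ch + LENITION_MARK, "lenited", ch))
            i += 2
        else:
            phrases.append((ch, "plain", ch))
            i += 1
        first = False
    return phrases


def sound_class(phrase):
    """Classify a phrase as 'vowel', 'semivowel' or 'consonant'."""
    text = phrase[0]
    if text in VOWELS:
        return "vowel"
    if text == "j":
        return "semivowel"
    return "consonant"


def is_effective_plosive(phrase):
    return phrase[2] in PLOSIVE_BASES


def is_effective_fricative(phrase):
    return phrase[2] in FRICATIVE_BASES
```

For instance, `manifested_grapheme_phrases("d·enfo")` begins with the lenited phrase `("d·", "lenited", "d")`, which counts as an effective plosive because its base letter is ⟦d⟧.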
A word consists of one or more syllables, each of which has an initial, a medial, a nucleus, and a coda. An initial consists of one of the following:
- nothing at all
- a single consonant
- an effective plosive or fricative plus ⟦r⟧ or ⟦l⟧
- any of ⟦cf cþ cs cš gv gð tf dv⟧; that is, a plosive plus a fricative of the same voicing, such that the plosive has a more retracted place of articulation than the fricative
The only valid medial, if present, is ⟦j⟧. A nucleus is a vowel.
A coda is either a simple coda or a complex coda. A simple coda is one of ⟦s r n þ rþ l t c f m⟧ or nothing at all. A complex coda is one of ⟦st lt ns ls nþ cþ⟧. While complex codas are allowed in any syllable in layer 0, instances of such codas in the middle of a syntactic word are simplified during the conversion to layer 1, and such instances immediately before a clitic boundary are simplified during the conversion to layer 2s. The coda ⟦-m⟧ is used in only a few words.
In addition, ⟦h⟧ is forbidden word-initially. Doubled consonants and vowels are allowed.
If there is more than one way to split a word into syllables, the maximal-onset principle is used. However, clitic boundaries always start a new syllable.
An onset is an initial plus a medial. A bridge is the coda of one syllable plus the onset of the following syllable.
Conversion from layer 0 to layer 1
The following changes are applied as a part of morphology. They occur only when the subsequence involved in a change (that is, the substring being replaced as well as the environment that triggers the change) crosses a morpheme boundary but not a word boundary. For instance, ⟦*@vav-el⟧ becomes ⟨*@vavel⟩ instead of ⟨*@navel⟩. For clarity, however, we omit any ⟦-⟧s from the rules below. (These changes apply from left to right.)
- v → n / _ V[-creaky] {v, m·}
- ð → ŋ / _ V[-creaky] {ð, d·}
Here, “V[-creaky]” means any of ⟦e o a i u⟧.
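The two dissimilation rules above can be sketched as follows. This is a rough illustration under stated assumptions: the input is a layer-0 string with ⟦-⟧ as the morpheme boundary, "crossing a morpheme boundary" is read as "a ⟦-⟧ lies strictly inside the target-vowel-trigger span", and the function names are hypothetical. The inputs `va-vel`, `ða-d·el`, and `vê-vel` in the usage note are made-up strings, not attested words.

```python
# Illustrative sketch; names and the boundary interpretation are assumptions.
PLAIN_VOWELS = set("eoaiu")  # V[-creaky]
RULES = {
    "v": ("n", ("v", "m·")),  # v → n / _ V[-creaky] {v, m·}
    "ð": ("ŋ", ("ð", "d·")),  # ð → ŋ / _ V[-creaky] {ð, d·}
}
BOUNDARY = "-"
LENITION_MARK = "·"


def _triggered(s, i, triggers):
    """True if s[i] is followed by V[-creaky] plus a trigger across a boundary."""
    j = i + 1
    crossed = False
    if s[j:j + 1] == BOUNDARY:
        crossed, j = True, j + 1
    if s[j:j + 1] not in PLAIN_VOWELS:
        return False
    j += 1
    if s[j:j + 1] == BOUNDARY:
        crossed, j = True, j + 1
    for t in triggers:
        if s.startswith(t, j):
            following = s[j + len(t):j + len(t) + 1]
            # A bare trigger letter must not itself be lenited (e.g. v·).
            if t.endswith(LENITION_MARK) or following != LENITION_MARK:
                return crossed
    return False


def dissimilate(layer0):
    """Apply both rules left to right, then drop the morpheme boundaries."""
    out = []
    for i, ch in enumerate(layer0):
        if ch == BOUNDARY:
            continue
        if ch in RULES and _triggered(layer0, i, RULES[ch][1]):
            out.append(RULES[ch][0])
        else:
            out.append(ch)
    return "".join(out)
```

With this reading, `dissimilate("*@vav-el")` leaves the word as `*@vavel`, matching the example above, while the made-up input `va-vel` becomes `navel`. Because each rule looks only at what follows the target, a single left-to-right pass over the original string is enough.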
The following changes are made to simplify complex codas within a syntactic word if (and only if) the consonant cluster cannot be reinterpreted to avoid the mid-word complex coda.
- stšr → šr
- stšl → šl
- stš → sč
- sts → st
- st → t / _ C[+nasal]
- st → s / _ C
- ltšr → ltr
- ltšl → ltl
- ltš → lč
- lts → ls
- lt → t / _ C[+nasal]
- lt → l / _ C
- ss → þ / {n, l} _
- ns → n / _ C[+obstruent]²
- C[+coronal, -voiced] → ∅ / ns _
- ns C[+coronal, +voiced] → nð
- ns {ħ, g·} → nð
- ns C[+dorsal, -voiced] → nh
- ns C[+dorsal, +voiced] → ŋ / _ V
- ns C[+dorsal, +voiced] → n
- ns C[+labial, -voiced] → nf
- ns C[+labial, +voiced] → nv
- ls → l / _ C[+obstruent]²
- C[+coronal, -voiced] → ∅ / ls _
- ls C[+coronal, +voiced] → lð
- ls {ħ, g·} → lð
- ls C[+dorsal, -voiced] → lh
- ls C[+dorsal, +voiced] → lħ
- ls C[+labial, -voiced] → lf
- ls C[+labial, +voiced] → lv
- nþ → þ / _ C[+obstruent]²
- nþ C[-voiced] → nþ
- nþ C[+voiced] → nð
- cþ → þ / _ C[+obstruent]
- cþ C[+nasal] → nþ
Here, the consonant graphemes are considered to be organized in the following way based on their pronunciations (with voiceless/voiced pairs):
| Labial | Coronal | Dorsal | Other |
---|---|---|---|---|
Obstruent | p, f, p· / v, vp | t, č, þ, s, š, ł, t·, č· / d, ð, dt, d·, ðþ | c, h, c· / g, gc | / ħ, g· |
Nasal | / m, mp | / n, nd | / ŋ, ŋg | |
Other | / l, lł, r |
Finally, the ⟦j⟧ is removed from any instances of ⟦ji jî ju⟧.
Letter numbering
Sometimes, an integer must be assigned to each letter. In this case, the assignment shown in the table below is used. Note that numbers are not assigned fully sequentially. Furthermore, this function is valid only for layer 1 graphemes.
Letter | Hex | Dec | Letter | Hex | Dec | Letter | Hex | Dec |
---|---|---|---|---|---|---|---|---|
True letters | ||||||||
c | 0 | 0 | m | 20 | 32 | h | 11 | 17 |
e | 1 | 1 | a | 9 | 9 | ħ | 12 | 18 |
n | 2 | 2 | f | A | 10 | ê | 101 | 257 |
ŋ | 2B | 43 | g | B | 11 | ô | 104 | 260 |
v | 3 | 3 | p | C | 12 | â | 109 | 265 |
o | 4 | 4 | t | D | 13 | u | 13 | 19 |
s | 5 | 5 | č | DE | 222 | w | −1 | −1 |
þ | 55 | 85 | î | E | 14 | x | −2 | −2 |
š | 5E | 94 | j | 6E | 110 | y | −3 | −3 |
r | 6 | 6 | i | F | 15 | z | −4 | −4 |
l | 7 | 7 | d | 10 | 16 | |||
ł | 77 | 119 | ð | 155 | 341 | |||
Markers | ||||||||
# | 14 | 20 | +* | 16 | 22 | * | 19 | 25 |
+ | 15 | 21 | @ | 17 | 23 | & | 1A | 26 |
The letter sum of a word is the sum of the numbers assigned to all of its letters. This value is used in some of the noun declension paradigms.
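As a minimal sketch of the letter sum, the table above can be transcribed into a dictionary. The names are hypothetical, and two points are assumptions rather than statements from the grammar: markers are counted because they are considered letters, and graphemes without an assigned number (such as the lenition mark) contribute nothing.

```python
# Illustrative sketch; values are transcribed from the letter numbering table.
LETTER_NUMBERS = {
    "c": 0x0, "e": 0x1, "n": 0x2, "ŋ": 0x2B, "v": 0x3, "o": 0x4,
    "s": 0x5, "þ": 0x55, "š": 0x5E, "r": 0x6, "l": 0x7, "ł": 0x77,
    "m": 0x20, "a": 0x9, "f": 0xA, "g": 0xB, "p": 0xC, "t": 0xD,
    "č": 0xDE, "î": 0xE, "j": 0x6E, "i": 0xF, "d": 0x10, "ð": 0x155,
    "h": 0x11, "ħ": 0x12, "ê": 0x101, "ô": 0x104, "â": 0x109, "u": 0x13,
    "w": -1, "x": -2, "y": -3, "z": -4,
    "#": 0x14, "+": 0x15, "+*": 0x16, "@": 0x17, "*": 0x19, "&": 0x1A,
}


def letter_sum(word):
    """Sum the letter numbers of a layer-1 word (assumption: unknowns count 0)."""
    total = 0
    i = 0
    while i < len(word):
        if word.startswith("+*", i):
            # ⟨+*⟩ is a single marker letter, not ⟨+⟩ followed by ⟨*⟩.
            total += LETTER_NUMBERS["+*"]
            i += 2
        else:
            total += LETTER_NUMBERS.get(word[i], 0)
            i += 1
    return total
```

For example, `letter_sum("tanca")` is 13 + 9 + 2 + 0 + 9 = 33.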
It is theorized that letter numbers were assigned in the following manner:
- The basic true letters inherited from Necarasso Cryssesa (i.e. those corresponding to ⟨c e n v o s r l m a f g p t î i d h⟩) received sequential numbers from zero. The number of ⟨m⟩ was changed due to superstitions against the number eight.
- ⟨ŋ þ š ł č ð⟩ received numbers based on what letter pairs (or triplets in the case of ⟨ð⟩) they were based on.
- ⟨ê⟩, ⟨ô⟩, and ⟨â⟩ were numbered as 256 + base glyph number.
- The other letters and the markers received sequential numbers after ⟨h⟩, skipping 0x18.
Collation
The true letters and the markers are collated in their respective order, except for ⟨&⟩, which is ignored. Lenited letters are treated as their respective base letters, except when two words differ only by the presence or absence of a lenition mark, in which case the lenited variant is collated after the base letter: ⟨saga⟩ < ⟨sag·a⟩ < ⟨sada⟩ < ⟨saħa⟩. Numerals are collated after all letters.
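The ordering of true letters can be sketched as a sort key. This is a simplified illustration with hypothetical names: it assumes that "their respective order" is the column-by-column order of the letter table, and it leaves out markers other than ⟨&⟩ as well as numerals.

```python
# Illustrative sketch; ORDER assumes the column-by-column order of the table.
ORDER = "cenŋvosþšrlłmafgptčîjidðhħêôâuwxyz"
RANK = {letter: index for index, letter in enumerate(ORDER)}
LENITION_MARK = "·"


def collation_key(word):
    """Primary key: base letters. Secondary key: which letters are lenited."""
    bases = []
    lenited = []
    for ch in word:
        if ch == LENITION_MARK and lenited:
            lenited[-1] = 1  # mark the preceding letter as lenited
        elif ch in RANK:
            bases.append(RANK[ch])
            lenited.append(0)
        # ⟨&⟩ and anything else not listed is skipped
    return (tuple(bases), tuple(lenited))
```

Sorting with `sorted(words, key=collation_key)` reproduces the order ⟨saga⟩ < ⟨sag·a⟩ < ⟨sada⟩ < ⟨saħa⟩ given above, since the lenition flags are compared only when the base letters are identical.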
In a directory of personal names, entries are collated on surnames, with given names considered only when surnames are identical. Headings in such a list include the prefix up to and including the first true letter: ⟨+merlan #flirora⟩ would be found under ⟨+m⟩.
Ordered items can be labeled using numerals (starting from 0) or letters. In the latter case, only the letters ⟨c e n v o s r l m a f g p t î i d h⟩ are used.
Numquotes
A digit immediately preceding text surrounded by quotation or grouping marks constitutes a numquote. The digit is usually not pronounced in this case. Numquotes are mainly used for secondary purposes that lack any dedicated punctuation.
Numquote | Meaning |
---|---|
B{} | Contains parenthetical information: provides supplementary information. The sentence should still make sense without the parenthetical content. |
1{} | Lists an alias of a referent mentioned by name. |
2{} | Surrounds a key-value list. Used as such: ⟨2{3{&{}} 4{&{}} 3{&{}} 4{&{}}}⟩ |
3{} | Used for listing a key inside ⟨2{}⟩. |
4{} | Used for listing a value inside ⟨2{}⟩. When not directly inside a ⟨2{}⟩ numquote, marks a list: elements are delimited by spaces, and ⟨{}⟩ can be used to insert multi-word elements. |
9{} | Used to contain abbreviated quantities in the traditional currency system. |
*9{} | Used to contain abbreviated quantities in a currency system other than the traditional one. |
Layer 2s
Before the rest of the conversion to layer 2s, the complex-coda-simplifying changes are applied again, this time to complex codas before clitic boundaries or at the end of a word. (That is, any occurrences of ⟨’⟩ are ignored this time.)
Traditionally, only manifested grapheme phrases are considered to be significant in the conversion from layer 1 to layer 2s. However, other graphemes such as punctuation can affect prosody.
MGPs | IPA | MGPs | IPA |
---|---|---|---|
c | k | p | p |
e | e | t | t |
n nd | n | č | t͡ʂ |
ŋ ŋg | ŋ | î | ì |
v m· vp | v | j | j |
o | o | i | i |
s | s | d dt | d |
þ t· | θ | ð d· ðþ | ð |
š č· | ʂ | h c· | x |
r | ɹ | ħ g· | ʕ |
l lł | l | ê | è |
ł | ɬ | ô | ò |
m mp | m | â | à |
a | a | u | u̜ |
f p· | f | f· v· ð· | ∅ |
g gc | ɡ |
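The table above can be read as a lookup from manifested grapheme phrases to phonemes. The following is a minimal sketch with hypothetical names. It includes only the entries listed in the table, treats two-letter eclipsed entries as digraphs only at the start of a word, skips all other graphemes, and does not perform the coda simplification or handle ⟨&⟩.

```python
# Illustrative sketch; entries are transcribed from the table above.
MGP_TO_IPA = {
    "c": "k", "e": "e", "n": "n", "nd": "n", "ŋ": "ŋ", "ŋg": "ŋ",
    "v": "v", "m·": "v", "vp": "v", "o": "o", "s": "s",
    "þ": "θ", "t·": "θ", "š": "ʂ", "č·": "ʂ", "r": "ɹ",
    "l": "l", "lł": "l", "ł": "ɬ", "m": "m", "mp": "m", "a": "a",
    "f": "f", "p·": "f", "g": "ɡ", "gc": "ɡ", "p": "p", "t": "t",
    "č": "t͡ʂ", "î": "ì", "j": "j", "i": "i", "d": "d", "dt": "d",
    "ð": "ð", "d·": "ð", "ðþ": "ð", "h": "x", "c·": "x",
    "ħ": "ʕ", "g·": "ʕ", "ê": "è", "ô": "ò", "â": "à", "u": "u̜",
    "f·": "", "v·": "", "ð·": "",  # lenited f, v, ð are silent
}
LENITION_MARK = "·"


def to_phonemic(word):
    """Map a layer-1 word to a layer-2s phoneme string (simplified)."""
    out = []
    i = 0
    at_start = True  # eclipsed digraphs are only recognized word-initially
    while i < len(word):
        two = word[i:i + 2]
        if two in MGP_TO_IPA and (two.endswith(LENITION_MARK) or at_start):
            out.append(MGP_TO_IPA[two])
            i += 2
            at_start = False
        elif word[i] in MGP_TO_IPA:
            out.append(MGP_TO_IPA[word[i]])
            i += 1
            at_start = False
        else:
            i += 1  # markers and punctuation are not significant here
    return "".join(out)
```

For example, `to_phonemic("tanca")` gives `tanka`, matching the /tanka/ shown in the overview of layers.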
Layer 2s has a two-way tone contrast between vowels: the high tone (H) is the default, being contrasted with the low tone (L). For historical reasons, the presence or absence of a low tone on a vowel is called [±creaky].
Layer 3s
The conversion from layer 2s to layer 3s is comparatively complex.
First, the following changes are made:
- kθ → x͡θ
- ʕ → ħ / V[+creaky] _
- n → m / _ C[+labial]
- n → ɱ / _ C[+labiodental]
- n → n̪ / _ C[+dental]
- n → ɳ / _ C[+retroflex]
- n C₁[+velar] → ɲ C₁[+palatal]
- n → ŋ / _ C[+lateral] V[+front]
- sʂ → ʂː
- C₁={ɹ, ɬ} → w / C₁V _
- l → ɾ / V[+back] _ V
- θ → θ̠ / s_, _s
- ʂj → ʃ
- ʂ → ʃ / _ i
- t͡ʂj → t͡ʃ
- t͡ʂ → t͡ʃ / _ i
- C₁[+voiced] → C₁[-voiced, -aspirated] / C₂[-voiced]
Plosives in a coda are unreleased. All unvoiced plosives and affricates outside of a coda are aspirated.
While Ŋarâþ Crîþ has two tone levels phonemically, their realization at the phonetic level is more complex. It is common to describe phonetic tone using seven levels, from 0 (the lowest) to 6 (the highest). Each syllable has one or more tones.
In order to describe tone, we must introduce the concept of “stress”, which is placed according to the following rules:
- Syllables with a high tone have priority over syllables with a low tone – that is, a syllable with a low tone will be selected only if the word in question has only low-tone syllables.
- If the coda of the final syllable is either empty, or it consists of only [s] or [n], then the syllables are chosen in the order 2nd-to-last → 3rd-to-last → last → 4th-to-last → … → first.
- If the coda of the final syllable is a complex coda, then the syllables are chosen in the order last → 3rd-to-last → 2nd-to-last → 4th-to-last → … → first.
- If the coda is anything else, then the syllables are chosen from end to start: last → 2nd-to-last → 3rd-to-last → … → first.
- Monosyllabic function words generally lack any stressed syllable.
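The stress rules above can be sketched as a candidate ordering followed by a tone filter. This is an illustration only: the function name and parameters are hypothetical, the final coda is passed as a phonemic string, and the "generally" in the last rule is reduced to a simple flag.

```python
# Illustrative sketch; names and the phonemic coda encoding are assumptions.
COMPLEX_CODAS = {"st", "lt", "ns", "ls", "nθ", "kθ"}


def stressed_syllable(tones, final_coda, is_function_word=False):
    """Return the 0-based index of the stressed syllable, or None.

    tones: list of 'H' or 'L', one per syllable, from first to last.
    final_coda: coda of the final syllable ('' if empty).
    """
    n = len(tones)
    if n == 0 or (n == 1 and is_function_word):
        return None
    # Positions are counted from the end: 1 = last, 2 = 2nd-to-last, ...
    if final_coda in ("", "s", "n"):
        head = [2, 3, 1]
    elif final_coda in COMPLEX_CODAS:
        head = [1, 3, 2]
    else:
        head = [1, 2, 3]
    from_end = [k for k in head if k <= n] + list(range(4, n + 1))
    order = [n - k for k in from_end]  # convert to indices from the start
    # High-tone syllables take priority over low-tone ones.
    for index in order:
        if tones[index] == "H":
            return index
    return order[0]
```

For a two-syllable word with two high tones and an empty final coda, `stressed_syllable(["H", "H"], "")` returns 0, that is, the second-to-last syllable.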
We also introduce the concept of a tone accounting unit (TAU), which is the level at which tones are realized. That is, the tone of a syllable depends only on the contents of the TAU in which it lies. Instances of content words occupy different TAUs from each other, but some function words occupy the same TAU as the preceding or following word (in particular, such words have no stressed syllable and are confined to a relatively fixed position):
- Head particles, nominalized verb particles, and monosyllabic determiners occupy the same TAU as the following word.
- ⟨so⟩, monosyllabic relationals ... occupy the same TAU as the preceding word.
(Stress is accounted by orthographic word, not by TAU.)
First, two adjacent vowels are fused into a diphthong if the vowels are not identical, the first vowel is stressed, the second vowel is [i] or [u̜], and the syllable to which the second vowel belongs can be interpreted as having an empty coda. For purposes of tonekeeping, a diphthong is considered to be composed of two different syllables.
In general, unstressed H and L syllables have tone levels 4 and 2, respectively; stressed H and L syllables have tone levels 5 and 1. However, an open H or L syllable before a stressed syllable gets level 3 or 1, respectively, instead. Diphthongs get different values: 65 for HH, 53 for HL, 13 for LH, and 21 for LL.
If two adjacent copies of an identical vowel have the same tone level at this stage, then the one closer to the stressed syllable rises by one tone level and the one farther from it falls by one level.
A tone level of n is then changed into a tone contour in the following situations, unless doing so would result in an out-of-bounds tone level:
- n to (n : n + 1): when the coda is [st] or [x͡θ]
- n to (n : n − 1): when the coda is [rθ] or [ns]
- n to (n + 1 : n): when the nucleus is preceded by two or more voiceless consonants
In addition, other syllables change their tone levels:
- Raise the tone level by 1 (if it is not already 6) if the coda is a voiceless fricative, or if the coda is [x͡θ].
- Lower the tone level by 1 if the coda is [ɹ].
- Lower the tone level by 1 if the coda is a nasal followed by a voiced obstruent or nasal.
Finally, if all tones have a level of 4 or higher, then the lowest tone (breaking ties by preferring later tones) is lowered to 3, and all other tones in the same syllable are lowered by the same amount. All level-3 tones are then lowered to level 2.
Isochrony
The isochrony of Ŋarâþ Crîþ falls somewhere between syllable and mora timing, where:
- The body of a syllable is always 1 unit long.
- The coda of a syllable is between 0 and 1 unit long, with the hierarchy /t, k < n < l, ɹ < f, s, θ, ɹθ, kθ < st, lt, ns, ls, nθ/.
- Codas are shortened after two consecutive vowels: for instance, the ⟨l⟩ in ⟨moriel⟩ is pronounced for less time than that in ⟨mjarel⟩.
Mutations
Ŋarâþ Crîþ has two kinds of initial mutations: lenition and eclipsis. Neither kind of mutation has any effect on plosive-fricative onsets or any of ⟦r l n ŋ ħ⟧.
Lenition tends to turn plosives into fricatives and is indicated with a middle dot ⟦·⟧ after the consonant affected. In particular, it affects ⟦p t d č c g m f v ð⟧. (See Layer 2s for pronunciation details.) Partial lenition does not affect any of ⟦f v ð⟧; that is, it does not lenite consonants that would become silent. Unless otherwise qualified, lenition refers to total lenition, which also affects ⟦f v ð⟧.
In a word containing ⟦&⟧, both instances of the reduplicated prefix are lenited. For example, ⟨&d·enfo⟩ can be pronounced as [ðeðenfo] but not as *[ðedenfo].
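Lenition of a word-initial consonant can be sketched as follows. The names are hypothetical, and the sketch covers only the simplest case: a word without markers or ⟦&⟧, mutated on its first consonant.

```python
# Illustrative sketch; handles only the word-initial consonant.
LENITABLE = set("ptdčcgmfvð")
SILENT_WHEN_LENITED = set("fvð")  # skipped by partial lenition
PLOSIVE_FRICATIVE_ONSETS = ("cf", "cþ", "cs", "cš", "gv", "gð", "tf", "dv")
LENITION_MARK = "·"


def lenite(word, partial=False):
    """Return the word with its initial consonant lenited, if applicable."""
    if word.startswith(PLOSIVE_FRICATIVE_ONSETS):
        return word  # mutations do not affect plosive-fricative onsets
    first = word[:1]
    if first not in LENITABLE or word[1:2] == LENITION_MARK:
        return word  # not lenitable, or already lenited
    if partial and first in SILENT_WHEN_LENITED:
        return word
    return first + LENITION_MARK + word[1:]
```

For example, `lenite("denfo")` gives `d·enfo`, while `lenite("fa", partial=True)` leaves the made-up input `fa` unchanged.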
Lenition occurs in the following environments:
- On the stem in abessive forms of nouns in paradigms 7, 8, 9, 10, and 13
- On a noun modified by ⟨šinen⟩ or ⟨nemen⟩ when used as determiners, if that noun is not a form of ⟨ðên⟩
- Partially, on a noun modified by ⟨ruf⟩ not immediately following it
- Partially, on a noun modified by ⟨mê⟩ immediately preceding it
- On a terrestrial noun modified by a participle-form verb belonging to a Type I genus
- To a dative-case nominalized verb phrase as explained in Nominalized forms
- Partially, on a verb when receiving the comparative prefixes ⟦mir-⟧ or ⟦ła-⟧
- On a classifier attached to the numeral ⟨ces⟩ or any numeral ending in ⟨ħas⟩ or ⟨sreþas⟩
- On the second item of a compound noun, if it is neither terrestrial nor a form of ⟨vês⟩
- On a verb with the cessative prefix ⟦car-⟧ or the terminative prefix ⟦er-⟧
Eclipsis tends to add voice to voiceless consonants and change voiced stops into nasals. It is indicated by prefixing a consonant: ⟦t d c g f þ ł⟧ become ⟦dt nd gc ŋg vf ðþ lł⟧, respectively. ⟦p⟧ becomes ⟦vp⟧ before any of ⟦i e u î ê⟧ and ⟦mp⟧ elsewhere. If a word starts with a vowel, then it is eclipsed by prefixing ⟦g⟧.
In a word containing ⟦&⟧, only the first instance of the reduplicated prefix is eclipsed. For example, ⟨n&denfin⟩ can be pronounced as [nedenfin] but not as *[nenenfin].
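The eclipsis rules can be sketched in the same way. The names are hypothetical, the input is assumed to be a word without markers or ⟦&⟧, and the choice between ⟦vp⟧ and ⟦mp⟧ is assumed to depend on the letter immediately following ⟦p⟧.

```python
# Illustrative sketch; assumes a bare word with no markers.
ECLIPSIS = {
    "t": "dt", "d": "nd", "c": "gc", "g": "ŋg",
    "f": "vf", "þ": "ðþ", "ł": "lł",
}
VOWELS = set("eoaîiêôâu")
VP_TRIGGERS = set("ieuîê")  # p becomes vp before these, mp elsewhere
PLOSIVE_FRICATIVE_ONSETS = ("cf", "cþ", "cs", "cš", "gv", "gð", "tf", "dv")


def eclipse(word):
    """Return the word with its initial sound eclipsed, if applicable."""
    if not word or word.startswith(PLOSIVE_FRICATIVE_ONSETS):
        return word  # mutations do not affect plosive-fricative onsets
    first, rest = word[0], word[1:]
    if first in VOWELS:
        return "g" + word
    if first == "p":
        return ("vp" if rest[:1] in VP_TRIGGERS else "mp") + rest
    if first in ECLIPSIS:
        return ECLIPSIS[first] + rest
    return word  # r, l, n, ŋ, ħ and other consonants are left unchanged
```

For example, `eclipse("denfin")` gives `ndenfin`, and the made-up inputs `pen` and `pa` become `vpen` and `mpa`.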
Eclipsis occurs in the following environments:
- On the genitive dual, plural, and singulative forms of nouns
- On a noun modified by ⟨lê⟩ or ⟨tê⟩ immediately preceding it
- On a noun modified by ⟨dân⟩
- On a finite form of a verb or relational with perfective aspect
- To a locative, instrumental, or abessive-case nominalized verb phrase that is not an object of a modifying relational, as explained in Nominalized forms
- On a short numeral modified by ⟨ceþe⟩
Lenition can happen on any syllabic onset of a word, but eclipsis is limited to word-initial positions.
In this documentation, lenition is sometimes marked with an empty circle ○, and eclipsis with a filled circle ●. Partial lenition is marked with an empty triangle △.
Loanwords
Almost all loanwords in Ŋarâþ Crîþ are nouns. [TODO: we are reworking nouns]
Generally, when borrowing from languages that use the Cenvos script or a script related to it, and whose orthographies in the script in question do not deviate too far from Ŋarâþ Crîþ usage, Ŋarâþ Crîþ prefers to borrow the word graphemically rather than phonemically.
The typography of Ŋarâþ Crîþ
In principle, layer 2w is the highest written layer needed to write in Ŋarâþ Crîþ. (Note that there is only one valid layer-2w representation for each layer-1 string; in other words, changing a valid layer-2w string in a way that preserves the layer-1 representation always results in an invalid layer-2w string.) However, speakers of Ŋarâþ Crîþ tend to value aesthetics, even in writing. Thus, a mastery of handwriting beyond layer 2w is considered crucial.
Even though movable type has been available for a long time, prominent parts of printed materials (such as titles) often continued to use plates engraved from handwriting. Eventually, typography and calligraphy were considered parts of the same discipline, leading to typefaces supporting more features from the latter. Even today, logos often opt for lettering over typefaces. Because of this unification, we use the term typography to refer to the discipline of laying out writing in general.
Although a full treatment of Ŋarâþ Crîþ typography is out of scope for this grammar, this section gives an overview of the concerns at hand.
Kerning
Cenvos is a script that absolutely requires kerning. To start, some glyphs such as ²⟨e⟩ and ²⟨m⟩ have long leftward tails that necessitate kerning with glyphs such as ²⟨s⟩ or ²⟨o⟩, which lack descenders, or even some glyphs with descenders such as ²⟨j⟩.
Other glyphs such as ²⟨j⟩ and ²⟨ê⟩ have shorter leftward descenders that also require kerning with following glyphs.
²⟨â⟩ has a descender in the opposite direction; thus, it must kern with certain preceding glyphs.
Diagonal strokes with matching slopes (such as in ²⟨âv⟩ or ²⟨rj⟩) should be kerned to bring them closer.
Moreover, even pairs are sometimes insufficient. Since ²⟨e⟩ and ²⟨i⟩ are kerned so closely, ²⟨ei⟩ must itself kern with glyphs such as ²⟨s⟩.
Ligation and shaping
Another important aspect of typography is the use of ligatures (beyond the required ones). The concepts of higher written layers and the hierarchy of graphic variations have been developed to try to formalize this problem.
To explain the idea behind this model, we note that a good ligature will have the end of one glyph near the start of the next. The starting and ending points of a glyph, in turn, depend on the order in which the strokes are written.
Furthermore, natural handwriting tends to join certain strokes together. In some cases, this joining can affect how a glyph ligates; for instance, ³⟨a1α ⟩ cannot ligate with the previous character (ligating through the middle would cause a stroke collision with stroke 2 of ³⟨a1α ⟩), but ³⟨a1β ⟩, in which the two strokes are joined without a loop, can do so.
In addition, rapid handwriting often produces stylistic variations of glyphs. For example, ³⟨i2α ⟩ (“²⟨i⟩ with the stroke going upward”) can often end in a leftward swash at the end of the stroke. Since this deviation does not create any ambiguity, it has been accepted, yielding the stylistic variant ⁴⟨i2αS⟩.
We now cover the formalism itself. Layers 2w*, 3w, and 4w are aesthetic layers; the writer decides the precise sequence of glyphs to realize a layer-2w string in higher layers. Nonetheless, not all layer-3w or -4w strings are valid, even those that correspond to valid layer-2w strings; for instance, ³⟨s1i1⟩ is not a valid realization of ²⟨si⟩ because it requires a base-to-top ligation.
Only some glyphs participate in typesetting. Notably, all letters participate, but no numerals do so, nor does the space.
Each participating layer-2w* glyph has a hierarchy of variations as follows:
- At the top level is the layer-2w* glyph itself.
- These are divided into stroke-order variants, which differ only in stroke order. All strokes must be preserved, and no loops may be introduced or removed, but the relative stroke order might be different, and some strokes may be written in the reverse direction; furthermore, a stroke may be split at a turn, and two strokes may be joined where one ends and another begins. These are denoted with subscript numerals: ²⟨a⟩ has variants ³⟨a1⟩, ³⟨a2⟩, and ³⟨a3⟩. Variant 1 is considered the ‘canonical’ variant.
- Each stroke-order variant has one or more topological variants, which may join strokes together, cause two different strokes to touch each other when they did not (or vice versa), or introduce or remove loops. Lengthening or shortening strokes to alter ligation properties also falls under this level. Topological variants are distinguished using lowercase Greek letters. For instance, ³⟨a1⟩ has three topological variants: ³⟨a1α ⟩, ³⟨a1β ⟩, ³⟨a1γ ⟩. α is reserved for the canonical variant, which preserves all strokes, although it is not always the most common variant.
- Each topological variant has one or more stylistic variants, which can modify the strokes of the glyph themselves. For instance, ⁴⟨i2α ⟩ is the topological variant of ²⟨i⟩ in which the stroke goes from the base to the top. It has two stylistic variants: ⁴⟨i2α ⟩ is the default one, and ⁴⟨i2αS⟩ has a swash to the left at the top of the stroke. Note that the ‘canonical’ stylistic variant has no superscript letter, while the other variants do.
Layer 2w is transliterated using mostly the same symbols as the layer-1 romanization, but required ligatures are notated with an overline (such as in ²⟨me⟩ for 𐲌𐲁), and final forms are written as if they were ligatures with a special $ symbol: ²⟨c$⟩ for 𐲀. Layer 2w* introduces discretionary ligatures, which are similarly marked in our notation. By discretionary ligature, we mean a ligature that the writer may choose to use but is not obligated to use, and that cannot be derived by simply connecting the ending stroke of one glyph to the starting stroke of another.
Layer 3w works on topological variants. The overline denotes optional ligatures between topological variants; it is now omitted for required and discretionary ligatures, which are their own layer-2w* glyphs in their own right: ³⟨+1α me1α r1α l2β a1α n1α #1α f1α l2δ i1β r1α o1α r2α a3β ⟩ transliterates a particularly fancy realization of ⟨+merlan #flirora⟩.
Layer 4w works on stylistic variants. In the transliteration, the overline is used as in 3w.
Layer 3w can be thought of as the ‘ligation layer’; similarly, layer 4w can be thought of as the ‘shaping layer’.
Table 7 describes the canonical stroke order of each glyph, and Table 8 lists the stroke-order variants.
Glyph | Stroke order |
---|---|
c | (1) Counterclockwise |
e | (1) From top right to bottom left |
n | (1) From top left to bottom right |
ŋ | (1) From top right to bottom |
v | (1) From right to left |
o | (1) From top to bottom left |
s | (1) From top right to bottom left |
þ | (1) Rightmost stroke from right to left (2) Leftmost stroke from right to left |
š | (1) From top right to bottom left |
r | (1a) From bottom to top (1b) to left |
l | (1a) r-stroke from bottom to top (1b) to left (2) Intersecting stroke from right to left |
ł | (1a) o-stroke from top to bottom (1b) to left (2) Intersecting stroke from right to left |
m | (1) e-stroke from top right to bottom left (2) Intersecting stroke from right to left |
a | (1) þ-sloping stroke from left to right (2) f-sloping stroke from right to left |
f | (1) Rightmost stroke from right to left (2) Leftmost stroke from right to left |
g | (1) From top right to bottom |
p | (1) From right to bottom |
t | (1a) v-stroke from right to top (1b) to left (2) Vertical stroke from top to bottom |
č | (1) Ascending stroke from top to bottom (2) f-sloping stroke from right to left |
î | (1) From bottom right to top left |
j | (1) From top right to bottom left |
i | (1) From top to bottom |
d | (1) þ-sloping stroke from left to right (2) f-sloping stroke from right to left |
ð | (1) Leftmost þ-sloping stroke from left to right (2) Rightmost þ-sloping stroke from left to right (3) f-sloping stroke from right to left |
h | (1) From right to left |
ħ | (1) Clockwise, starting and ending at the top |
ê | (1) From top right to bottom left |
ô | (1) From top to bottom |
â | (1) From bottom right to top left |
u | (1) o-stroke from top to bottom left (2) Rightmost dot (3) Leftmost dot |
w | (1) From top to bottom |
x | (1) Stroke with descender, starting from the top-right corner and ending on the descender (2) Wave stroke, from right to left |
y | (1) From right to left |
z | (1) From right to left |
c$ | (1) From right to bottom left |
ŋ$ | (1) ŋ-stroke from top right to bottom (2) Intersecting stroke from right to left |
ee | (1) e-stroke from top right to bottom left (2) Overbar from right to left |
em | (1) e-stroke from top right to bottom left (2) Roof from right to left |
me | (1) e-stroke from top right to bottom left (2) Intersecting stroke from right to left (3) Overbar from right to left |
mm | (1) e-stroke from top right to bottom left (2) Intersecting stroke from right to left (3) Roof from right to left |
jâ | (1) j-stroke from top right to bottom left (2) Ring clockwise (starting and ending point unspecified) |
âj | (1) â-stroke from bottom right to top left (2) Ring clockwise (starting and ending point unspecified) |
ww | (1) w-stroke, from top to bottom (2) Ring clockwise (starting and ending point unspecified) |
xx | (1) Stroke with descender, starting from the top-right corner and ending on the descender (2) Wave stroke, from right to left (3) Bottom-right tick (4) Top-left tick |
yy | (1) y-stroke, from right to left (2) Tick, from top to bottom |
zz | (1) z-stroke, from right to left (2) Ring clockwise (starting and ending point unspecified) |
# | (1) From bottom right to top left |
+ | (1) From top right to bottom left |
+* | (1) From top right to bottom left (2) Vertical stroke from top to bottom (3) f-sloping stroke from top right to bottom left (4) þ-sloping stroke from bottom right to top left |
@ | (1) Vertical stroke from top to bottom (2) v-stroke from right to left |
* | (1) Vertical stroke from top to bottom (2) Horizontal stroke from right to left (3) f-sloping stroke from top right to bottom left (4) þ-sloping stroke from bottom right to top left |
& | (1) Sinusoid from right to left (2) Arrowhead |
. | (1) Main stroke from right to left (2) Arrowhead |
; | (1) Main stroke from right to left (2) Arrowhead |
? | (1) Main stroke from right to left (2) Arrowhead |
! | (1) Main stroke from right to left (2) Arrowhead |
{ | (1) From right to left |
} | (1) From right to left |
« | (1) From top to bottom |
» | (1) Vertical stroke from top to bottom (2) Left cornered edge from top to bottom |
/ | (1) From bottom, curving at the top toward the left, then descending while crossing to the right half and possibly to the left again |
(ra) | (1) Stroke as in ²⟨r⟩, but with the end extending to the descender line (2) Stroke intersecting the second part of stroke 1 |
(ro) | (1a) The stem of the ²⟨r⟩-stroke, from bottom to top (1b) A ²⟨v⟩-stroke from right to left |
Glyph | 1 | 2 | 3 | 4 | 5 | 6 |
---|---|---|---|---|---|---|
c | 1 | |||||
e | 1 | |||||
n | 1 | 1′ | ||||
ŋ | 1 | |||||
v | 1 | |||||
o | 1 | |||||
s | 1 | |||||
þ | 1 2 | |||||
š | 1 | |||||
r | 1 | 1a′ 1b | ||||
l | 1 2 | 1a′ 1b 2 | ||||
ł | 1 2 | |||||
m | 1 2 | 2′ 1 | 1 2′ | |||
a | 1 2 | 2 1′ | 1′ 2 | |||
f | 1 2 | |||||
g | 1 | |||||
p | 1 | |||||
t | 1 2 | 1a+2 1b | ||||
č | 1 2 | |||||
î | 1 | |||||
j | 1 | |||||
i | 1 | 1′ | ||||
d | 1 2 | 2 1′ | 1′ 2 | |||
ð | 1 2 3 | 3 1 2 | ||||
h | 1 | |||||
ħ | 1 | |||||
ê | 1 | |||||
ô | 1 | |||||
â | 1 | |||||
u | 1 2 3 | |||||
w | 1 | |||||
x | 1 2 | |||||
y | 1 | |||||
z | 1 | |||||
c$ | 1 | |||||
ŋ$ | 1 2 | 1 2′ | ||||
ee | 1 2 | 2 1 | ||||
em | 1 2 | 2 1 | ||||
me | 1 2 3 | 2′ 1 3 | 1 2′ 3 | 3 1 2 | 3 2′ 1 | 3 1 2′ |
mm | 1 2 3 | 2′ 1 3 | 1 2′ 3 | 3 1 2 | 3 2′ 1 | 3 1 2′ |
jâ | 1 2 | |||||
âj | 1 2 | |||||
ww | 1 2 | |||||
xx | 1 2 3 4 | |||||
yy | 1 2 | |||||
zz | 1 2 | |||||
# | 1 | |||||
+ | 1 | |||||
+* | 1 2 3 4 | |||||
@ | 1 2 | |||||
* | 1 2 3 4 | |||||
& | 1 2 | |||||
. | 1 2 | |||||
; | 1 2 | |||||
? | 1 2 | |||||
! | 1 2 | |||||
{ | 1 | |||||
} | 1 | |||||
« | 1 | |||||
» | 1 2 | 1+2′ | ||||
/ | 1 | |||||
(ra) | 1 2 | |||||
(ro) | 1 | 1a′ 1b | ||||
Glyph | Start join | End join | Description | Use |
---|---|---|---|---|
c1α | M | — | Default | Default |
e1α | Mv | D | Default | Default |
e1β | Bv | D | Stem shortened to start at base | After glyphs that end at the base |
n1α | — | — | Default | Default |
n2α | B | M | Default | Before glyphs that start at the mid |
ŋ1α | M | Dv | Default | Default |
v1α | B | B | Default | Default |
o1α | Tv | M | Default | Default |
o1β | M | M | Loop on stroke to allow for mid ligation with previous glyph | After glyphs that end at the mid |
s1α | M | B | Default | Default |
þ1α | B | M | Default | Default |
þ1β | B | M | Strokes 1 and 2 connected | Stylistic |
š1α | M | Bv | Default | Default |
r1α | Dv | B | Default | Default |
r2α | Mv | B | Default | Rare (β form is more common), but sometimes after glyphs that end at the mid |
r2β | Bv | B | Stroke 1 disconnected from 2 (starts at base instead) | After glyphs that end at the base |
l1α | Dv | M | Default | Default |
l1β | Dv | M | Strokes 1 and 2 connected | Stylistic |
l2α | Mv | M | Default | Rare (β form is more common), but sometimes after glyphs that end at the mid |
l2β | Bv | M | Stroke 1 disconnected from 2 (starts at base instead) | After glyphs that end at the base |
l2γ | Mv | M | Strokes 2 and 3 connected | Stylization of α; rare (δ form is more common) |
l2δ | Bv | M | Stroke 1 disconnected from 2 (starts at base instead), and strokes 2 and 3 connected | Stylization of β |
ł1α | Tv | BD | Default | Default |
ł1β | Tv | BD | Strokes 1 and 2 connected | Stylistic |
m1α | Mv | — | Default | Default |
m2α | — | D | Default | Rare; β form is more common |
m2β | — | D | Strokes 1 and 2 connected | Stylistic |
m3α | Mv | — | Default | Rare; β form is more common |
m3β | Mv | — | Strokes 1 and 2 connected | Stylistic |
a1α | — | D | Default | Default |
a1β | M | D | Strokes 1 and 2 fused, with 2 beginning where 1 ends (without a loop) | Stylistic (‘italic’ variant) |
a1γ | — | D | Strokes 1 and 2 connected (with a loop) | Stylistic |
a2α | M | M | Default | After glyphs that end at the mid |
a2β | M | M | Strokes 1 and 2 connected (rare) | Stylistic |
a3α | B | D | Default | After glyphs that end at the base |
a3β | B | D | Strokes 1 and 2 connected | Stylistic |
f1α | M | B | Default | Default |
f1β | M | B | Strokes 1 and 2 connected | Stylistic |
g1α | M | Dv | Default | Default |
p1α | B | Dv | Default | Default |
t1α | B | — | Default | Default |
t2α | B | B | Default | Stylistic |
č1α | T | B | Default | Default |
î1α | B | M | Default | Default |
j1α | M | D | Default | Default |
i1α | Tv | Bv | Default | Default |
i1β | M | Bv | Loop on stroke to allow for mid ligation with previous glyph | After glyphs that end at the mid |
i2α | B | T | Default | After glyphs that end at the base |
d1α | — | B | Default | Default |
d2α | M | M | Default | After glyphs that end at the mid |
d3α | B | B | Default | After glyphs that end at the base |
ð1α | B | — | Default | Default |
ð1β | B | — | Strokes 1 and 2 connected | Stylistic |
ð1γ | B | — | Strokes 2 and 3 connected | Stylistic |
ð1δ | B | — | Strokes 1, 2, and 3 connected | Stylistic |
ð2α | M | M | Default | After glyphs that end at the mid, or as a stylization |
ð2β | M | M | Strokes 2 and 3 connected | Stylistic |
h1α | M | M | Default | Default |
ħ1α | — | — | Default | Default |
ê1α | M | D | Default | Default |
ê1β | M | — | Stroke bends to the right at the end, preventing linkage with the next glyph | Stylistic |
ô1α | M | D | Default | Default |
â1α | D | M | Default | Default |
u1α | Tv | DB | Default | Default |
u1β | M | DB | Loop on stroke 1 to allow for mid ligation with previous glyph | After glyphs that end at the mid |
w1α | M | Dv | Default | Default |
x1α | M | M | Default | Default |
y1α | B | B | Default | Default |
z1α | B | B | Default | Default |
c$1α | M | D | Default (in practice, final forms have no successor to ligate to) | Default |
ŋ$1α | M | DB | Default | Default |
ŋ$2α | M | — | Default | Rare; β form is more common |
ŋ$2β | M | — | Strokes 1 and 2 connected | Stylistic |
ee1α | Mv | M | Default | Default |
ee2α | M | D | Default | Sometimes after a glyph that ends at the mid |
ee2β | M | D | Strokes 1 and 2 connected (uncommon) | Stylistic |
em1α | Mv | M | Default | Default |
em2α | M | D | Default | Stylistic |
em2β | M | D | Strokes 1 and 2 connected (uncommon) | Stylistic |
me1α | Mv | M | Default | Default |
me2α | — | M | Default | Stylistic |
me2β | — | M | Strokes 1 and 2 connected | Stylistic |
me3α | Mv | M | Default | Stylistic |
me3β | Mv | M | Strokes 1 and 2 connected | Stylistic |
me3γ | — | M | Strokes 2 and 3 connected | Stylistic |
me3δ | — | M | Strokes 1, 2, and 3 connected | Stylistic |
me4α | M | D | Default | Sometimes after a glyph that ends at the mid |
me4β | M | D | Strokes 1 and 2 connected | Stylistic |
me5α | M | D | Default | Sometimes after a glyph that ends at the mid |
me5β | M | D | Strokes 1 and 2 connected | Stylistic |
me5γ | M | D | Strokes 2 and 3 connected | Stylistic |
me5δ | M | D | Strokes 1, 2, and 3 connected | Stylistic |
me6α | M | — | Default | Sometimes after a glyph that ends at the mid |
me6β | M | — | Strokes 1 and 2 connected | Stylistic |
me6γ | M | — | Strokes 2 and 3 connected | Stylistic |
me6δ | M | — | Strokes 1, 2, and 3 connected | Stylistic |
mm1α | Mv | M | Default | Default |
mm2α | — | M | Default | Stylistic |
mm2β | — | M | Strokes 1 and 2 connected | Stylistic |
mm3α | Mv | M | Default | Stylistic |
mm3β | Mv | M | Strokes 1 and 2 connected | Stylistic |
mm3γ | — | M | Strokes 2 and 3 connected | Stylistic |
mm3δ | — | M | Strokes 1, 2, and 3 connected | Stylistic |
mm4α | M | D | Default | Sometimes after a glyph that ends at the mid |
mm4β | M | D | Strokes 1 and 2 connected | Stylistic |
mm5α | M | D | Default | Sometimes after a glyph that ends at the mid |
mm5β | M | D | Strokes 1 and 2 connected | Stylistic |
mm5γ | M | D | Strokes 2 and 3 connected | Stylistic |
mm5δ | M | D | Strokes 1, 2, and 3 connected | Stylistic |
mm6α | M | — | Default | Sometimes after a glyph that ends at the mid |
mm6β | M | — | Strokes 1 and 2 connected | Stylistic |
mm6γ | M | — | Strokes 2 and 3 connected | Stylistic |
mm6δ | M | — | Strokes 1, 2, and 3 connected | Stylistic |
jâ1α | M | M | Default | Default |
âj1α | D | D | Default | Default |
ww1α | M | — | Default | Default |
xx1α | M | D | Default | Default |
yy1α | B | M | Default | Default |
zz1α | B | — | Default | Default |
#1α | — | M | Default | Default |
+1α | — | M | Default | Default |
+*1α | — | — | Default | Default |
@1α | Tv | M | Default | Default |
@1β | M | M | Loop on stroke 1 to allow for mid ligation with previous glyph | After a glyph that ends at the mid |
*1α | — | M | Default | Default |
&1α | — | — | Default | Default |
.1α | MT | — | Default | Default |
;1α | B | — | Default | Default |
?1α | MT | — | Default | Default |
!1α | M | — | Default | Default |
{1α | T | Tv | Default | Default |
}1α | Bv | B | Default | Default |
«1α | — | — | Default | Default |
»1α | — | — | Default | Default |
»2α | — | — | Default | Stylistic (handwriting variant) |
/1α | — | — | Default | Default |
ra1α | Dv | — | Default | Default |
ro1α | Dv | M | Default | Default |
ro2α | Mv | M | Default | Rare (β form is more common), but sometimes after glyphs that end at the mid |
ro2β | Bv | M | Stroke 1 disconnected from 2 (starts at base instead) | After glyphs that end at the base |
Table 9 lists all topological variants with their possible join positions on each side, with B for base, M for mid (or mean), T for top (ascender line), and D for descender. If more than one position is listed, then any one of them can be used. A v suffix on a position indicates that the end of the stroke on that side is vertical.
In general, for two topological variants a and b to ligate to each other (in that order), there must exist a position C such that a can join at C endward and b can join at C startward, with at least one end not being vertical.
There are a few exceptions to this rule: any topological variant of ²⟨l⟩ can be ligated before ³⟨i2α ⟩ (see Figure 4 for an example).
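The ligation rule and its exception can be sketched in Python as follows. This is an illustration under stated assumptions rather than a definitive implementation: `JOINS` holds only a handful of rows copied from Table 9, the helper names are hypothetical, and the rule is read as requiring that, at the shared position, at least one of the two stroke ends is not vertical.

```python
import re

# Excerpt of Table 9: name -> (start join, end join). "—" means no join.
JOINS = {
    "c1α": ("M", "—"),
    "e1α": ("Mv", "D"),
    "ŋ1α": ("M", "Dv"),
    "o1α": ("Tv", "M"),
    "r1α": ("Dv", "B"),
    "l1α": ("Dv", "M"),
    "a2α": ("M", "M"),
    "i2α": ("B", "T"),
}


def parse_join(spec):
    """Map each listed position (B, M, T, D) to whether its stroke end is vertical."""
    return {pos: v == "v" for pos, v in re.findall(r"([BMTD])(v?)", spec)}


def glyph_of(name):
    """Strip the trailing stroke-order digit and Greek letter from a variant name."""
    return re.sub(r"[1-9][α-ω]$", "", name)


def can_ligate(a, b):
    """Return True if variant a can ligate to a following variant b."""
    # Exception from the text: any variant of l may ligate before i2α.
    if glyph_of(a) == "l" and b == "i2α":
        return True
    end = parse_join(JOINS[a][1])    # where a can join endward
    start = parse_join(JOINS[b][0])  # where b can join startward
    # Need a shared position at which the two ends are not both vertical.
    return any(
        pos in start and not (vertical and start[pos])
        for pos, vertical in end.items()
    )
```

For instance, `can_ligate("e1α", "r1α")` holds because both can join at the descender and only the start of `r1α` is vertical there, whereas `can_ligate("ŋ1α", "r1α")` fails because both ends are vertical.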
Stylistic variants are much less standardized in comparison, but there are some widely recognized variants:
- Some topological variants (³⟨þ1β ⟩, ³⟨j1α ⟩, ³⟨i2α ⟩, ³⟨c$1α ⟩, ³⟨«1α ⟩) have an S variant that introduces a swash at the end of the last stroke.
- In the standard forms, ²⟨e⟩ and ²⟨m⟩ (as well as the required ligatures involving these) have the tail sloping slightly upwards (as it goes to the left). This tail might sometimes bend downwards (the C variant) or even start with a downward slope (the D variant).
- The rightward descending stem of a glyph such as ³⟨r1α ⟩ can be shortened (in the H variant) after an ²⟨e⟩ or ²⟨m⟩ to allow kerning.
²⟨’⟩ and ²⟨·⟩ are special: they can ligate with any participating glyph on either end, appearing as an extension of the stroke near the ²⟨’⟩ or ²⟨·⟩. Nonetheless, such ligation is not particularly common.
The rules for layers 3w and 4w dictate only what is legal, not what is considered beautiful. (Indeed, it is perfectly legal to use the 1α form of every glyph and abstain from all non-required ligatures.) Nor do they dictate how an eligible pair of glyphs should be ligated. There are some guidelines, however, on what is desirable:
- Avoid stroke collisions
- Minimize horizontal space
- Minimize effort to write
- Prefer to ligate when possible, but avoid doing so excessively
- Prefer to use the canonical stroke order
- Prefer to use the most common topological forms
- Vary the particular forms of each letter
Connotations associated with choices in layer-4w realization
Of course, context also plays a role in deciding how to realize text into layer 4w. First, the purpose of the writing has an influence: text meant for children or language learners will be less embellished, and header text tends to be more embellished than body text.
Another part of context is the expressive connotation that the writer wishes to communicate.
Connotation | Properties of realization |
---|---|
Elegant, refined | Increased use of ligation in general; use of ‘broken ²⟨r⟩-stroke forms’ such as ³⟨r2β ⟩ and ³⟨l2β ⟩ |
Rational | Use of the non-H stylistic variants of glyphs such as ³⟨r1α ⟩ after ²⟨e⟩ or ²⟨m⟩ rather than the H variants |
Casual, informal | Use of ³⟨a1β ⟩ |
Vertical ligation
Another desirable practice is vertical ligation, in which the strokes of two glyphs in different lines are connected. This is naturally difficult even in handwriting, let alone in type!