Orthography and phonology

The phonology and orthography of Ŋarâþ Crîþ can be divided into eight layers in two modes (writing and speaking):

Layer 0 is the underlying morphographemic representation. In this grammar, text in this layer is written in double square brackets: ⟦tanc-a⟧.
Layer 1 is the graphemic representation. This representation is subsequently exported to the spoken and written modes. Text in this layer is written with angle brackets: ⟨tanca⟩.
Layer 2w is the surface glyphic representation. This represents the sequence of Cenvos glyphs that is written, observing required ligatures and final forms. Text in this layer is written with double angle brackets: ²⟨tanca⟩; for a more interesting example, ⟨mencoc⟩ becomes ²⟨mencoc$⟩.
Layer 2w* is an intermediate layer between 2w and 3w, in which discretionary ligatures are introduced to 2w text. For instance, ²⟨#flirora⟩ can be realized as ²*⟨#fliro ra⟩.
Layer 3w is the topological representation, showing optional ligatures as well as stroke order variations. Text in this layer is written with double angle brackets: ³⟨t_1α a_1γ n_1α c_1α a_1α ⟩. More interestingly, ²⟨mencoc$⟩ could become ³⟨me_1α n_1α c_1α o_1α c$_1α ⟩.
Layer 4w is the presentational representation, adding to 3w variations in the strokes themselves and how strokes within a glyph are joined. Text in this layer is written with double angle brackets: ⁴⟨t_1α a_1γ n_1α c_1α a_1α ⟩.
Layer 2s is the phonemic representation. We use slashes for this, as usual: /tanka/.
Layer 3s is the phonetic representation, or what is pronounced. We use square brackets for this, as usual: [tʰa⁴ɲcʰa²].

The conversions from 0 to 1, 1 to 2w, and 2s to 3s are functional: each valid input corresponds to exactly one output. The conversion from 1 to 2s is almost so, except when a ⟨&⟩ is present. In the opposite direction, the conversions from 4w to 3w, from 3w to 2w*, and from 2w* to 2w are functional. Furthermore, for any conversion, it can be determined whether a given input can be converted into a given output without external information.

In addition, the conversion between 1 and 2w is bijective: valid layer-1 and layer-2w representations can be paired with each other.

Layers 0, 1, and 2w: Cenvos and its romanization

Cenvos, the native script of Ŋarâþ Crîþ, is written from right to left. This script can be analyzed on two levels: graphemes, which constitute the abstract level and glyphs, which are the characters being written. For instance, Cenvos has one grapheme romanized as ⟨c⟩ that corresponds to two different glyphs: the non-final form 𐲀𐲢 (denoted as ²⟨c⟩) and the final form 𐲀 (²⟨c$⟩). As another example, the sequence 𐲌𐲁 (⟨me⟩ = ²⟨me⟩) consists of one glyph but two graphemes.

In this grammar, we primarily use the romanization, whose symbols largely map one-to-one with Cenvos graphemes. Cenvos has four kinds of graphemes:

True letters are graphemes that represent sounds.
Markers, while considered letters, do not represent sounds. Instead, they indicate that the words affected are treated specially. They occur on the level of a word and do not actively participate in morphology.
Punctuation includes the clause-end punctuation ⟨.⟩, ⟨;⟩, ⟨?⟩, and ⟨!⟩; the clitic boundary mark ⟨’⟩; the lenition mark ⟨·⟩; the grouping brackets ⟨{}⟩; and the quotation marks ⟨«»⟩.
Digits can be used to write short numerals.

Of course, there is also the space. Layer 0 also contains the morpheme boundary, ⟦-⟧.

Cen	Name	Rom	Cen	Name	Rom	Cen	Name	Rom
True letters
𐲀𐲢	ca	c	𐲌	ma	m	𐲘	ar	h
𐲁	e	e	𐲍	a	a	𐲙	ħo	ħ
𐲂	na	n	𐲎	fa	f	𐲚	ên	ê
𐲃𐲢	ŋa	ŋ	𐲏	ga	g	𐲛	ôn	ô
𐲄	va	v	𐲐	pa	p	𐲜	ân	â
𐲅	o	o	𐲑	ta	t	𐲝	uħo	u
𐲆	sa	s	𐲒	ča	č	𐳀	cełaŋa	w
𐲇	þa	þ	𐲓	în	î	𐳁	avarte	x
𐲈	ša	š	𐲔	ja	j	𐳂	priþnos	y
𐲉	ra	r	𐲕	i	i	𐳃	telrigjon	z
𐲊	la	l	𐲖	da	d
𐲋	ła	ł	𐲗	ða	ð
Final forms and ligatures (layer 2w)
𐲀		c$	𐲌𐲁		me	𐳀𐳀		ww
𐲃		ŋ$	𐲌𐲌		mm	𐳁𐳁		xx
𐲁𐲁		ee	𐲔𐲜		jâ	𐳂𐳂		yy
𐲁𐲌		em	𐲜𐲔		âj	𐳃𐳃		zz
Markers
𐲤	carþ	#	𐲦	njor	+*	𐲨	nef	*
𐲥	tor	+	𐲧	es	@	𐲯	sen	&
Punctuation
𐲞	gen	.	𐲩	ŋos	’	𐲭	fos	«
𐲟	tja	;	𐲪	łil	·	𐲮	þos	»
𐲠	šac	?	𐲫	rin	{	𐳄	jedva	/
𐲡	cjar	!	𐲬	cin	}	𐲣	mivaf·ome	-

Table 1: The graphemes of Ŋarâþ Crîþ. (The columns are read from left to right.)

The letters ⟨w⟩, ⟨x⟩, ⟨y⟩, and ⟨z⟩ are USR letters. These are used in foreign languages written in Cenvos to represent phonemes that are not approximated by the phonology of Ŋarâþ Crîþ. Each foreign orthography is free to assign them as it pleases.

Cenvos has two graphemes that change form at the end of the word: ⟨c⟩ and ⟨ŋ⟩, as well as several ligatures. We do not distinguish these forms in the romanization.

The marker ⟨*⟩ is used for foreign words, such as loanwords and foreign names. ⟨#⟩ is used to prefix given names. ⟨+⟩ is used to prefix surnames passed by native conventions (i.e. from parent to child within the same gender); ⟨+*⟩ marks a surname passed using non-native conventions. Place names are prefixed with ⟨@⟩. ⟨#⟩, ⟨+⟩, ⟨+*⟩, and ⟨@⟩ can all be used with ⟨*⟩, in which case ⟨*⟩ occurs first. Note that ⟨+*⟩ is a single letter of its own and not a ligature.

At the start of a word, ⟨&⟩ indicates reduplication of an unspecified prefix of the rest of the word. For instance, ⟨&cên⟩ can be pronounced as if it were ⟨cêcên⟩ or ⟨cêncên⟩. (⟨&⟩ occurs after all other markers in this case.) This usage is not productive in standard Ŋarâþ Crîþ, but it appears in a few words, as well as in some idiosyncratic cases. At the middle or the end of a word, or alone, it indicates ellipsis of part or all of the word, most often to abbreviate or censor a word. Lastly, ⟨&{}⟩ is used similarly to the ellipsis in Western punctuation.

Markers can be applied to multi-word strings by surrounding the string with the delimiters ⟨{}⟩. In legal language, ⟨{}⟩ are also used around phrases to resolve ambiguities.

The sentence punctuation ⟨.⟩, ⟨?⟩, and ⟨!⟩ are used as expected. ⟨;⟩ is used to separate two independent clause phrases within the same sentence. The quotation marks, ⟨«»⟩, are used around quotations, direct or indirect. A ⟨.⟩ at the end of a quotation embedded within another sentence is omitted.

⟨’⟩ is used to separate clitics from the rest of the word to which they are attached. ⟨·⟩ indicates lenition; it could be described as a “letter modifier”. It is also used as a decimal point: officially, it is used after the most significant digit of an inexact numeral when written with digits, but it also used unofficially to write non-integers.

⟨/⟩, as its derivation from ⟨i⟩ suggests, is used to separate the number of mjari from the number of edva when writing currency amounts.

Spaces are placed in the following places:

between orthographic words, but not between a clitic and the word to which it is attached
after (but not before) ⟨.⟩, ⟨;⟩, ⟨?⟩, and ⟨!⟩
before ⟨«⟩ and after ⟨»⟩ (but not on the other sides)
around ⟨&{}⟩

[TODO: cover mentions of letters within the language, corresponding to v7 p17 “When letters or markers are referred to, … but the effects on other glyphs are not standardized”]

Digits are interchangeable with short-form numerals, but not with long-form numerals. They are also written right-to-left in Cenvos, with the most significant digit first: 𐲲𐲺𐲳 is 0x2A3 = 675.

Cen	#	Cen	#	Cen	#	Cen	#
𐲰	0	𐲱	1	𐲲	2	𐲳	3
𐲴	4	𐲵	5	𐲶	6	𐲷	7
𐲸	8	𐲹	9	𐲺	A	𐲻	B
𐲼	C	𐲽	D	𐲾	E	𐲿	F

Table 2: The digits of Ŋarâþ Crîþ. (The columns are read from left to right.)

Phonotactics

We express the phonotactic rules of Ŋarâþ Crîþ in terms of layer 0.

A manifested grapheme phrase is either a true letter not followed by a lenition marker (plain letter), any of ⟦p t d č c g m f v ð⟧ followed by a lenition mark (lenited letter), or, word-initially, one of the digraphs ⟦mp vp dt nd gc ŋg vf ðþ lł⟧ (eclipsed letter). All other graphemes are ignored for the purposes of phonotactics.

A manifested grapheme phrase has a base letter. The base letter of a plain letter is itself. The base letter of a lenited letter is the letter without the lenition mark. The base letter of an eclipsed letter is the second letter of the digraph.

A vowel is any of ⟦e o a î i ê ô â u⟧. ⟦j⟧ is a semivowel. All other manifested grapheme phrases are consonants.

An effective plosive is a manifested grapheme phrase whose base letter is any of ⟦p t d c g⟧. An effective fricative is a manifested grapheme phrase whose base letter is any of ⟦f v þ ð s š h ħ⟧.

A word consists of one or more syllables, each of which has an initial, a medial, a nucleus, and a coda. An initial consists of one of the following:

nothing at all
a single consonant
an effective plosive or fricative plus ⟦r⟧ or ⟦l⟧
any of ⟦cf cþ cs cš gv gð tf dv⟧; that is, a plosive plus a fricative of the same voicing, such that the plosive has a more retracted place of articulation than the fricative

The only valid medial, if present, is ⟦j⟧. A nucleus is a vowel.

A coda is either a simple coda or a complex coda. A simple coda is one of ⟦s r n þ rþ l t c f m⟧ or nothing at all. A complex coda is one of ⟦st lt ns ls nþ cþ⟧. While complex codas are allowed in any syllable in layer 0, instances of such codas in the middle of a syntactic word are simplified during the conversion to layer 1, and such instances immediately before a clitic boundary are simplified during the conversion to layer 2. The coda ⟦-m⟧ is used in only a few words.

In addition, ⟦h⟧ is forbidden word-initially. Doubled consonants and vowels are allowed.

If there is more than one way to split a word into syllables, the maximal-onset principle is used. However, clitic boundaries always start a new syllable.

An onset is an initial plus a medial. A bridge is the coda of one syllable plus the onset of the following syllable.

Conversion from layer 0 to layer 1

The following changes are applied as a part of morphology. They occur only when the subsequence involved in a change (that is, the substring being replaced as well as the environment that triggers the change) crosses a morpheme boundary but not a word boundary. For instance, ⟦*@vav-el⟧ becomes ⟨*@vavel⟩ instead of ⟨*@navel⟩. For clarity, however, we omit any ⟦-⟧s from the rules below. (These changes apply from left to right.)

v → n / _ V[-creaky] {v, m·}
ð → ŋ / _ V[-creaky] {ð, d·}

Here, “V[-creaky]” means any of ⟦e o a i u⟧.

The following changes are made to simplify complex codas within a syntactic word if (and only if) the consonant cluster cannot be reinterpreted to avoid the mid-word complex coda.

stšr → šr
stšl → šl
stš → sč
sts → st
st → t / _ C[+nasal]
st → s / _ C
ltšr → ltr
ltšl → ltl
ltš → lč
lts → ls
lt → t / _ C[+nasal]
lt → l / _ C
ss → þ / {n, l} _
ns → n / _ C[+obstruent]²
C[+coronal, -voiced] → ∅ / ns _
ns C[+coronal, +voiced] → nð
ns {ħ, g·} → nð
ns C[+dorsal, -voiced] → nh
ns C[+dorsal, +voiced] → ŋ / _ V
ns C[+dorsal, +voiced] → n
ns C[+labial, -voiced] → nf
ns C[+labial, +voiced] → nv
ls → l / _ C[+obstruent]²
C[+coronal, -voiced] → ∅ / ls _
ls C[+coronal, +voiced] → lð
ls {ħ, g·} → lð
ls C[+dorsal, -voiced] → lh
ls C[+dorsal, +voiced] → lħ
ls C[+labial, -voiced] → lf
ls C[+labial, +voiced] → lv
nþ → þ / _ C[+obstruent]²
nþ C[-voiced] → nþ
nþ C[+voiced] → nð
cþ → þ / _ C[+obstruent]
cþ C[+nasal] → nþ

Here, the consonant graphemes are considered to be organized in the following way based on their pronunciations (with voiceless/voiced pairs):

	Labial	Coronal	Dorsal	Other
Obstruent	p, f, p· / v, vp	t, č, þ, s, š, ł, t·, č· / d, ð, dt, d·, ðþ	c, h, c· / g, gc	/ ħ, g·
Nasal	/ m, mp	/ n, nd	/ ŋ, ŋg
Other		/ l, lł, r

Table 3: Phonetic features used for complex coda simplification.

Finally, the ⟦j⟧ is removed from any instances of ⟦ji jî ju⟧.

Letter numbering

Sometimes, an integer must be assigned to each letter. In this case, the assignment shown in the table below is used. Note that numbers are not assigned fully sequentially. Furthermore, this function is valid only for layer 1 graphemes.

Letter	Hex	Dec	Letter	Hex	Dec	Letter	Hex	Dec
True letters
c	0	0	m	20	32	h	11	17
e	1	1	a	9	9	ħ	12	18
n	2	2	f	A	10	ê	101	257
ŋ	2B	43	g	B	11	ô	104	260
v	3	3	p	C	12	â	109	265
o	4	4	t	D	13	u	13	19
s	5	5	č	DE	222	w	−1	−1
þ	55	85	î	E	14	x	−2	−2
š	5E	94	j	6E	110	y	−3	−3
r	6	6	i	F	15	z	−4	−4
l	7	7	d	10	16
ł	77	119	ð	155	341
Markers
#	14	20	+*	16	22	*	19	25
+	15	21	@	17	23	&	1A	26

Table 4: Letter numbering in Ŋarâþ Crîþ. (The columns are read from left to right.)

The letter sum of a word is the sum of all of its letters. This value is used in some of the noun declension paradigms.

It is theorized that letter numbers were assigned in the following manner:

The basic true letters inherited from Necarasso Cryssesa (i.e. those corresponding to ⟨c e n v o s r l m a f g p t î i d h⟩) received sequential numbers from zero. The number of ⟨m⟩ was changed due to superstitions against the number eight.
⟨ŋ þ š ł č ð⟩ received numbers based on what letter pairs (or triplets in the case of ⟨ð⟩) they were based on.
⟨ê⟩, ⟨ô⟩, and ⟨â⟩ were numbered as 256 + base glyph number.
The other letters and the markers received sequential numbers after ⟨h⟩, skipping 0x18.

Collation

The true letters and the markers are collated in their respective order, except for ⟨&⟩, which is ignored. Lenited letters are treated as their respective base letters, except when two words differ only by the presence or absence of a lenition mark, in which case the lenited variant is collated after the base letter: ⟨saga⟩ < ⟨sag·a⟩ < ⟨sada⟩ < ⟨saħa⟩. Numerals are collated after all letters.

In a directory of personal names, entries are collated on surnames, with given names considered only when surnames are identical. Headings in such a list include the prefix up to an including the first true letter: ⟨+merlan #flirora⟩ would be found under ⟨+m⟩.

Ordered items can be labeled using numerals (starting from 0) or letters. In the latter case, only the letters ⟨c e n v o s r l m a f g p t î i d h⟩ are used.

Numquotes

A digit immediately preceding text surrounded by quotation or grouping marks constitutes a numquote. The digit is usually not pronounced in this case. Numquotes are mainly used for secondary purposes that lack any dedicated punctuation.

Numquote	Meaning
B{}	Contains parenthetical information: provides supplementary information. The sentence should still make sense without the parenthetical content.
1{}	Lists an alias of a referent mentioned by name.
2{}	Surrounds a key-value list. Used as such: ⟨2{3{&{}} 4{&{}} 3{&{}} 4{&{}}}⟩
3{}	Used for listing a key inside ⟨2{}⟩.
4{}	Used for listing a value inside ⟨2{}⟩. When not directly inside a ⟨2{}⟩ numquote, marks a list: elements are delimited by spaces, and ⟨{}⟩ can be used to insert multi-word elements.
9{}	Used to contain abbreviated quantities in the traditional currency system.
*9{}	Used to contain abbreviated quantities in a currency system other than the traditional one.

Table 5: Numquotes in Ŋarâþ Crîþ.

Layer 2s

Before the rest of the conversion to layer 2, the complex coda-simplifying changes are performed to simplify such complex codas before clitic boundaries or at the end of a word. (That is, any occurrences of ⟨’⟩ are ignored this time.)

Traditionally, only manifested grapheme phrases are considered to be significant in the conversion from layer 1 to layer 2s. However, other graphemes such as punctuation can affect prosody.

MGPs	IPA	MGPs	IPA
c	k	p	p
e	e	t	t
n nd	n	č	t͡ʂ
ŋ ŋg	ŋ	î	ì
v m· vp	v	j	j
o	o	i	i
s	s	d dt	d
þ t·	θ	ð d· ðþ	ð
š č·	ʂ	h c·	x
r	ɹ	ħ g·	ʕ
l lł	l	ê	è
ł	ɬ	ô	ò
m mp	m	â	à
a	a	u	u̜
f p·	f	f· v· ð·	∅
g gc	ɡ

Table 6: Layer 1 to layer 2s conversions.

Layer 2 has a two-way tone contrast between vowels: the high tone (H) is the default, being contrasted with the low tone (L). For historical reasons, the presence or absence of a low tone on a vowel is called [±creaky].

Layer 3s

The conversion from layer 2s to layer 3s is comparatively more complex.

First, the following changes are made:

kθ → x͡θ
ʕ → ħ / V[+creaky] _
n → m / _ C[+labial]
n → ɱ / _ C[+labiodental]
n → n̪ / _ C[+dental]
n → ɳ / _ C[+retroflex]
n C₁[+velar] → ɲ C₁[+palatal]
n → ŋ / _ C[+lateral] V[+front]
sʂ → ʂː
C₁={ɹ, ɬ} → w / C₁V _
l → ɾ / V[+back] _ V
θ → θ̠ / s_, _s
ʂj → ʃ
ʂ → ʃ / _ i
t͡ʂj → t͡ʃ
t͡ʂ → t͡ʃ / _ i
C₁[+voiced] → C₁[-voiced, -aspirated] / C₂[-voiced]

Plosives in a coda are unreleased. All unvoiced plosives and affricates outside of a coda are aspirated.

While Ŋarâþ Crîþ has two tone levels phonemically, their realizations in the phonetic level is more complex. It is common to describe phonetic tone using seven levels, from 0 (the lowest) to 6 (the highest). Each syllable has one or more tones.

In order to describe tone, we must introduce the concept of “stress”, which is placed according to the following rules:

Syllables with a high tone have a priority over syllables with a low tone – that is, a syllable with a low tone will be selected only if the word in question has only low-tone syllables.
If the coda of the final syllable is either empty, or it consits of only [s] or [n], then the syllables are chosen in the order 2nd-to-last → 3rd-to-last → last → 4th-to-last → … → first.
If the coda of the final syllable is a complex coda, then the syllables are chosen in the order last → 3rd-to-last → 2nd-to-last → 4th-to-last → … → first.
If the coda is anything else, then the syllables are chosen from end to start: last → 2nd-to-last → 3rd-to-last → … → first.
Monosyllabic function words generally lack any stressed syllable.

We also introduce the concept of a tone accounting unit (TAU), which is the level at which tones are realized. That is, the tone of a syllable depends only on the contents of the TAU in which it lies. Instances of content words occupy different TAUs from each other, but some function words occupy the same TAU as the preceding or following word (in particular, such words have no stressed syllable and are confined to a relatively fixed position):

Head particles, nominalized verb particles, and monosyllabic determiners occupy the same TAU as the following word.
⟨so⟩, monosyllabic relationals ... occupy the same TAU as the preceding word.

(Stress is accounted by orthographic word, not by TAU.)

First, two adjacent vowels are fused into a diphthong if the vowels are not identical, the first vowel is stressed, the second vowel is [i] or [u̜], and the syllable to which the second vowel belongs can be interpreted as having an empty coda. For purposes of tonekeeping, a diphthong is considered to be composed of two different syllables.

In general, unstressed H and L syllables have tone levels 4 and 2, respectively; stressed H and L syllables have tone levels 5 and 1. However, an open H or L syllable before a stressed syllable gets level 3 or 1, respectively, instead. Diphthongs get different values: 65 for HH, 53 for HL, 13 for LH, and 21 for LL.

If two adjacent copies of an identical vowel have the same tone level at this stage, then the one closer to the stressed syllable rises by one tone level and the one farther from it falls by one level.

A tone level of n is then changed into a tone contour in the following situations, unless doing so would result in an out-of-bounds tone level:

n to (n : n + 1): when the coda is [st] or [x͡θ]
n to (n : n − 1): when the coda is [rθ] or [ns]
n to (n + 1 : n): when the nucleus is preceded by two or more voiceless consonants

In addition, other syllables change their tone levels:

Raise the tone level by 1 (if it is not already 6) if the coda is a voiceless fricative, or if the coda is [x͡θ].
Lower the tone level by 1 if the coda is [ɹ].
Lower the tone level by 1 if the coda is a nasal followed by a voiced obstruent or nasal.

Finally, if all tones have a level of 4 or higher, then the lowest tone (breaking ties by preferring later tones) is lowered to 3, and all other tones in the same syllable are lowered by the same amount. All level-3 tones are then lowered to level 2.

Isochrony

The isochrony of Ŋarâþ Crîþ falls somewhere between syllable and mora timing, where:

The body of a syllable is always 1 unit long.
The coda of a syllable is between 0 and 1 unit long, with the hierarchy /t, k < n < l, ɹ < f, s, θ, ɹθ, kθ < st, lt, ns, ls, nθ/.
Codas are shortened after two consecutive vowels: for instance, the ⟨l⟩ in ⟨moriel⟩ is pronounced for less time than that in ⟨mjarel⟩.

Mutations

Ŋarâþ Crîþ has two kinds of initial mutations: lenition and eclipsis. Neither kind of mutation has any effect on plosive-fricative onsets or any of ⟦r l n ŋ ħ⟧.

Lenition tends to turn plosives into fricatives and is indicated with a middle dot ⟦·⟧ after the consonant affected. In particular, it affects ⟦p t d č c g m f v ð⟧. (See Layer 2 for pronunciation details.) Partial lenition does not affect any of ⟦f v ð⟧; that is, it does not lenite consonants that would become silent. Unless otherwise qualified, lenition refers to total lenition, which affects ⟦f v ð⟧.

In a word containing ⟦&⟧, both instances of the reduplicated prefix are lenited. For example, ⟨&d·enfo⟩ can be pronounced as [ðeðenfo] but not as *[ðedenfo].

Lenition occurs in the following environments:

On the stem in abessive forms of nouns in paradigms 7, 8, 9, 10, and 13
On a noun modified by ⟨šinen⟩ or ⟨nemen⟩ when used as determiners, if that noun is not a form of ⟨ðên⟩
Partially, on a noun modified by ⟨ruf⟩ not immediately following it
Partially, on a noun modified by ⟨mê⟩ immediately preceding it
On a terrestrial noun modified by a participle-form verb belonging to a Type I genus
To a dative-case nominalized verb phrase as explained in Nominalized forms
Partially, on a verb when receiving the comparative prefixes ⟦mir-⟧ or ⟦ła-⟧
On a classifier attached to the numeral ⟨ces⟩ or any numeral ending in ⟨ħas⟩ or ⟨sreþas⟩
On the second item of a compound noun, if it is neither terrestrial nor a form of ⟨vês⟩
On a verb with the cessative prefix ⟦car-⟧ or the terminative prefix ⟦er-⟧

Eclipsis tends to add voice to voiceless consonants and change voiced stops into nasals. It is indicated by prefixing a consonant: ⟦t d c g f þ ł⟧ become ⟦dt nd gc ŋg vf ðþ lł⟧, respectively. ⟦p⟧ becomes ⟦vp⟧ before any of ⟦i e u î ê⟧ and ⟦mp⟧ elsewhere. If a word starts with a vowel, then it is eclipsed by prefixing ⟦g⟧.

In a word containing ⟦&⟧, only the first instance of the reduplicated prefix is eclipsed. For example, ⟨n&denfin⟩ can be pronounced as [nedenfin] but not as *[nenenfin].

Eclipsis occurs in the following environments:

On the genitive dual, plural, and singulative forms of nouns
On a noun modified by ⟨lê⟩ or ⟨tê⟩ immediately preceding it
On a noun modified by ⟨dân⟩
On a finite form of a verb or relational with perfective aspect
To a locative, instrumental, or abessive-case nominalized verb phrase that is not an object of a modifying relational, as explained in Nominalized forms
On a short numeral modified by ⟨ceþe⟩

Lenition can happen on any syllabic onset of a word, but eclipsis is limited to word-initial positions.

In this documentation, lenition is sometimes marked with an empty circle ○, and eclipsis with an filled circle ●. Partial lenition is marked with an empty triangle △.

Loanwords

Almost all loanwords in Ŋarâþ Crîþ are nouns. [TODO: we are reworking nouns]

Generally, when borrowing from languages that use the Cenvos script or a script related to it, and whose orthographies in the script in question do not deviate too far from Ŋarâþ Crîþ usage, Ŋarâþ Crîþ prefers to borrow the word graphemically than phonemically.

The typography of Ŋarâþ Crîþ

In principle, layer 2w is the highest written layer needed to write in Ŋarâþ Crîþ. (Note that there is only one valid layer-2w representation for each layer-1 string; in other words, changing a valid layer-2w string in a way that preserves the layer-1 representation always results in an invalid layer-2w string.) However, speakers of Ŋarâþ Crîþ tend to value aesthetics, even in writing. Thus, a mastery of handwriting beyond layer 2w is considered crucial.

Even though movable type has been available for a long time, prominent parts of printed materials (such as titles) often continued to use plates engraved from handwriting. Eventually, typography and calligraphy were considered parts of the same discipline, leading to typefaces supporting more features from the latter. Even today, logos often opt for lettering over typefaces. Because of this unification, we use the term typography to refer to the discipline of laying out writing in general.

Although a full treatment of Ŋarâþ Crîþ typography is out of scope for this grammar, this section gives an overview of the concerns at hand.

Kerning

Cenvos is a script that absolutely requires kerning. To start, some glyphs such as ²⟨e⟩ and ²⟨m⟩ have long leftward tails that necessitate kerning with glyphs such as ²⟨s⟩ or ²⟨o⟩, which lack descenders, or even some glyphs with descenders such as ²⟨j⟩.

Other glyphs such as ²⟨j⟩ and ²⟨ê⟩ have shorter leftward descenders that also require kerning with following glyphs.

²⟨â⟩ has a descender in the opposite direction; thus, it must kern with certain preceding glyphs.

Diagonal strokes with matching slopes (such as in ²⟨âv⟩ or ²⟨rj⟩) should be kerned to bring them closer.

Examples of glyph pairs that require kerning. — Figure 1: Examples of glyph pairs that require kerning: ²⟨es⟩, ²⟨mj⟩, ²⟨jo⟩, ²⟨ên⟩, ²⟨câ⟩, and ²⟨âv⟩.

Moreover, even pairs are sometimes insufficient. Since ²⟨e⟩ and ²⟨i⟩ are kerned so closely, ²⟨ei⟩ must itself kern with glyphs such as ²⟨s⟩.

Kerning of eis and eig. — Figure 2: Kerning of ²⟨eis⟩ and ²⟨eig⟩. In ²⟨eis⟩, ²⟨ei⟩ has room to kern with ²⟨s⟩. ²⟨ei⟩ obviously cannot kern with ²⟨g⟩; that is, in ²⟨eig⟩, ²⟨i⟩ and ²⟨g⟩ are spaced *farther apart* than usual.

Ligation and shaping

Another important aspect of typography is the use of ligatures (beyond the required ones). The concepts of higher written layers and the hierarchy of graphic variations have been developed to try to formalize this problem.

To explain the idea behind this model, we note that a good ligature will have the end of one glyph near the start of the next. The starting and ending points of a glyph, in turn, depend on the order in which the strokes are written.

Furthermore, natural handwriting tends to join certain strokes together. In some cases, this joining can affect how a glyph ligates; for instance, ³⟨a_1α ⟩ cannot ligate with the previous character (ligating through the middle would cause a stroke collision with stroke 2 of ³⟨a_1α ⟩), but ³⟨a_1β ⟩, in which the two strokes are joined without a loop, can do so.

In addition, rapid handwriting often produces stylistic variations of glyphs. For example, ³⟨i_2α ⟩ (“²⟨i⟩ with the stroke going upward”) can often end in a leftward swash at the end of the stroke. Since this deviation does not create any ambiguity, it has been accepted, yielding the stylistic variant ⁴⟨i_2α^S⟩.

The ideas behind ligation. — Figure 3: (a) An example of a bad ligature, in which the first glyph ends at the baseline and the second glyph starts at the top line. In the next example, the second glyph starts at the baseline as well, avoiding an awkward joining point. (b) A difference in stroke order (shown with the glyph ²⟨a⟩) can change the starting points (shown as blue dots) and the ending points (shown as red dots) of a glyph. (³⟨`a_1α` ⟩ does not have a starting point suitable for ligation.) (c) The first stroke of ³⟨`a_1α` ⟩ blocks ligation from a previous glyph, but such a stroke is absent in ³⟨`a_1β` ⟩. (d) The default variant ⁴⟨`i_2α` ⟩ in comparison to ⁴⟨`i_2α^S`⟩ (both ligated after ⁴⟨`f_1α` ⟩).

We now cover the formalism itself. Layers 2w*, 3w, and 4w are aesthetic layers; the writer decides the precise sequence of glyphs to realize a layer-2w string in higher layers. Nonetheless, not all layer-3w or -4w strings are valid, even those that correspond to valid layer-2w strings; for instance, ³⟨s₁i₁⟩ is not a valid realization of ²⟨si⟩ because it requires a base-to-top ligation.

Only some glyphs participate in typesetting. Notably, all letters participate, but no numerals do so, nor does the space.

Each participating layer-2w* glyph has a hierarchy of variations as follows:

At the top level is the layer-2w* glyph itself.
These are divided into stroke-order variants, which differ only in stroke order. All strokes must be preserved, and no loops may be introduced or removed, but the relative stroke order might be different, and some strokes may be written in the reverse direction; furthermore, a stroke may be split at a turn, and two strokes may be joined where one ends and another begins. These are denoted with subscript numerals: ²⟨a⟩ has variants ³⟨a₁⟩, ³⟨a₂⟩, and ³⟨a₃⟩. Variant 1 is considered the ‘canonical’ variant.
Each stroke-order variant has one or more topological variants, which may join strokes together, cause two different strokes to touch each other when they did not (or vice versa), or introduce or remove loops. Lengthening or shortening strokes to alter ligation properties also falls under this level. Topological variants are distinguished using lowercase Greek letters. For instance, ³⟨a₁⟩ has three topological variants: ³⟨a_1α ⟩, ³⟨a_1β ⟩, ³⟨a_1γ ⟩. α is reserved for the canonical variant, which preserves all strokes, although it is not always the most common variant.
Each topological variant has one or more stylistic variants, which can modify the strokes of the glyph themselves. For instance, ⁴⟨i_2α ⟩ is the topological variant of ²⟨i⟩ in which the stroke goes from the base to the top. It has two stylistic variants: ⁴⟨i_2α ⟩ is the default one, and ⁴⟨i_2α^S⟩ has a swash to the left at the top of the stroke. Note that the ‘canonical’ stylistic variant has no superscript letter, while the other variants do.

Layer 2w is transliterated using mostly the same symbols as the layer-1 romanization, but required ligatures are notated with an overline (such as in ²⟨me⟩ for 𐲌𐲁), and final forms are written as if they were ligatures with a special $ symbol: ²⟨c$⟩ for 𐲀. Layer 2w* introduces discretionary ligatures, which are similarly marked in our notation. By discretionary ligature, we mean a ligature that the writer may choose to use but is not obligated to do so, and that cannot be derived by simply connecting the ending stroke of one glyph to the starting stroke of another.

Layer 3w works on topological variants. The overline denotes optional ligatures between topological variants; it is now omitted for required and discretionary ligatures, which are their own layer-2w* glyphs in their own right: ³⟨+_1α me_1α r_1α l_2β a_1α n_1α #_1α f_1α l_2δ i_1β r_1α o_1α r_2α a_3β ⟩ transliterates a particularly fancy realization of ⟨+merlan #flirora⟩.

#merlan +flirora — Figure 4: What ³⟨`+_1α` `me_1α` `r_1α` `l_2β` `a_1α` `n_1α` `#_1α` `f_1α` `l_2δ` `i_1β` `r_1α` `o_1α` `r_2α` `a_3β` ⟩ would look like.

Layer 4w works on stylistic variants. In the transliteration, the overline is used as in 3w.

Layer 3w can be thought of as the ‘ligation layer’; similarly, layer 4w can be thought of as the ‘shaping layer’.

Table 7 describes the canonical stroke order of each glyph, and Table 8 lists the stroke-order variants.

Glyph	Stroke order
c	(1) Counterclockwise
e	(1) From top right to bottom left
n	(1) From top left to bottom right
ŋ	(1) From top right to bottom
v	(1) From right to left
o	(1) From top to bottom left
s	(1) From top right to bottom left
þ	(1) Rightmost stroke from right to left (2) Leftmost stroke from right to left
š	(1) From top right to bottom left
r	(1a) From bottom to top (1b) to left
l	(1a) r-stroke from bottom to top (1b) to left (2) Intersecting stroke from right to left
ł	(1a) o-stroke from top to bottom (1b) to left (2) Intersecting stroke from right to left
m	(1) e-stroke from top right to bottom left (2) Intersecting stroke from right to left
a	(1) þ-sloping stroke from left to right (2) f-sloping stroke from right to left
f	(1) Rightmost stroke from right to left (2) Leftmost stroke from right to left
g	(1) From top right to bottom
p	(1) From right to bottom
t	(1a) v-stroke from right to top (1b) to left (2) Vertical stroke from top to bottom
č	(1) Ascending stroke from top to bottom (2) f-sloping stroke from right to left
î	(1) From bottom right to top left
j	(1) From top right to bottom left
i	(1) From top to bottom
d	(1) þ-sloping stroke from left to right (2) f-sloping stroke from right to left
ð	(1) Leftmost þ-sloping stroke from left to right (2) Rightmost þ-sloping stroke from left to right (3) f-sloping stroke from right to left
h	(1) From right to left
ħ	(1) Clockwise, starting and ending at the top
ê	(1) From top right to bottom left
ô	(1) From top to bottom
â	(1) From bottom right to top left
u	(1) o-stroke from top to bottom left (2) Rightmost dot (3) Leftmost dot
w	(1) From top to bottom
x	(1) Stroke with descender, starting from the top-right corner and ending on the descender (2) Wave stroke, from right to left
y	(1) From right to left
z	(1) From right to left
c$	(1) From right to bottom left
ŋ$	(1) ŋ-stroke from top right to bottom (2) Intersecting stroke from right to left
ee	(1) e-stroke from top right to bottom left (2) Overbar from right to left
em	(1) e-stroke from top right to bottom left (2) Roof from right to lef
me	(1) e-stroke from top right to bottom left (2) Intersecting stroke from right to left (3) Overbar from right to left
mm	(1) e-stroke from top right to bottom left (2) Intersecting stroke from right to left (3) Roof from right to left
jâ	(1) j-stroke from top right to bottom left (2) Ring clockwise (starting and ending point unspecified)
âj	(1) â-stroke from bottom right to top left (2) Ring clockwise (starting and ending point unspecified)
ww	(1) w-stroke, from top to bottom (2) Ring clockwise (starting and ending point unspecified)
xx	(1) Stroke with descender, starting from the top-right corner and ending on the descender (2) Wave stroke, from right to left (3) Bottom-right tick (4) Top-left tick
yy	(1) y-stroke, from right to left (2) Tick, from top to bottom
zz	(1) z-stroke, from right to left (2) Ring clockwise (starting and ending point unspecified)
#	(1) From bottom right to top left
+	(1) From top right to bottom left
+*	(1) From top right to bottom left (2) Vertical stroke from top to bottom (3) f-sloping stroke from top right to bottom left (4) þ-sloping stroke from bottom right to top left
@	(1) Vertical stroke from top to bottom (2) v-stroke from right to left
*	(1) Vertical stroke from top to bottom (2) Horizontal stroke from right to left (3) f-sloping stroke from top right to bottom left (4) þ-sloping stroke from bottom right to top left
&	(1) Sinusoid from right to left (2) Arrowhead
.	(1) Main stroke from right to left (2) Arrowhead
;	(1) Main stroke from right to left (2) Arrowhead
?	(1) Main stroke from right to left (2) Arrowhead
!	(1) Main stroke from right to left (2) Arrowhead
{	(1) From right to left
}	(1) From right to left
«	(1) From top to bottom
»	(1) Vertical stroke from top to bottom (2) Left cornered edge from top to bottom
/	(1) From bottom, curving at the top toward the left, then descending while crossing to the right half and possibly to the left again
(ra)	(1) Stroke as in ²⟨r⟩, but with the end extending to the descender line (2) Stroke intersecting the second part of stroke 1
(ro)	(1a) The stem of the ²⟨r⟩-stroke, from bottom to top (1b) A ²⟨v⟩-stroke from right to left

Table 7: Canonical stroke orders for layer-2w* glyphs. (Glyphs in parentheses are discretionary ligatures.)

Figure 5: Canonical stroke orders of layer-2w glyphs.

Figure 6: Stroke orders of discretionary ligatures.

Glyph	1	2	3	4	5	6
c	1
e	1
n	1	1′
ŋ	1
v	1
o	1
s	1
þ	1 2
š	1
r	1	1a′ 1b
l	1 2	1a′ 1b 2
ł	1 2
m	1 2	2′ 1	1 2′
a	1 2	2 1′	1′ 2
f	1 2
g	1
p	1
t	1 2	1a+2 1b
č	1 2
î	1
j	1
i	1	1′
d	1 2	2 1′	1′ 2
ð	1 2 3	3 1 2
h	1
ħ	1
ê	1
ô	1
â	1
u	1 2 3
w	1
x	1 2
y	1
z	1
c$	1
ŋ$	1 2	1 2′
ee	1 2	2 1
em	1 2	2 1
me	1 2 3	2′ 1 3	1 2′ 3	3 1 2	3 2′ 1	3 1 2′
mm	1 2 3	2′ 1 3	1 2′ 3	3 1 2	3 2′ 1	3 1 2′
jâ	1 2
âj	1 2
ww	1 2
xx	1 2 3 4
yy	1 2
zz	1 2
#	1
+	1
+*	1 2 3 4
@	1 2
*	1 2 3 4
&	1 2
.	1 2
;	1 2
?	1 2
!	1 2
{	1
}	1
«	1
»	1 2	1+2′
/	1
(ra)	1 2
(ro)	1	1a′ 1b

Table 8: Stroke order variants of glyphs, in reference to the canonical stroke order. The prime symbol denotes the reverse direction; the plus denotes a fused stroke.

Glyph	Start join	End join	Description	Use
`c_1α`	M	—	Default	Default
`e_1α`	Mv	D	Default	Default
`e_1β`	Bv	D	Stem shortened to start at base	After glyphs that end at the base
`n_1α`	—	—	Default	Default
`n_2α`	B	M	Default	Before glyphs that start at the mid
`ŋ_1α`	M	Dv	Default	Default
`v_1α`	B	B	Default	Default
`o_1α`	Tv	M	Default	Default
`o_1β`	M	M	Loop on stroke to allow for mid ligation with previous glyph	After glyphs that end at the mid
`s_1α`	M	B	Default	Default
`þ_1α`	B	M	Default	Default
`þ_1β`	B	M	Strokes 1 and 2 connected	Stylistic
`š_1α`	M	Bv	Default	Default
`r_1α`	Dv	B	Default	Default
`r_2α`	Mv	B	Default	Rare (`β` form is more common), but sometimes after glyphs that end at the mid
`r_2β`	Bv	B	Stroke 1 disconnected from 2 (starts at base instead)	After glyphs that end at the base
`l_1α`	Dv	M	Default	Default
`l_1β`	Dv	M	Strokes 1 and 2 connected	Stylistic
`l_2α`	Mv	M	Default	Rare (`β` form is more common), but sometimes after glyphs that end at the mid
`l_2β`	Bv	M	Stroke 1 disconnected from 2 (starts at base instead)	After glyphs that end at the base
`l_2γ`	Mv	M	Strokes 2 and 3 connected	Rare (`δ` form is more common), but stylization of `α`
`l_2δ`	Bv	M	Stroke 1 disconnected from 2 (starts at base instead), and strokes 2 and 3 connected	Stylization of `β`
`ł_1α`	Tv	BD	Default	Default
`ł_1β`	Tv	BD	Strokes 1 and 2 connected	Stylistic
`m_1α`	Mv	—	Default	Default
`m_2α`	—	D	Default	Rare; `β` form is more common
`m_2β`	—	D	Strokes 1 and 2 connected	Stylistic
`m_3α`	Mv	—	Default	Rare; `β` form is more common
`m_3β`	Mv	—	Strokes 1 and 2 connected	Stylistic
`a_1α`	—	D	Default	Default
`a_1β`	M	D	Strokes 1 and 2 fused, with 2 beginning where 1 ends (without a loop)	Stylistic (‘italic’ variant)
`a_1γ`	—	D	Strokes 1 and 2 connected (with a loop)	Stylistic
`a_2α`	M	M	Default	After glyphs that end at the mid
`a_2β`	M	M	Strokes 1 and 2 connected (rare)	Stylistic
`a_3α`	B	D	Default	After glyphs that end at the base
`a_3β`	B	D	Strokes 1 and 2 connected	Stylistic
`f_1α`	M	B	Default	Default
`f_1β`	M	B	Strokes 1 and 2 connected	Stylistic
`g_1α`	M	Dv	Default	Default
`p_1α`	B	Dv	Default	Default
`t_1α`	B	—	Default	Default
`t_2α`	B	B	Default	Stylistic
`č_1α`	T	B	Default	Default
`î_1α`	B	M	Default	Default
`j_1α`	M	D	Default	Default
`i_1α`	Tv	Bv	Default	Default
`i_1β`	M	Bv	Loop on stroke to allow for mid ligation with previous glyph	After glyphs that end at the mid
`i_2α`	B	T	Default	After glyphs that end at the base
`d_1α`	—	B	Default	Default
`d_2α`	M	M	Default	After glyphs that end at the mid
`d_3α`	B	B	Default	After glyphs that end at the base
`ð_1α`	B	—	Default	Default
`ð_1β`	B	—	Strokes 1 and 2 connected	Stylistic
`ð_1γ`	B	—	Strokes 2 and 3 connected	Stylistic
`ð_1δ`	B	—	Strokes 1, 2, and 3 connected	Stylistic
`ð_2α`	M	M	Default	After glyphs that end at the mid, or as a stylization
`ð_2β`	M	M	Strokes 2 and 3 connected	Stylistic
`h_1α`	M	M	Default	Default
`ħ_1α`	—	—	Default	Default
`ê_1α`	M	D	Default	Default
`ê_1β`	M	—	Stroke bends to the right at the end, preventing linkage with the next glyph	Stylistic
`ô_1α`	M	D	Default	Default
`â_1α`	D	M	Default	Default
`u_1α`	Tv	DB	Default	Default
`u_1β`	M	DB	Loop on stroke 1 to allow for mid ligation with previous glyph	After glyphs that end at the mid
`w_1α`	M	Dv	Default	Default
`x_1α`	M	M	Default	Default
`y_1α`	B	B	Default	Default
`z_1α`	B	B	Default	Default
`c$_1α`	M	D	Default (in practice, final forms have no successor to ligate to)	Default
`ŋ$_1α`	M	DB	Default	Default
`ŋ$_2α`	M	—	Default	Rare; `β` form is more common
`ŋ$_2β`	M	—	Strokes 1 and 2 connected	Stylistic
`ee_1α`	Mv	M	Default	Default
`ee_2α`	M	D	Default	Sometimes after a glyph that ends at the mid
`ee_2β`	M	D	Strokes 1 and 2 connected (uncommon)	Stylistic
`em_1α`	Mv	M	Default	Default
`em_2α`	M	D	Default	Stylistic
`em_2β`	M	D	Strokes 1 and 2 connected (uncommon)	Stylistic
`me_1α`	Mv	M	Default	Default
`me_2α`	—	M	Default	Stylistic
`me_2β`	—	M	Strokes 1 and 2 connected	Stylistic
`me_3α`	Mv	M	Default	Stylistic
`me_3β`	Mv	M	Strokes 1 and 2 connected	Stylistic
`me_3γ`	—	M	Strokes 2 and 3 connected	Stylistic
`me_3δ`	—	M	Strokes 1, 2, and 3 connected	Stylistic
`me_4α`	M	D	Default	Sometimes after a glyph that ends at the mid
`me_4β`	M	D	Strokes 1 and 2 connected	Stylistic
`me_5α`	M	D	Default	Sometimes after a glyph that ends at the mid
`me_5β`	M	D	Strokes 1 and 2 connected	Stylistic
`me_5γ`	M	D	Strokes 2 and 3 connected	Stylistic
`me_5δ`	M	D	Strokes 1, 2, and 3 connected	Stylistic
`me_6α`	M	—	Default	Sometimes after a glyph that ends at the mid
`me_6β`	M	—	Strokes 1 and 2 connected	Stylistic
`me_6γ`	M	—	Strokes 2 and 3 connected	Stylistic
`me_6δ`	M	—	Strokes 1, 2, and 3 connected	Stylistic
`mm_1α`	Mv	M	Default	Default
`mm_2α`	—	M	Default	Stylistic
`mm_2β`	—	M	Strokes 1 and 2 connected	Stylistic
`mm_3α`	Mv	M	Default	Stylistic
`mm_3β`	Mv	M	Strokes 1 and 2 connected	Stylistic
`mm_3γ`	—	M	Strokes 2 and 3 connected	Stylistic
`mm_3δ`	—	M	Strokes 1, 2, and 3 connected	Stylistic
`mm_4α`	M	D	Default	Sometimes after a glyph that ends at the mid
`mm_4β`	M	D	Strokes 1 and 2 connected	Stylistic
`mm_5α`	M	D	Default	Sometimes after a glyph that ends at the mid
`mm_5β`	M	D	Strokes 1 and 2 connected	Stylistic
`mm_5γ`	M	D	Strokes 2 and 3 connected	Stylistic
`mm_5δ`	M	D	Strokes 1, 2, and 3 connected	Stylistic
`mm_6α`	M	—	Default	Sometimes after a glyph that ends at the mid
`mm_6β`	M	—	Strokes 1 and 2 connected	Stylistic
`mm_6γ`	M	—	Strokes 2 and 3 connected	Stylistic
`mm_6δ`	M	—	Strokes 1, 2, and 3 connected	Stylistic
`jâ_1α`	M	M	Default	Default
`âj_1α`	D	D	Default	Default
`ww_1α`	M	—	Default	Default
`xx_1α`	M	D	Default	Default
`yy_1α`	B	M	Default	Default
`zz_1α`	B	—	Default	Default
`#_1α`	—	M	Default	Default
`+_1α`	—	M	Default	Default
`+*_1α`	—	—	Default	Default
`@_1α`	Tv	M	Default	Default
`@_1β`	M	M	Loop on stroke 1 to allow for mid ligation with previous glyph	After a glyph that ends at the mid
`*_1α`	—	M	Default	Default
`&_1α`	—	—	Default	Default
`._1α`	MT	—	Default	Default
`;_1α`	B	—	Default	Default
`?_1α`	MT	—	Default	Default
`!_1α`	M	—	Default	Default
`{_1α`	T	Tv	Default	Default
`}_1α`	Bv	B	Default	Default
`«_1α`	—	—	Default	Default
`»_1α`	—	—	Default	Default
`»_2α`	—	—	Default	Stylistic (handwriting variant)
`/_1α`	—	—	Default	Default
`ra_1α`	Dv	—	Default	Default
`ro_1α`	Dv	M	Default	Default
`ro_2α`	Mv	M	Default	Rare (`β` form is more common), but sometimes after glyphs that end at the mid
`ro_2β`	Bv	M	Stroke 1 disconnected from 2 (starts at base instead)	After glyphs that end at the base

Table 9: Topological variants of glyphs: ligation properties and descriptions. (Stroke numbers are in reference to the stroke-order variant, not the 2w glyph.)

Table 9 lists all topological variants with their possible join positions on each side, with B for base, M for mid (or mean), T for top (ascender line), and D for descender. If more than one position is listed, then any one of them can be used. A v suffix on a position indicates that the stroke end at the appropriate side is vertical.

In general, for two topological variants a and b to ligate to each other (in that order), there must exist a position C such that a can join at C endward and b can join at C startward, with at least one end not being vertical.

There are a few exceptions to this rule: any topological variant of ²⟨l⟩ can be ligated before ³⟨i_2α ⟩ (see Figure 4 for an example).

Stylistic variants are much less standardized in comparison, but there are some widely recognized variants:

Some topological variants (³⟨þ_1β ⟩, ³⟨j_1α ⟩, ³⟨i_2α ⟩, ³⟨c$_1α ⟩, ³⟨«_1α ⟩) have an S variant that introduces a swash at the end of the last stroke.
In the standard forms, ²⟨e⟩ and ²⟨m⟩ (as well as the required ligatures involving these) have the tail sloping slightly upwards (as it goes to the left). This tail might sometimes bend downwards (the C variant) or even start with a downward slope (the D variant).
The rightward descending stem of a glyph such as ³⟨r_1α ⟩ can be shortened (in the H variant) after an ²⟨e⟩ or ²⟨m⟩ to allow kerning.

²⟨’⟩ and ²⟨·⟩ are special: they can ligate with any participating glyph on either end, appearing as an extension of the stroke near the ²⟨’⟩ or ²⟨·⟩. Nonetheless, such ligation is not particularly common.

The rules over layers 3w and 4w dictate only what is legal, not what is considered beautiful. (Indeed, it is perfectly legal to use the 1α form of every glyph and abstain from all non-required ligatures.) Nor do they dictate how an eligible pair of glyphs should be ligated. There are some guidelines, however, on what is desirable:

Avoid stroke collisions
Minimize horizontal space
Minimize effort to write
Prefer to ligate when possible, but avoid doing so excessively
Prefer to use the canonical stroke-order
Prefer to use the most common topological forms
Vary the particular forms of each letter

Connotations associated with choices in layer-4w realization

Of course, context also plays a role in deciding how to realize text into layer 4w. First, the purpose of the writing has an influence (text meant for children or language learners will be less embellished, and header text tneds to be more embellished than body text).

Another part of context is the expressive connotation that the writer wishes to communicate.

Connotation	Properties of realization
Elegant, refined	Increased use of ligation in general; use of ‘broken ²⟨r⟩-stroke forms’ such as ³⟨`r_2β` ⟩ and ³⟨`l_2β` ⟩
Rational	Use of the non-H stylistic variants of glyphs such as ³⟨`r_1α` ⟩ after ²⟨e⟩ or ²⟨m⟩ rather than the H variants
Casual, informal	Use of ³⟨`a_1β` ⟩

Table 10: Expresive connotations associated with choices in layer-4w realization.

Vertical ligation

Another desirable practice is vertical ligation, in which the strokes of two glyphs in different lines are connected. This is naturally difficult even in handwriting, let alone in type!