Any honest description of Yapiri has to begin with a confession and a clarification. The confession: Yapiri is not entirely diacritic-free. The clarification: where it does use diacritics, it uses them for exactly the right reasons — reasons that no standalone letter could serve. Understanding this distinction is the key to understanding what Yapiri is, and what it is trying to do for Kokborok.
Let us go through it piece by piece: the vowels first, then the tone mark, then the reduplication mark. Each tells a different part of the same story.
The Abugida Problem
To understand Yapiri's vowel design, you first need to understand what it is departing from. Most scripts used historically for Kokborok — Bengali, Devanagari, and the Aima script — are abugidas. In an abugida, every consonant carries a built-in default vowel. Other vowels are not written as letters; they are written as small marks attached to the consonant — above it, below it, or beside it. The vowel is not a letter. It is a modification of a letter.
This is not a flaw in abugidas as such. For Sanskrit, the language that shaped most of the scripts of the Indian subcontinent, this system is quite elegant: Sanskrit syllables are overwhelmingly consonant-led, and vowels are often predictable from context. The abugida model fits Sanskrit well.
But Kokborok is not Sanskrit. It belongs to the Tibeto-Burman family, not the Indo-Aryan one. Its vowel system is phonemically rich and its vowels are not predictable. Writing Kokborok in an abugida imports a set of assumptions that simply do not apply — and the most visible sign of that mismatch is the treatment of vowels as secondary marks rather than first-class sounds.
Yapiri's Six Vowels — Full Letters, Every One
Yapiri has six vowel letters, covering the complete phonemic vowel inventory of Kokborok. Each is a standalone character with its own codepoint in the Unicode Private Use Area. None of them depends on a consonant to exist. None attaches to anything. Each stands alone on the line, the same height and visual weight as every consonant around it.
The sixth vowel deserves particular attention. The schwa /ə/ — the neutral, central vowel — is one of Kokborok's most characteristic sounds. It appears frequently, it is phonemically contrastive (changing the vowel changes the meaning of the word), and yet it is the sound most carelessly handled by other orthographies. Bengali script renders it imprecisely. Roman Kokborok often omits it or conflates it with other vowels. In Yapiri, it has its own dedicated letter at U+E005, equal in standing to every other vowel. The schwa is not squeezed in as an afterthought; it is a founding member of the vowel set.
"In Yapiri, you say /k/ then /i/ — you write the /k/ glyph then the /i/ glyph. The logic of the script and the logic of speech are one and the same."
— Design principle, Yapiri v1.0The practical consequence is that Yapiri is a fully phonemic alphabet for its vowels: one sound, one letter, no exceptions, no implied sounds, no cancellation marks. What you see on the page is exactly what you say aloud. This makes Yapiri genuinely easier to learn than an abugida for native Kokborok speakers — not because it is simpler, but because it is more honest about how the language actually sounds.
The Marks Yapiri Does Use
Here is where intellectual honesty matters. Yapiri is a phonemic alphabet for its core letter inventory — but the language itself has features that cannot be captured by standalone letters alone. For those features, Yapiri uses combining diacritical marks: characters that attach above a base letter via OpenType mark positioning. There are exactly two of them.
Using diacritics here is not a contradiction of Yapiri's design philosophy. The philosophy was never "no diacritics at all costs." It was "every sound deserves to be written explicitly, and no part of the language should be treated as secondary." For tone and reduplication, combining marks are precisely the right tool — because in both cases, the mark modifies a letter rather than replacing one.
The High Tone Mark — U+0301
Kokborok is a tonal language. It has two phonemic tones: a high tone and a low tone. Change the tone on a syllable and you change the word's meaning. This is not a subtle feature — it is a core grammatical property of the language, and any script that ignores it is writing Kokborok incompletely.
Yapiri marks the high tone using the combining acute accent at U+0301 — a standard Unicode combining character that floats above the letter it modifies. The low tone is left unmarked. This follows the same logical economy used in Roman Kokborok, where high tone is indicated (traditionally with the letter "h" after the vowel) and low tone is the default, unmarked state.
Why a combining mark rather than a separate letter? Because tone is not a segment — it is a property of a segment. The high tone does not replace the vowel; it sits above it, modifying it, which is exactly what combining diacritics are designed to do. A standalone letter for high tone would be phonologically wrong; it would imply a separate sound where there is only a pitch variation on an existing one. The combining mark is the honest representation.
The mark is anchored above each base letter via the high_tone anchor class defined in the font. This means it positions itself correctly and consistently above every vowel and every consonant in the script, without any manual spacing adjustment needed from the writer.
The Reduplication Mark — U+030B
Kokborok makes extensive use of reduplication — a grammatical process where a syllable or word is repeated to intensify meaning, create a new word, or mark a grammatical category. It is not a stylistic flourish; it is a structural feature of the language that appears constantly in everyday speech.
In Yapiri, reduplication is marked with the combining double acute accent at U+030B, placed above the last letter of the repeated portion. Like the tone mark, it is a zero-width combining character that attaches via the reduplication anchor class in the font. But precisely how and where it is used depends on which of two distinct scenarios is occurring — and this distinction matters.
Scenario One — Syllable Reduplication Within a Word
When a syllable repeats inside a single word, the reduplication mark appears alone — no hyphen. The mark sits above the last letter of the repeated syllable, signalling to the reader that this syllable is spoken twice even though the word is written as one unbroken unit.
The word nini (meaning "yours") is an example. The syllable ni repeats within the word. In Yapiri it is written as a continuous sequence of letters, with the mark placed above the final i of the reduplicated syllable:
There is no hyphen here because this is not two words — it is one word whose internal structure involves a reduplicated syllable. The mark alone is sufficient to tell the reader that the syllable repeats.
Scenario Two — Word-Level Reduplication
When an entire word repeats consecutively, the mark and hyphen together act as a shorthand — you write the word only once. The reduplication mark above the last letter signals that this word repeats, and the trailing hyphen confirms it is a full word-level repetition rather than a syllable repeat. The reader understands the word is spoken twice from these two signals alone. The word is never written a second time.
The word serek (meaning "silent") reduplicated as serek serek (to put weight on the word) is the canonical example from the Yapiri specification. In Yapiri, this is written as serek̋- — just once, with the mark above the final k and a hyphen after:
This is what makes the system elegant. The mark and hyphen together are not decorations on a repeated word — they replace the repetition. A reader who sees serek̋- does not need to see serek written again; the two signals are sufficient. Compare this to Scenario One: the mark without a hyphen means a syllable repeats within a single word; the mark with a hyphen means the entire word repeats consecutively. The hyphen is the decisive element that distinguishes the two cases.
The choice of a combining mark rather than a standalone character across both scenarios is deliberate and linguistically sound. Reduplication is not a new sound; it is a grammatical operation on existing sounds. The mark annotates structure rather than adding phonemic content, which is exactly what a combining diacritic is designed to do.
The Complete Picture
With all three categories in view, the design of Yapiri becomes clear as a unified system rather than a set of ad hoc decisions.
| Character type | Codepoint | Example | What it represents | Why this form |
|---|---|---|---|---|
| Vowel letters | U+E000 – U+E005 | | The six phonemic vowels of Kokborok | Vowels are full sounds — they deserve full letters |
| High tone mark | U+0301 (combining acute) | ́ | Pitch raised on the vowel it sits above | Tone is a property of a sound, not a sound itself |
| Reduplication mark (syllable) | U+030B (combining double acute) | ̋ | Syllable repeats within one word — mark above last letter of repeated syllable, no hyphen | Reduplication is a grammatical process, not a new phoneme |
| Reduplication mark + hyphen (word) | U+030B + U+002D | ̋- | Full word repeats consecutively — mark above last letter + trailing hyphen. Word is written once only; the two signals replace the second occurrence | Shorthand efficiency: mark + hyphen = word spoken twice, written once |
The pattern is consistent throughout. When something is a sound, Yapiri gives it a full standalone letter. When something is a property of a sound (like pitch) or a grammatical annotation (like reduplication), Yapiri uses a combining mark above the base letter. And when the grammar requires a word boundary to be visible — as in word-level reduplication — a standard hyphen steps in as a structural separator. Every tool is used for exactly the job it is suited for, and no more.
Why This Matters for Kokborok
The scripts that Kokborok has historically used were designed for other languages. Bengali script carries the assumptions of Sanskrit phonology. Roman script was not designed with Tibeto-Burman tone systems or agglutinative grammar in mind. When you write Kokborok in either of these scripts, you spend constant effort working around the gaps — inventing workarounds for sounds the script was not built to handle, omitting or approximating features the orthography cannot represent neatly.
Yapiri was designed from Kokborok's phonology outward. Every design decision — the six independent vowel letters, the schwa as a first-class character, the combining tone mark, the reduplication mark — traces back to a specific feature of the language that needed a home. Nothing in Yapiri is borrowed convention applied without thought. Nothing is there because it was in some predecessor script. Every mark, every letter, every anchor has a reason rooted in the sounds and structures of Kokborok itself.
That is what it means to design a script for a language rather than adapting a script to one. The difference is visible in every line of Yapiri text — in the vowels that stand tall beside their consonants, in the acute that rises above a high-toned syllable, in the double mark that signals a reduplicated form. The language, written as it actually is.
Community Thoughts
Loading