Schwa Deletion and Pronunciation Rules in Bishnupriya Manipuri

Abstract. One of the most important problems in building a pronunciation engine for Bishnupriya Manipuri is determining when the inherent vowel, often represented as schwa, should be preserved and when it should be deleted. This article examines the role of schwa in Bishnupriya Manipuri phonology, explains the difficulty of predicting its pronunciation from orthography alone, and proposes a rule-based approach for computational handling. These rules are essential for generating reliable IPA output, phoneme sequences, and diphone-based text-to-speech.

1. Introduction

In many Indo-Aryan writing systems, consonants inherently carry a vowel unless that vowel is blocked or deleted by phonological rules. In Bishnupriya Manipuri, as in related languages, this default vowel is not always pronounced exactly as written.

For computational purposes, the main challenge is:

When should the inherent vowel be pronounced?
When should it be deleted?

This problem directly affects pronunciation generation, dictionary accuracy, and speech synthesis. If schwa rules are wrong, then:

IPA transcription becomes inaccurate
phoneme segmentation becomes unstable
diphone filenames become inconsistent
TTS output sounds unnatural

2. The Inherent Vowel Problem

A consonant in Eastern Nagari script often carries a default vowel unless another vowel sign or virama is present. However, in actual speech that vowel may not always surface.

Example:
কথা → kɔtʰa

Here the vowels are clearly pronounced.

Example:
অগ্নি → ɔgni

In this case, the orthography suggests a fuller syllabic shape, but actual pronunciation contains a cluster and no extra schwa between g and n.

Therefore, the written form alone does not always reveal the spoken structure.

3. Why Schwa Rules Matter in TTS

Suppose a word is transcribed incorrectly because a schwa is inserted where it should not be. Then the entire downstream system changes:

Wrong IPA
    ↓
Wrong phonemes
    ↓
Wrong diphones
    ↓
Missing audio files
    ↓
Broken TTS playback

Correct path:
অক্ষর → ɔkʰʃɔr

Incorrect path:
অক্ষর → ɔkʰɔʃɔr

The second form creates extra vowels and therefore extra diphones that do not match the recorded library.

4. Types of Schwa Behavior

For practical implementation, schwa behavior can be divided into several categories.

4.1 Schwa preserved

In many simple open syllables, the inherent vowel remains audible.

কথা → kɔtʰa
কর → kɔr
বন → bɔn

4.2 Schwa deleted inside consonant clusters

When consonants form a phonological cluster, the schwa may disappear.

অগ্নি → ɔgni
অগ্র → ɔgrɔ / ɔgr
অঙ্ক → ɔŋkɔ / ɔŋk

4.3 Final schwa deletion

One of the hardest problems is final position. Some words lose final schwa, while others retain it.

Examples of variable final behavior:
অর্থ
শত
অন্তত
অবগত

4.4 Lexical exceptions

Some words must be stored as exceptions because orthographic rules alone are insufficient.

Words may look similar in spelling but behave differently in actual pronunciation.

5. Final Schwa Rules

The final schwa is especially important in Bishnupriya Manipuri computational processing. A word-final consonant may:

retain an audible vowel
lose the vowel completely
behave differently depending on morphology or lexical history

A rule-based converter therefore needs explicit final-schwa logic.

General principle

A final consonant does not always imply a pronounced final vowel.

In implementation, final-schwa handling should be treated as a dedicated stage, not as a side effect of basic letter-to-sound conversion.

6. Cluster-Sensitive Rules

Schwa behavior depends strongly on consonant clusters. If the ending contains cluster-like sequences, the converter must decide whether:

the cluster is preserved without schwa
a final schwa is reintroduced
the word belongs to an exception group

Examples of cluster-sensitive endings include:

-ন্ত্র
-ঙ্গ
-ঙ্ক
-ক্ষ
-র্থ
-স্ত
-গ্ন
-দ্র
-শ্ন
-র্গ

Such clusters often require special handling in phonological conversion.

7. Exception Lists in Rule-Based Systems

No matter how carefully the general rules are designed, some words must be stored in lexical exception lists.

These exception lists are not a weakness. They are a normal part of practical pronunciation systems.

Why exception lists are necessary

historical spelling may differ from modern pronunciation
borrowed words may preserve unusual patterns
learned Sanskrit forms often behave differently
dictionary tradition may preserve conservative orthography

A final-schwa exception list allows the converter to preserve pronunciation in words that would otherwise be wrongly reduced.

8. A Rule-Based Computational Strategy

A practical schwa-handling pipeline may proceed in the following order:

1. Normalize Unicode
2. Parse graphemes
3. Apply basic consonant/vowel mapping
4. Detect internal clusters
5. Apply medial schwa deletion rules
6. Apply final schwa rules
7. Check lexical exception list
8. Produce final IPA output

This staged approach is more reliable than trying to solve all schwa behavior in one regex pass.

9. Example Workflow

Input word: অক্ষর

Step 1: normalize text

অক্ষর

Step 2: identify grapheme sequence

অ + ক্ষ + র

Step 3: apply base mappings

ɔ + kʰʃ + ɔr

Step 4: evaluate schwa behavior

cluster preserved, no extra schwa inserted

Final IPA:

ɔkʰʃɔr

Input word: অগ্নি

Step 1: normalize text

অগ্নি

Step 2: grapheme parsing

অ + গ্নি

Step 3: cluster recognition

gn cluster detected

Final IPA:

ɔgni

10. Final Schwa in TTS Engineering

In a diphone-based TTS system, the final schwa has direct engineering consequences.

For example, compare the two outputs:

Output A: kɔtʰa
Diphones:

#-k
k-ɔ
ɔ-tʰ
tʰ-a
a-#

Output B: kɔtʰ
Diphones:

#-k
k-ɔ
ɔ-tʰ
tʰ-#

These produce entirely different diphone sequences and filenames. Therefore, schwa rules are not merely theoretical—they determine whether audio files are found.

11. Recommended Implementation Principles

For a practical Bishnupriya Manipuri pronunciation engine, the following principles are recommended:

keep schwa handling in a dedicated function
separate medial and final schwa logic
use exception lists for difficult lexical items
test against real dictionary data
validate against actual audio library output

A good engineering rule is: freeze schwa rules before rebuilding the diphone inventory. Otherwise every change in pronunciation rules invalidates previously generated diphones.

12. Linguistic and Computational Significance

Documenting schwa rules is valuable for both linguistics and speech technology.

Linguistically, it clarifies:

syllable structure
cluster behavior
word-final vowel patterns
interaction of orthography and pronunciation

Computationally, it supports:

IPA conversion
phoneme extraction
diphone generation
TTS reliability

13. Conclusion

Schwa deletion is one of the central problems in Bishnupriya Manipuri pronunciation modeling. A successful rule-based system must distinguish between:

ordinary vowel preservation
cluster-driven schwa deletion
final-schwa behavior
lexical exceptions

For speech synthesis, these rules are indispensable. Incorrect schwa handling leads directly to incorrect IPA, incorrect diphones, and missing audio files.

A carefully documented and stable schwa rule system therefore forms the backbone of a Bishnupriya Manipuri text-to-speech engine.

Bishnupriya Manipuri Research Archive

Language, linguistics, dictionary, IPA, phonemes, diphones, and speech technology