Malayalam phonology
Malayalam phonology comprises the sound system of the Malayalam. It is characterized by a large consonant inventory, including true subapical retroflexes and a phonemic distinction in vowel length. The language also maintains clear contrasts among dental, alveolar, and retroflex places of articulation.
For the consonants and vowels, the International Phonetic Alphabet (IPA) symbol is given, followed by the Malayalam character and the ISO 15919 transliteration.[2] The current Malayalam script bears high similarity with Tigalari script, which was used for writing the Tulu language, spoken in coastal Karnataka (Dakshina Kannada and Udupi districts) and the northernmost Kasargod district of Kerala.[3] Tigalari script was also used for writing Sanskrit in Malabar region.
Vowels
| Short | Long | |||||
|---|---|---|---|---|---|---|
| Front | Central | Back | Front | Central | Back | |
| Close | /i/ ഇ | /u/ ഉ | /iː/ ഈ | /uː/ ഊ | ||
| Mid | /e/ എ | /ə̆/ ് | /o/ ഒ | /eː/ ഏ | /oː/ ഓ | |
| Open | /a/ അ | /aː/ ആ | ||||
- ⟨്⟩ /ə̆/ [ə̆~ɤ̈̆] formed from word final short /u/'s but now there are /u/'s finally as well, mostly in loanwords but also natively like in guru, kuru, puẓu and native pērŭ, there are minimal pairs as well: appŭ "water", appu a given name, naṭŭ "plant! (imperative)", naṭu "middle"; It is also added to the end of loanwords ending in some consonants, e.g. Sanskrit manas, āpad, Malayalam manassŭ, āpattŭ, English current Ml. karaṇṭŭ. In declensions the short u is deleted while the full u isnt, eg. naṭŭ-illa, naṭu-illa > naṭilla, naṭuvilla. It is the saṁvr̥tōkāram, an epenthentic vowel in Malayalam. Therefore, it has no independent vowel letter (because it never occurs at the beginning of words) but, when it comes after a consonant, there are various ways of representing it. In medieval times, it was just represented with the symbol for /u/ ⟨കു⟩, but later on it was just completely omitted (that is, written as an inherent vowel ⟨ക⟩, thus, ⟨പേര്⟩ pērŭ "name" was once written as ⟨പേര⟩ pēra which means "guava"). In modern times, it is written in two different ways – the Northern style, in which a chandrakkala is used ⟨ക്⟩, and the Southern or Travancore style, in which the diacritic for a /u/ is attached to the preceding consonant and a chandrakkala is written above ⟨കു്⟩. According to one author, this alternative form ⟨കു്⟩ is historically more correct, though the simplified form without a vowel sign u is common nowadays.[4]
- /a/ (phonetically central: [ä]) is represented as basic or the "default" vowel in the abugida script.
Malayalam has also borrowed the Sanskrit diphthongs of /ai̯/ (represented in Malayalam as ഐ, ai) and /au̯/ (represented in Malayalam as ഔ, au) although these mostly occur only in Sanskrit loanwords. Traditionally (as in Sanskrit), four vocalic consonants (usually pronounced in Malayalam as consonants followed by the saṁvr̥tōkāram, which is not officially a vowel, and not as actual vocalic consonants) have been classified as vowels: vocalic r (ഋ, /rɨ̆/, r̥), long vocalic r (ൠ, /rɨː/, r̥̄), vocalic l (ഌ, /lɨ̆/, l̥) and long vocalic l (ൡ, /lɨː/, l̥̄). Except for the first, the other three have been omitted from the current script used in Kerala as there are no words in current Malayalam that use them.
Some authors say that Malayalam has no diphthongs and /ai̯, au̯/ are clusters of V+glide j/ʋ[5] while others consider all V+glide clusters to be diphthongs /ai̯, aːi̯, au̯, ei̯, oi̯, i̯a/ as in kai, vāypa, auṣadhaṁ, cey, koy and kāryaṁ[2]
Vowel length is phonemic and all of the vowels have minimal pairs; for example kaṭṭi "thickness", kāṭṭi "showed", koṭṭi "tapped", kōṭṭi "twisted, stick, marble", er̠i "throw", ēr̠i "lots"[2]
Vowels tend to be fronted around palatalized consonants and backed around velarized consonants.[6]
Some speakers also have /æː/, /ɔː/, /ə/ from English loanwords; e.g. /bæːŋgɨ̆/ "bank" but most speakers replace it with /aː/, /eː/ or /ja/; /oː/ or /aː/ and /e/ or /a/.[5]
Consonants
| Labial | Dental | Alveolar | Retroflex | Postalveolar/ Palatal |
Velar | Glottal | ||
|---|---|---|---|---|---|---|---|---|
| Nasal | m മ | n̪ ന | n ന / ഩ[a] | ɳ ണ | ɲ ഞ | ŋ, (ŋʲ) ങ | ||
| Plosive/ Affricate |
voiceless | p പ | t̪ ത | t റ്റ | ʈ ട | t͡ɕ~t͡ʃ ച | k, (kʲ) ക | |
| aspirated | pʰ ഫ | t̪ʰ ഥ | ʈʰ ഠ | t͡ɕʰ~t͡ʃʰ ഛ | kʰ ഖ | |||
| voiced | b ബ | d̪ ദ | (d)[b] ന്റ | ɖ ഡ | d͡ʑ~d͡ʒ ജ | ɡ ഗ | ||
| breathy | bʱ ഭ | d̪ʱ ധ | ɖʱ ഢ | d͡ʑʱ~d͡ʒʱ ഝ | ɡʱ ഘ | |||
| Fricative | f ഫ | s, (z) സ | ʂ ഷ | ɕ~ʃ ശ | h ഹ | |||
| Approx. | central | ʋ വ | ɻ ഴ [c] | j യ | ||||
| lateral | l ല | ɭ ള | ||||||
| Tap | ɾ ര | |||||||
| Trill | r റ | |||||||
- As in other Dravidian languages, the retroflex series are true subapical consonants, in which the underside of the tongue contacts the roof.[7]
- All of the alveolars except /s/ are apical.[2]
- /ɕ ~ ʃ, t͡ɕ ~ t͡ʃ, t͡ɕʰ ~ t͡ʃʰ, d͡ʑ ~ d͡ʒ, d͡ʑʱ ~ d͡ʒʱ/ can either be postalveolar or alveolo-palatal depending upon the speaker and dialect; the postalveolar and alveolo-palatal realizations are in free variation.[1]
- The alveolar nasal once had a separate character ⟨ഩ⟩ that is now obsolete and the sound is now almost always represented by the symbol that was originally used only for the dental nasal. However, both sounds are extensively used in current colloquial and official Malayalam, and although they were allophones in Old Malayalam, they now occasionally contrast in gemination – for example, eṉṉāl ('by me', first person singular pronoun in the instrumental case) and ennāl ('if that is so'), which are both written ennāl (എന്നാൽ) and tiṉṉŭ "eat!", tinnŭ "ate".
- The unaspirated alveolar stop also had a separate character ⟨ഺ⟩ but it has become obsolete, as the sound only occurs in geminate form (when geminated it is written with a റ below another റ ⟨റ്റ⟩) or immediately following other consonants (in these cases, റ or ററ are usually written in small size underneath the first consonant).
- The proto Dravidian alveolar stop *ṯ developed into an alveolar trill /r/ in many of the Dravidian languages and *ṉṯ became nn in Malayalam while *ṯṯ remained.[5] Currently Malayalam only has [nd] in the genitive case ending -ṉṟe and a word formed with it taṉṟēṭam; Malayalam regained it from the older genitive case ending -ṉuṭaiya > -ṉuṭe > -ṉṭe > -ṉṟe, Malayalam still retains both forms in words like eṉṉuṭe and eṉṟe though the former is dated, a similar process happened in some Sri Lankan Tamil dialects.[8]
- ന്റ is pronounced as [nd] but ൻറ can be pronounced as [nd] or [nr]; [nr] doesn't occur natively but it occurs in loans like എൻറോൾ (en̠r̠ōḷ) 'enroll' or ഹെൻറി (hen̠r̠i) 'Henry'.[9]
- All non geminated voiceless stops and affricate become voiced intervocalically and after a nasal as in Tamil.[2][5]
- The geminated velars /kk/ and /ŋŋ/ are sometimes palatalized word medially after /j, i(:), e(:)/ like in the words കിടക്കുക [kiɖɐk:ugɐ] vs ഇരിക്കുക [iɾikʲ:ugɐ] and മങ്ങൽ [mɐŋ:ɐl] vs. മത്തങ്ങ [mɐt̪:ɐŋʲ:ɐ], their distribution is unpredictable e.g. it doesn't palatalize in vikkŭ but does in irikkŭ. If the palatalization is from /j/ it is sometimes deleted e.g. poykko can be [pojkʲːo] or [pokʲːo], aḍaykka as [ɐɖɐjkʲːɐ] or [ɐɖɐkʲːɐ]. Some of the northern dialects might pronounce them without palatalization.[1][2]
- The letter ഫ represents both /pʰ/, a phoneme occurring in Sanskrit loanwords, and /f/, which is mostly found in comparatively recent borrowings from European languages. Though nowadays most people (especially youngsters) pronounce /pʰ/ as /f/ like in the word ഫലം /falam/.[2] In the Jesari dialect the native word-initial /p/ is also spirantized to [f].[10]
- /m, n, ɳ, l, ɭ/ are unreleased word finally.[2] Words will never begin or end with a geminated consonant. All consonants appear word medially.[2]
- The plain stops, affricates, nasals, laterals, the fricatives other than /h/ and /ʂ/ and approximants other than /ɻ/ can be geminated and gemination can sometimes change the meaning of the word, e.g. kaḷam 'cell', kaḷḷam 'lie'.[2] /ɲ, ŋ, t/ only occur in geminated form intervocalically.[1]
- The approximant /ɻ/ has both rhotic and lateral qualities, and is indeterminate between an approximant and a fricative. The articulation of /ɻ/ changes part-way through, perhaps explaining why it behaves as both a rhotic and a lateral, both an approximant and a fricative, but the nature of the change is not understood.[11]
- /n, t, ɾ, l, ɻ/ are palatalized and have an advanced tongue root while /n̪, r, ɭ/ are clear or velarized and have a retracted tongue root, particularly noticeable in geminates. /ɾ, r/ can vary between ⟦ɹ̟ʲ ~ ɾ̟ʲ ~ r̟ʲ, ɾ̠ˠ ~ r̠ˠ⟧.[1][12][13]
- /ʋ/ may be realized as [ʋ], [v], [w] or [β̞].[1]
- Around 75% of nk and 50% of ñc and nt from Old Malayalam got assimilated to ṅṅ, ññ and nn, almost all of the ṉṯ merged with nn suggesting an earlier merger of some of the n̠t̠ and nt (for e.g. the cognate of Tamil nan̠r̠i is spelt as nandi (influenced by Sanskrit nandi) and pronounced nanni); mp and ṇṭ were unchanged, e.g. Tamil mūṉṟu, maruntu, kañci, teṅku, but Malayalam mūnnŭ, marunnŭ, kaññi, teṅṅŭ.[14] Word final ai, āy and ey became a unless the word is monosyllabic, e.g. Tamil avai, māṅgāy, veṇṇey but Malayalam ava, māṅṅa, veṇṇa. Final āy in monosyllabic words became āya e.g. Tamil kāy, but Malayalam kāya.
- Loanwords with /z/ are replaced with /s/ and not with /d͡ʒ/ like in Hindi or Telugu e.g. /brasi:l/ English "Brazil" unless it was loaned through Indo-Aryan then the Indo-Aryan pronunciation is taken e.g. /d͡ʒilla/ Hi. /d͡ʒilaː/ Per. /zilʔ/, other Perso-Arabic phonemes like /q, x, ɣ, ħ, Cˤ, ʕ, ʔ/ are replaced with /k, kʰ, g, h, C, ∅, ∅/, sometimes /q, x/ are replaced with /kʰ, k/ e.g. Arabic قطر qaṭar, خَطّ xaṭṭ became ഖത്തർ khattaṟ, കത്ത് kattŭ. English loans with /θ, ð, ʒ/ are replaced with /t̪, d̪, ʃ/; the dentals do not clash with English loans with /t, d/, which are replaced with [t, d] or [ʈ, ɖ], though [d] is rare because of the limited distribution natively e.g. "taxi" as ṯāksi or more commonly ṭāksi. The English /ɹ/ is loaned as either /ɾ/ or /r/ unpredictably, for e.g. 'current' got loaned as karaṇḍŭ but 'maroon' got loaned as mar̠ūṇ or mer̠ūṇ but the cluster /ɹs/ is loaned as /ɻs/ other clusters are loaned as /rC/ or /ɾC/, /ɻ/ only occurs in words with /ɹs/ e.g. 'force' as fōḻsŭ. Speakers with non rhotic English accents don't have /ɹC/ clusters in English loans and pronounce it as fōs(ŭ). In Sanskrit, loans with /t̪C/ and /d̪C/ (unless C is a sonorant or a dental stop) sometimes the /t̪, d̪/ becomes /l/ especially in /t̪s/ e.g. utsava > ulsavam, utpādana > ulpādaṉam, udghāṭana > ulghāḍaṉam.[15] There are some native words with /s/ (urasŭ), /ʃ/ (vīśŭ), /ʂ/ (muṣiyuka), /pʰ/ (apphan), /bʰ/ (ēbhyan), some interjections with /h/ (ōhō, āhā) but rest of the fricatives (except /f/ in native words of Jesari) and aspirates are only found in loans.
- Rarely, some speakers pronounce the voiced aspirated consonants as voiceless so words like dhaṉam as thaṉam, it is more commonly deaspirated so dhaṉam as daṉam and kharam as karam, intervocalically the voiceless aspirate also becomes voiced so mukham as mogam.[16]
Colloquial and dialectal language
Source:[15]
- i, u gets lowered to e, o when before Ca, eg. iṭam, mukham > eṭam, mogam, this change is seen in 10th century inscriptions, irunnaṭattu instead of irunniṭattu.[17] Exceptions include ivan uḷḷa > ivan, oḷḷa (south), ŭḷḷa iḷḷa, eḷḷa, ḷḷa in other dialects. The latter word and uṇṭŭ have seen such variation since old Malayalam of the south. u can become ŭ sporadically in other dialects as with the previous word.
- Medial u's maybe pronounced as a kuttiyalugaram in the north and center, in central it may even merge with a for some, eg. kaṟuppŭ > kaṟ(a)ppŭ, appuṟam > app(a)ṟam, taṇuttŭ > taṇ(a)ttŭ. In center and north, ru can become ri when there is a non back vowel preceding it, eg. ceruppŭ, irumpŭ, parutti, kurukkŭ > cerippŭ, irimpŭ, paritti, kurikkŭ. This change is also seen in the standard form.
- Sporadic cases of a > e, eg. laḍḍu, gaṅga, daśa, karayuka, raktam, raṇṭŭ, śani, bahu, jalam > leḍḍu, geṅṅa, deśa, kareyuka, rektam, reṇṭŭ, śeni, behu, jelam, but not in calam, śaśi or nagaram.
- Cases of aya, ava > ē, ō, most commonly in the north and in some castelects, ef. avaḷuṭe > ōḷṭe.
- A general feature is that the overall length of vowels decrease further north you go.
- In fast speech initial consonants may be dropped, eg. ceytu koḷḷām > ceytōḷām; ḍraivŭ cey > ḍraivey; iṭṭu vaccu > iṭṭēccu; pōkuka vēṇam eṭā > pōkēṇam ḍā/pōkaṇam ḍa/pōṇōḍa; pōkuka vēṇṭā eṭī > pōkēṇṭā ḍī/pōkaṇṭa ḍi/pōṇḍṟi; paḻam āyi > paḻōy; pōyi koḷḷuka vēṇam > poykkōḷaṇam > pokkyōṇam.
- l, ḷ, ḻ dropped in coda, eg. tāḻttŭ, vilkkŭ, ñaṅṅaḷuṭe (>ñaṅṅaḷṭe), taṇṇīrmattaṉ, ippōḷ > tāttŭ, vikkŭ, ñaṅṅaṭe, taṇṇimattaṉ, ippō. Northern and Southern dialects might hypercorrect the last and similar words to ippam. Word finally it happens only if the next word starts with a consonant.
- Medial k maybe lenited to a fricative or completely lost in center and north, eg. varukayilla > SK. varilla, NK. CK. varūla; pōkunnŭ > CK. pōṇŭ.
- ḻ merged with y in certain Mappila and castelects, eg. paḻam > payam. Judeo-Malayalam used to made them t Intervocalically and s before another t, there are also cases of hypercorrection like kaḻa < katha, but they are only attested in writting and wasn't present during aliyah. This was also done by certain northern Thiyya speakers affirming the affinity of Judeo Malayalam and northern Malayalam.[18]
- More cases of nasal assimilation, eg. candaṉam, bhaṅgi, vēṇṭum > cannaṉam, baṅṅi, vēṇam. In some dialects neñcŭ, kuṭumbam > neññŭ, kuṭummam.
- Loss of aspiration, eg. dhaṉam, mukham > daṉam, mogam/mōm.
- śc > śś, sometimes cc too, eg. niścayam, talaccēri > niśśayam, talaśśēri.
- Merger of v with b farther north and sporadically in center, particularly among Mappila speech, e.g. vā, vēṇam > bā, bēṇam. In some areas like Malappuram it's merged with m instead, so vēṇam > mēṇam.[19]
References
- ^ a b c d e f Namboodiripad, Savithry (2016). Malayalam (Namboodiri Dialect) (Thesis). Cambridge University Press. Archived from the original on 2 March 2022. Retrieved 22 December 2020.
- ^ a b c d e f g h i j Haowen Jiang (April 2010). "Malayalam: a Grammatical Sketch and a Text" (PDF). Department of Linguistics, Rice University. Archived from the original (PDF) on 11 September 2012. Retrieved 8 January 2022.
- ^ Vaishnavi Murthy K Y; Vinodh Rajan. "L2/17-378 Preliminary proposal to encode Tigalari script in Unicode" (PDF). unicode.org. Archived (PDF) from the original on 8 February 2018. Retrieved 28 June 2018.
- ^ Muller, Eric (2006). "Malayalam cillaksarams" (PDF). JTC1/SC2/WG2 N3126 L2/06-207. Archived (PDF) from the original on 24 November 2020. Retrieved 10 September 2009.
- ^ a b c d e Krishnamurti, Bhadriraju (2003). The Dravidian Languages. Cambridge University Press.
- ^ "Palatalization and velarization in Malayalam nasals: a preliminary acoustic study of the dental–alveolar contrast" (PDF). Archived from the original (PDF) on 19 October 2023.
- ^ Hamann, Silke (2003). The Phonetics and Phonology of Retroflexes (PDF) (Thesis). Utrecht, Netherlands. Archived (PDF) from the original on 16 January 2021. Retrieved 13 January 2021.
- ^ Zvelebil, Kamil (1965). Some features of Ceylon Tamil. Indo-Iranian Journal. Vol. 9. JSTOR. pp. 113–138. JSTOR 24650188.
- ^ The Unicode Standard Version 13.0 – Core Specification, South and Central Asia-I, Official Scripts of India pg. 514
- ^ Andronov, Mikhail Sergeevich (2003). A Comparative Grammar of the Dravidian Languages. Otto Harrassowitz Verlag. ISBN 978-3-447-04455-4. Archived from the original on 2 October 2022. Retrieved 25 June 2022.
- ^ Scobbie, Punnoose & Khattab (2013) "Articulating five liquids: a single speaker ultrasound study of Malayalam". In Rhotics: New Data and Perspectives. BU Press, Bozen-Bolzano.
- ^ Dowla Khan, Sameer ud (2021). Palatalization and velarization in Malayalam nasals: a preliminary acoustic study of the dental–alveolar contrast (PDF). Reed College.
- ^ "Phonetic implementation of geminates in Malayalam nouns" (PDF). Archived from the original (PDF) on 15 April 2012.
- ^ Krishnamurti (2003), p. 167.
- ^ a b Shekhar, A. C.; Sankaran, C. R. (1944). "Notes on Colloquial Malayāḷam". Bulletin of the Deccan College Research Institute. 6 (1/2): 49–52. JSTOR 42929361.
- ^ Steever (2015), p. 63.
- ^ Sekhar, A. C. (1951). "[Evolution of Malayalam]". Bulletin of the Deccan College Research Institute. 12 (1/2). JSTOR 42929457.
- ^ Gamliel, Ophira. "(PDF) Fading memories and Linguistic Fossils in the religiolect of Kerala Jews". www.academia.edu. Retrieved 2 December 2025.
- ^ Ravi Sankar S. Nair (7 July 2013). "Tribal Languages of Kerala" (PDF). Language in India. 13. Bloomington, MN: M. S. Thirumalai. ISSN 1930-2940. Archived (PDF) from the original on 12 March 2025. Retrieved 11 August 2025.
Sources
- Steever, Sanford B. (15 April 2015). The Dravidian Languages. Routledge. ISBN 978-1-136-91171-2.