Bengali language

বাংলা baṅgla
Spoken in:India,Bangladesh, and several others 
Region:Eastern South Asia
Total speakers:230 million (189 million native) 
Ranking:6,[1] 5,[2]
Language family:}}}
   Eastern Group
Writing system:Bengali script 
Official status
Official language of:
The template is . Please use instead.
This usage is deprecated. Please replace it with {{tdeprecated|Bengali language|Country}}.
'''The template is deprecated. Please use instead.
 India (West Bengal and Tripura)
Regulated by:Bangla Academy (Bangladesh)
Paschimbanga Bangla Akademi (West Bengal)
Language codes
ISO 639-1:bn
ISO 639-2:ben
ISO 639-3:ben 
Global extent of Bengali.
Bengali or Bangla (IPA: ] ) is an Indo-Aryan language of the eastern Indian subcontinent, evolved from the Magadhi Prakrit, Pāli and Sanskrit languages.

Bengali is native to the region of eastern South Asia known as Bengal, which comprises present day Bangladesh and the Indian state of West Bengal. With nearly 230 million total speakers, Bengali is one of the most widely spoken languages (ranking 5th[2] or 6th[1] in the world). Bengali is the primary language spoken in Bangladesh and is the second most widely spoken language in India[3][4]. Along with Assamese, it is geographically the most eastern of the Indo-Iranian languages.

The Bengali language, with its long and rich literary tradition, serves to bind together a culturally diverse region. In 1952, when Bangladesh used to be East Pakistan, this strong sense of identity led to the Bengali Language Movement, in which several people braved bullets and died on February 21. This day has now been declared as the International Mother Language Day.


Like other Eastern Indo-Aryan languages, Bengali arose from the eastern Middle Indic languages of the Indian Subcontinent. Magadhi Prakrit, the earliest recorded spoken language in the region and the language of the Buddha, had evolved into Ardhamagadhi ("Half Magadhi") in the early part of the first millennium CE. Ardhamagadhi, as with all of the Prakrits of North India, began to give way to what are called Apabhramsa languages just before the turn of the first millennium.[5] The local Apabhramsa language of the eastern Subcontinent, Purvi Apabhramsa or Apabhramsa Abahatta, eventually evolved into regional dialects, which in turn formed three groups: the Bihari languages, the Oriya languages, and the Bengali-Assamese languages. Some argue for much earlier points of divergence—going back to even 500 CE[6] but the language was not static; different varieties coexisted and authors often wrote in multiple dialects. For example, Magadhi Prakrit is believed to have evolved into Apabhramsa Abahatta around the 6th century which competed with Bengali for a period of time.[7]

Usually three periods are identified in the history of Bengali:[5]
  1. Old Bengali (900/1000 CE–1400 CE)—texts include Charyapada, devotional songs; emergence of pronouns Ami, tumi, etc; verb inflections -ila, -iba, etc. Oriya and Assamese branch out in this period.
  2. Middle Bengali (1400–1800 CE)—major texts of the period include Chandidas's Srikrishnakirtan; elision of word-final ô sound; spread of compound verbs; Persian influence. Some scholars further divide this period into early and late middle periods.
  3. New Bengali (since 1800 CE)—shortening of verbs and pronouns, among other changes (e.g. tahartar "his"/"her"; koriyachhilôkorechhilo he/she had done).

Historically closer to Pali, Bengali saw an increase in Sanskrit influence during the Middle Bengali (Chaitanya era), and also during the Bengal Renaissance. Of the modern Indo-European languages in South Asia, Bengali and Marathi retain a largely Sanskrit vocabulary base while Hindi and others such as Punjabi are more influenced by Arabic and Persian.

Enlarge picture
Shaheed Minar, or the Martyr's monument, in Dhaka, commemorates the struggle for the Bengali language
Until the 18th century, there was no attempt to document the grammar for Bengali. The first written Bengali dictionary/grammar, Vocabolario em idioma Bengalla, e Portuguez dividido em duas partes, was written by the Portuguese missionary Manoel da Assumpcam between 1734 and 1742 while he was serving in Bhawal.[9] Nathaniel Brassey Halhed, a British grammarian, wrote a modern Bengali grammar(A Grammar of the Bengal Language (1778)) that used Bengali types in print for the first time.[10] Raja Ram Mohan Roy, the great Bengali Reformer, also wrote a "Grammar of the Bengali Language" (1832).

During this period, the Choltibhasha form, using simplified inflections and other changes, was emerging from Shadhubhasha (older form) as the form of choice for written Bengali.[11]

Bengali was the focus, in 1951–52, of the Bengali Language Movement (Bhasha Andolon) in what was then East Pakistan (now Bangladesh).[12] Although Bengali speakers were more numerous in the population of Pakistan, Urdu was legislated as the sole national language. On February 21, 1952, protesting students and activists walked into military and police fire in Dhaka University and three young students and several others were killed. Subsequently, UNESCO has declared 21 February as International Mother Language Day. In a separate event in May 1961, police in Silchar, India, killed eleven people who were protesting legislation that mandated the use of the Assamese language.[13]

Geographical distribution

Enlarge picture
The native geographic extent of Bengali

Bengali is native to the region of eastern South Asia known as Bengal, which comprises Bangladesh and the Indian state of West Bengal. Around 98% of the total population of Bangladesh speak Bengali as a native language.[14] There are also significant Bengali-speaking communities in immigrant populations in the Middle East, West and Malaysia.

Official status

Bengali is the national and official language of Bangladesh and one of the 23 national languages recognised by the Republic of India.[3] It is the official language of the state of West Bengal and the co-official language of the state of Tripura, Cachar District of southern Assam and the union territory of Andaman and Nicobar Islands. Bengali speakers make the majority in Neil Island and Havelock Island. It was made an official language of Sierra Leone in order to honour the Bangladeshi peacekeeping force from the United Nations stationed there.[16] It is also the co-official language of Assam, which has three predominantly Sylheti-speaking districts of southern Assam: Silchar, Karimganj, and Hailakandi.[17] The national anthems of both India and Bangladesh were written in Bengali by Rabindranath Tagore.


Main article: Bengali dialects

Regional variation in spoken Bengali constitutes a dialect continuum. Linguist Suniti Kumar Chatterjee grouped these dialects into four large clusters — Radh, Banga, Kamarupa and Varendra;[10] but many alternative grouping schemes have also been proposed.[19] The south-western dialects (Radh) form the basis of standard colloquial Bengali, while Bangali is the dominant dialect group in Bangladesh. In the dialects prevalent in much of eastern and south-eastern Bengal (Barisal, Chittagong, Dhaka and Sylhet divisions of Bangladesh), many of the stops and affricates heard in West Bengal are pronounced as fricatives. Western palato-alveolar affricates চ [], ছ [tʃʰ], জ correspond to eastern চʻ [ts], ছ় [s], জʻ [dz]~z. The influence of Tibeto-Burman languages on the phonology of Eastern Bengali is seen through the lack of nasalized vowels. Some variants of Bengali, particularly Chittagonian and Chakma Bengali, have contrastive tone; differences in the pitch of the speaker's voice can distinguish words.

Rajbangsi, Kharia Thar and Mal Paharia are closely related to Western Bengali dialects, but are typically classified as separate languages. Similarly, Hajong is considered a separate language, although it shares similarities to Northern Bengali dialects.[20]

During the standardization of Bengali in the late 19th and early 20th century, the cultural center of Bengal was its capital Kolkata (then Calcutta). What is accepted as the standard form today in both West Bengal and Bangladesh is based on the West-Central dialect of Nadia, a district located near Kolkata.[21] There are cases where speakers of Standard Bengali in West Bengal will use a different word than a speaker of Standard Bengali in Bangladesh, even though both words are of native Bengali descent. For example, nun (salt) in the west corresponds to lôbon in the east.[21]

Spoken and literary varieties

Bengali exhibits diglossia between the written and spoken forms of the language. Two styles of writing, involving somewhat different vocabularies and syntax, have emerged:[21][23]
  1. Shadhubhasha (সাধু shadhu = 'chaste' or 'sage'; ভাষা bhasha = 'language') was the written language with longer verb inflections and more of a Sanskrit-derived (তৎসম tôtshôm) vocabulary. Songs such as India's national anthem Jana Gana Mana (by Rabindranath Tagore) and national song Vande Mātaram (by Bankim Chandra Chattopadhyay) were composed in Shadhubhasha. However, use of Shadhubhasha in modern writing is negligible, except when it is used delibarately to achieve some effect.
  2. Choltibhasha (চলতিভাষা ) or Cholitobhasha (চলিত cholito = 'current' or 'running') , known by linguists as Manno Cholit Bangla (Standard Current Bangla), is a written Bengali style exhibiting a preponderance colloquial idiom and shortened verb forms, and is the standard for written Bengali now. This form came into vogue towards the turn of the 19th century, promoted by the writings of Peary Chand Mitra (Alaler Gharer Dulal, 1857),[24] Pramatha Chowdhury (Sabujpatra, 1914) and in the later writings of Rabindranath Tagore. It is modeled on the dialect spoken in the Shantipur region in Nadia district, West Bengal. This form of Bengali is often referred to as the "Nadia standard" or "Shantipuri bangla".[19]
Linguistically, cholit bangla is derived from sadhu bangla through two successive standard linguistic transformations.

While most writings are carried out in cholit bangla, spoken dialects exhibit a far greater variety. South-eastern West Bengal, including Kolkata, speak in manno cholit bangla. Other parts of West Bengal and west Bangladesh speak in dialects that are minor variations, such as the Medinipur dialect characterised by some unique words and constructions. However, areas of Bangladesh, particularly the Chittagong region, speak in a dialect that bears very little superficial resemblance to manno cholit bangla, including an entirely different vocabulary. The difference is so much that a person from West Bengal will be very hard pressed to understand even a single sentence in a passage of this dialect. This is known as the Bongali sublanguage, or more informally as Chattagram bangla. Writers (such as Manik Bandopadhyay in Padmanodir Majhi) have used the Bongali dialect in writing conversations. Though formal spoken Bengali is modeled on manno cholit bangla, the majority of Bengalis are able to communicate in more than one variety — often, speakers are fluent in choltibhasha and one or more Regional dialects.[11]

Even in Standard Bengali, vocabulary items often divide along the split between the Muslim populace and the Hindu populace. Due to cultural and religious traditions, Hindus and Muslims might use, respectively, Sanskrit-derived and Perso-Arabic words. Some examples of lexical alternation between these two forms are:[21]
  • hello: nômoshkar (S) corresponds to assalamualaikum/slamalikum (A)
  • invitation: nimontron/nimontonno (S) corresponds to daoat (A)
  • paternal uncle: kaka (S) corresponds to chacha (S/Hindi)
  • water : jol (D) corresponds to pani (S)
(here S = derived from Sanskrit, D = deshi; A = derived from Arabic)

Writing system

Main article: Bengali script
Enlarge picture
Anandabazar Patrika, a news daily published from Kolkata in Bengali.

The Bengali writing system is not purely alphabet-based such as the Latin script. Rather, it is written in the Bengali abugida, a variant of the Eastern Nagari script used throughout Bangladesh and eastern India. It is believed to have evolved from a modified Brahmic script around 1000 CE,[27] and is similar to the Devanagari abugida used for Sanskrit and many modern Indic languages such as Hindi. It has particularly close historical relationships with the Assamese script and the Oriya script (although the latter is not evident in appearance). The Bengali abugida is a cursive script with eleven graphemes or signs denoting the independent form of nine vowels and two diphthongs, and thirty-nine signs denoting the consonants with the so called "inherent" vowels.[27] The Bengali orthography reads from left to right.

Although the consonant signs are presented as segments in the basic inventory of the Bengali script, they are actually orthographically syllabic in nature. Every consonant sign has the vowel অ [ɔ] (or sometimes the vowel ও [o]) "embedded" or "inherent" in it.[29] For example, the basic consonant sign ম is pronounced [] in isolation. The same ম can represent the sounds [] or [mo] when used in a word, as in মত [t̪] "opinion" and মন [mon] "mind", respectively, with no added symbol for the vowels [ɔ] and [o].

A consonant sound followed by some vowel sound other than [ɔ] is orthographically realized by using a variety of vowel allographs above, below, before, after, or around the consonant sign, thus forming the ubiquitous consonant-vowel ligature. These allographs, called kars (cf. Hindi matras) are dependent vowel forms and cannot stand on their own. For example, the graph মি [mi] represents the consonant [m] followed by the vowel [i], where [i] is represented as the allograph ি and is placed before the default consonant sign. Similarly, the graphs মা [ma], মী [mi], মু [mu], মূ [mu], মৃ [mri], মে [me]/[], মৈ [moj], মো [mo] and মৌ [mow] represent the same consonant ম combined with seven other vowels and two diphthongs. It should be noted that in these consonant-vowel ligatures, the so-called "inherent" vowel is expunged from the consonant, but the basic consonant sign ম does not indicate this change.

To emphatically represent a consonant sound without any inherent vowel attached to it, a special diacritic, called the hôshonto (্‌), may be added below the basic consonant sign (as in ম্‌ [m]). This diacritic, however, is not common, and is chiefly employed as a guide to pronunciation.

The vowel signs in Bengali can take two forms: the independent form found in the basic inventory of the script and the dependent allograph form (as discussed above). To represent a vowel in isolation from any preceding or following consonant, the independent form of the vowel is used. For example, in মই [moj] "ladder" and in ইলিশ [iliʃ] "Hilsa fish", the independent form of the vowel ই is used (cf. the dependent form ি). A vowel at the beginning of a word is always realized using its independent form.

The Bengali consonant clusters (যুক্তাক্ষর juktakkhor in Bengali) are usually realized as ligatures, where the consonant which comes first is put on top of or to the left of the one that immediately follows. In these ligatures, the shapes of the constituent consonant signs are often contracted and sometimes even distorted beyond recognition. There are more than 400 such consonant clusters and corresponding ligatures in Bengali. Many of their shapes have to be learned by rote.

Three other commonly used diacritics in the Bengali are the superposed chôndrobindu (ঁ), denoting a suprasegmental for nasalization of vowels (as in চাঁদ [tʃãd] "moon"), the postposed onushshôr (ং) indicating the velar nasal [ŋ] (as in বাংলা [baŋla] "Bengali") and the postposed bishôrgo (ঃ) indicating the voiceless glottal fricative [h] (as in উঃ! [uh] "ouch!").

Bengali punctuation marks, apart from the daŗi (|), the Bengali equivalent of a full stop, have been adopted from Western scripts and their usage is similar. [10]The letters usually hang from a horizontal headstroke called the matra (not to be confused with its Hindi cognate matra, which denotes the dependent forms of Hindi vowels)
Enlarge picture
Signature of Rabindranath Tagore — an example of penmanship in Bengali.

Spelling-to-pronunciation inconsistencies

In spite of some modifications in the nineteenth century, the Bengali spelling system continues to be based on the one used for Sanskrit,[10] and thus does not take into account some sound mergers that have occurred in the spoken language. For example, there are three letters (শ, ষ, and স) for the voiceless palato-alveolar fricative [ʃ], although the letter স does retain the voiceless alveolar fricative [s] sound when used in certain consonant conjuncts as in স্খলন [skʰɔlon] "fall", স্পন্দন [spɔndon] "beat", etc. There are two letters (জ and য) for the voiced postalveolar affricate [] as well. What was once pronounced and written as a retroflex nasal ণ [ɳ] is now pronounced as an alveolar [n] (unless conjoined with another retroflex consonant such as ট, ঠ, ড and ঢ), although the spelling does not reflect this change. The near-open front unrounded vowel [æ] is orthographically realized by multiple means, as seen in the following examples: এত [æt̪o] "so much", এ্যাকাডেমী [ækademi] "academy", অ্যামিবা [æmiba] "amoeba", দেখা [d̪ækha] "to see", ব্যস্ত [bæst̪o] "busy", ব্যাকরণ [bækɔron] "grammar".

The realization of the inherent vowel can be another source of confusion. The vowel can be phonetically realized as [ɔ] or [o] depending on the word, and its omission is seldom indicated, as in the final consonant in কম [kɔm] "less".

Many consonant clusters have different sounds than their constituent consonants. For example, the combination of the consonants ক্‌ [k] and ষ [ʃɔ] is graphically realized as ক্ষ and is pronounced [kʰːo] (as in রুক্ষ [rukʰːo] "rugged") or [kʰo] (as in ক্ষতি [kʰot̪i] "loss") or even [kʰɔ] (as in ক্ষমতা [kʰɔmot̪a] "power"), depending on the position of the cluster in a word. The Bengali writing system is, therefore, not always a true guide to pronunciation.

For a detailed list of these inconsistencies, consult Bengali script.

Uses in other languages

The Bengali script, with a few small modifications, is also used for writing Assamese. Other related languages in the region also make use of the Bengali alphabet. Meitei, a Sino-Tibetan language used in the Indian state of Manipur, has been written in the Bengali abugida for centuries, though Meitei Mayek (the Meitei abugida) has been promoted in recent times. The script has been adopted for writing the Sylheti language as well, replacing the use of the old Sylheti Nagori script.[32]


Several conventions exist for writing Indic languages including Bengali in the Latin script, including "International Alphabet of Sanskrit Transliteration" or IAST (based on diacritics),[33] "Indian languages Transliteration" or ITRANS (uses upper case alphabets suited for ASCII keyboards),[34] and the National Library at Calcutta romanization.[35]

In the context of Bangla Romanization, it is important to distinguish between transliteration from transcription. Transliteration is orthographically accurate (i.e. the original spelling can be recovered), whereas transcription is phonetically accurate (the pronunciation can be reproduced). Since English does not have the sounds of Bangla, and since pronunciation does not completely reflect the spellings, being faithful to both is not possible.

Although it might be desirable to use a transliteration scheme where the original Bangla orthography is recoverable from the Latin text, Bangla words are currently Romanized on Wikipedia mixed a phonemic transcription, where the pronunciation is represented with no reference to how it is written. The Wikipedia Romanization is given in the table below, with IPA transcriptions as used above.

Highi u
High-mide o
Low-midê ô
Low a 
  s sh h
Nasalsm n  ng 
Liquids  l, rŗ   


Main article: Bengali phonology
The phonemic inventory of Bengali consists of 29 consonants and 14 vowels, including the seven nasalized vowels. An approximate phonetic scheme is set out below in International Phonetic Alphabet.

Highi u
High-mide o
Low-midæ ɔ
Low a 


  s ʃ h
Nasalsm n  ŋ 
Liquids  l, rɽ   


Magadhan languages such as Bengali are known for their wide variety of diphthongs, or combinations of vowels occurring within the same syllable.[36] Several vowel combinations can be considered true monosyllabic diphthongs, made up of the main vowel (the nucleus) and the trailing vowel (the off-glide). Almost all other vowel combinations are possible, but only across two adjacent syllables, such as the disyllabic vowel combination [u.a] in কুয়া kua "well". As many as 25 vowel combinations can be found, but some of the more recent combinations have not passed through the stage between two syllables and a diphthongal monosyllable.[37]

/ij/iinii "I take"
/iw/iubiubhôl "upset"
/ej/einei "there is not"
/ee̯/eekhee "having eaten"
/ew/euđheu "wave"
/eo̯/eokheona "do not eat"
/æe̯/êenêe "she takes"
/æo̯/êonêo "you take"
/aj/aipai "I find"
/ae̯/aepae "she finds"
/aw/aupau "sliced bread"
/ao̯/aopao "you find"
/ɔe̯/ôenôe "she is not"
/ɔo̯/ôonôo "you are not"
/oj/oinoi "I am not"
/oe̯/oedhoe "she washes"
/oo̯/oodhoo "you wash"
/ow/ounouka "boat"
/uj/uidhui "I wash"


In standard Bengali, stress is predominantly initial. Bengali words are virtually all trochaic; the primary stress falls on the initial syllable of the word, while secondary stress often falls on all odd-numbered syllables thereafter, giving strings such as shô-ho-jo-gi-ta "cooperation", where the boldface represents primary and secondary stress. The first syllable carries the greatest stress, with the third carrying a somewhat weaker stress, and all following odd-numbered syllables carrying very weak stress. However in words borrowed from Sanskrit, the root syllable is stressed, causing them to be out of harmony with native Bengali words.[38]

Adding prefixes to a word typically shifts the stress to the left. For example, while the word shob-bho "civilized" carries the primary stress on the first syllable [shob], adding the negative prefix [ô-] creates ô-shob-bho "uncivilized", where the primary stress is now on the newly-added first syllable অ ô. In any case, word-stress does not alter the meaning of a word and is always subsidiary to sentence-stress.[38]


For Bengali words, intonation or pitch of voice has minor significance, apart from a few isolated cases. However in sentences intonation does play a significant role.[40] In a simple declarative sentence, most words and/or phrases in Bengali carry a rising tone,[41] with the exception of the last word in the sentence, which only carries a low tone. This intonational pattern creates a musical tone to the typical Bengali sentence, with low and high tones alternating until the final drop in pitch to mark the end of the sentence.

In sentences involving focused words and/or phrases, the rising tones only last until the focused word; all following words carry a low tone.[41] This intonation pattern extends to wh-questions, as wh-words are normally considered to be focused. In yes-no questions, the rising tones may be more exaggerated, and most importantly, the final syllable of the final word in the sentence takes a high falling tone instead of a flat low tone.[43]

Vowel length

Vowel length is not contrastive in Bengali; all else equal, there is no meaningful distinction between a "short vowel" and a "long vowel",<ref name> unlike the situation in many other Indic languages. However, when morpheme boundaries come into play, vowel length can sometimes distinguish otherwise homophonous words. This is due to the fact that open monosyllables (i.e. words that are made up of only one syllable, with that syllable ending in the main vowel and not a consonant) have somewhat longer vowels than other syllable types.[44] For example, the vowel in cha: "tea" is somewhat longer than the first vowel in chaţa "licking", as cha: is a word with only one syllable, and no final consonant. (The long vowel is marked with a colon : in these examples.) The suffix ţa "the" can be added to cha: to form cha:ţa "the tea". Even when another morpheme is attached to cha:, the long vowel is preserved. Knowing this fact, some interesting cases of apparent vowel length distinction can be found. In general Bengali vowels tend to stay away from extreme vowel articulation.[44]

Furthermore, using a form of reduplication called "echo reduplication", the long vowel in cha: can be copied into the reduplicant ţa:, giving cha:ţa: "tea and all that comes with it". Thus, in addition to cha:ţa "the tea" (long first vowel) and chaţa "licking" (no long vowels), we have cha:ţa: "tea and all that comes with it" (both long vowels).

Consonant clusters

Native Bengali (tôdbhôb) words do not allow initial consonant clusters;[46] the maximum syllabic structure is CVC (i.e. one vowel flanked by a consonant on each side). Many speakers of Bengali restrict their phonology to this pattern, even when using Sanskrit or English borrowings, such as গেরাম geram (CV.CVC) for গ্রাম gram (CCVC) "village" or ইস্কুল iskul (VC.CVC) for স্কুল skul (CCVC) "school".

Sanskrit (তৎসম tôtshôm) words borrowed into Bengali, however, possess a wide range of clusters, expanding the maximum syllable structure to CCCVC. Some of these clusters, such as the mr in মৃত্যু mrittu "death" or the sp in স্পষ্ট spôshţo "clear", have become extremely common, and can be considered legal consonant clusters in Bengali. English and other foreign (বিদেশী bideshi) borrowings add even more cluster types into the Bengali inventory, further increasing the syllable capacity to CCCVCCCC, as commonly-used loanwords such as ট্রেন ţren "train" and গ্লাস glash "glass" are now even included in leading Bengali dictionaries.

Final consonant clusters are rare in Bengali.[47] Most final consonant clusters were borrowed into Bengali from English, as in লিফ্‌ট lifţ "lift, elevator" and ব্যাংক bêņk "bank". However, final clusters do exist in some native Bengali words, although rarely in standard pronunciation. One example of a final cluster in a standard Bengali word would be গঞ্জ gônj, which is found in names of hundreds of cities and towns across Bengal, including নবাবগঞ্জ Nôbabgônj and মানিকগঞ্জ Manikgônj. Some nonstandard varieties of Bengali make use of final clusters quite often. For example, in some Purbo (eastern) dialects, final consonant clusters consisting of a nasal and its corresponding oral stop are common, as in চান্দ chand "moon". The Standard Bengali equivalent of chand would be চাঁদ chãd, with a nasalized vowel instead of the final cluster.


Main article: Bengali grammar

Bengali nouns are not assigned gender, which leads to minimal changing of adjectives (inflection). However, nouns and pronouns are highly declined (altered depending on their function in a sentence) into four cases while verbs are heavily conjugated.

As a consequence, unlike Hindi, Bengali verbs do not change form depending on the gender of the nouns.

Word order

As a Head-Final language, Bengali follows Subject Object Verb word order, although variations to this theme are common.[48] Bengali makes use of postpositions, as opposed to the prepositions used in English and other European languages. Determiners follow the noun, while numerals, adjectives, and possessors precede the noun.[49]

Yes-no questions do not require any change to the basic word order; instead, the low (L) tone of the final syllable in the utterance is replaced with a falling (HL) tone. Additionally optional particles (e.g. কি -ki, না -na, etc.) are often encliticized onto the first or last word of a yes-no question.

Wh-questions are formed by fronting the wh-word to focus position, which is typically the first or second word in the utterance.


Nouns and pronouns are inflected for case, including nominative, objective, genitive (possessive), and locative.[5] The case marking pattern for each noun being inflected depends on the noun's degree of animacy. When a definite article such as -টা -ţa (singular) or -গুলা -gula (plural) is added, as in the tables below, nouns are also inflected for number.

Singular Noun Inflection
Animate Inanimate
the student
the shoe
the student
the shoe
the student's
the shoe's
on/in the shoe
Plural Noun Inflection
Animate Inanimate
the students
the shoes
the students
the shoes
the students'
the shoes'
on/in the shoes

When counted, nouns take one of a small set of measure words. As in many East Asian languages (e.g. Chinese, Japanese, Thai, etc.), nouns in Bengali cannot be counted by adding the numeral directly adjacent to the noun. The noun's measure word (MW) must be used between the numeral and the noun. Most nouns take the generic measure word -টা -ţa, though other measure words indicate semantic classes (e.g. -জন -jon for humans).

Measure Words
Bengali Bengali transliteration Literal translation English translation
নয়টা গর?Nôe-ţa goruNine-MW cowNine cows
কয়টা বািল?Kôe-ţa balishHow many-MW pillowHow many pillows
অনেকজন েলা?Ônek-jon lokMany-MW personMany people
চার-পাঁচজন িশক্ষ?Char-pãch-jon shikkhôkFour-five-MW teacherFour or five teachers

Measuring nouns in Bengali without their corresponding measure words (e.g. আট বিড়াল aţ biŗal instead of আটটা বিড়াল aţ-ţa biŗal "eight cats") would typically be considered ungrammatical. However, when the semantic class of the noun is understood from the measure word, the noun is often omitted and only the measure word is used, e.g. শুধু একজন থাকবে। Shudhu êk-jon thakbe. (lit. "Only one-MW will remain.") would be understood to mean "Only one person will remain.", given the semantic class implicit in -জন -jon.

In this sense, all nouns in Bengali, unlike most other Indo-European languages, are similar to mass nouns.


Verbs divide into two classes: finite and non-finite. Non-finite verbs have no inflection for tense or person, while finite verbs are fully inflected for person (first, second, third), tense (present, past, future), aspect (simple, perfect, progressive), and honor (intimate, familiar, and formal), but not for number. Conditional, imperative, and other special inflections for mood can replace the tense and aspect suffixes. The number of inflections on many verb roots can total more than 200.

Inflectional suffixes in the morphology of Bengali vary from region to region, along with minor differences in syntax.

Bengali differs from most Indo-Aryan Languages in the zero copula, where the copula or connective be is often missing in the present tense.[10] Thus "he is a teacher" is she shikkhôk, (literally "he teacher").[52] In this respect, Bengali is similar to Russian and Hungarian.


Main article: Bengali vocabulary

Bengali has as many as 100,000 separate words, of which 50,000 (67%) are considered tôtshômo (direct reborrowings from Sanskrit), 21,100 (28%) are tôdbhôbo (derived from Sanskrit words), and the rest being bideshi (foreign) and deshi words.

However, these figures do not take into account the fact that a large proportion of these words are archaic or highly technical, minimizing their actual usage. The productive vocabulary used in modern literary works, in fact, is made up mostly (67%) of tôdbhôbo words, while tôtshômo only make up 25% of the total.[53][54] Deshi and Bideshi words together make up the remaining 8% of the vocabulary used in modern Bengali literature.

Due to centuries of contact with Europeans, Mughals, Arabs, Turks, Persians, Afghans, and East Asians, Bengali has borrowed many words from foreign languages. The most common borrowings from foreign languages come from three different kinds of contact. Close contact with neighboring peoples facilitated the borrowing of words from Hindi, Assamese, Chinese, Burmese, and several indigenous Austroasiatic languages(like Santali).[55] of Bengal. After centuries of invasions from Persia and the Middle East, numerous Persian, Arabic, Turkish, and Pashtun words were absorbed into Bengali. Portuguese, French, Dutch and English words were later additions during the colonial period.


A Bengali poem
A section of the poem Abani Bari Achho by Shakti Chattopadhyay read by a male native speaker.
Problems listening to the file? See media help

See also


1. ^ Languages spoken by more than 10 million people. Encarta Encyclopedia (2007). Retrieved on 2007-03-03.
2. ^ Statistical Summaries. Ethnologue (2005). Retrieved on 2007-03-03.
3. ^ Gordon, Raymond G., Jr. (ed. (2005). Languages of India. Ethnologue: Languages of the World, Fifteenth edition.. SIL International. Retrieved on 2006-11-17.
4. ^ Languages in Descending Order of Strength - India, States and Union Territories - 1991 Census. Census Data Online 1. Office of the Registrar General, India. Retrieved on 2006-11-19.
5. ^
6. ^
7. ^ Abahattha in
8. ^
9. ^ Rahman, Aminur. Grammar. Banglapedia. Asiatic Society of Bangladesh. Retrieved on 2006-11-19.
10. ^ Bangla language in
11. ^ Ray, S Kumar. The Bengali Language and Translation. Translation Articles. Kwintessential. Retrieved on 2006-11-19.
12. ^
13. ^ No alliance with BJP, says AGP chief. The Telegraph. Retrieved on 2006-11-19.
14. ^ [ The World Fact Book]. CIA. Retrieved on 2006-11-04.
15. ^ Languages of India. Ethnologue Report. Retrieved on 2006-11-04.
16. ^ "Sierra Leone makes Bengali official language", Daily Times, December 29, 2002. Retrieved on 2006-11-17. 
17. ^ NIC, Assam State Centre, Guwahati, Assam. Language. Government of Assam. Retrieved on 2006-06-20.
18. ^ Bangla language in
19. ^ Morshed, Abul Kalam Manjoor. Dialect. Banglapedia. Asiatic Society of Bangladesh. Retrieved on 2006-11-17.
20. ^ Hajong. The Ethnologue Report. Retrieved on 2006-11-19.
21. ^ Huq, Mohammad Daniul. Chalita Bhasa. Banglapedia. Asiatic Society of Bangladesh. Retrieved on 2006-11-17.
22. ^ Huq, Mohammad Daniul. Chalita Bhasa. Banglapedia. Asiatic Society of Bangladesh. Retrieved on 2006-11-17.
23. ^ Huq, Mohammad Daniul. Sadhu Bhasa. Banglapedia. Asiatic Society of Bangladesh. Retrieved on 2006-11-17.
24. ^ Huq, Mohammad Daniul. Alaler Gharer Dulal. Banglapedia. Asiatic Society of Bangladesh. Retrieved on 2006-11-17.
25. ^ Morshed, Abul Kalam Manjoor. Dialect. Banglapedia. Asiatic Society of Bangladesh. Retrieved on 2006-11-17.
26. ^ History of Bangla (Banglar itihash). Bangla. Bengal Telecommunication and Electric Company. Retrieved on 2006-11-20.
27. ^ Bangla Script in
28. ^ Bangla Script in
29. ^ Escudero Pascual Alberto (23 October, 2005). Writing Systems/ Scripts (PDF). Primer to Localization of Software. IT +46. Retrieved on 2006-11-20.
30. ^ Bangla language in
31. ^ Bangla language in
32. ^ Islam, Muhammad Ashraful. Sylheti Nagri. Banglapedia. Asiatic Society of Bangladesh. Retrieved on 2006-11-17.
33. ^ Learning International Alphabet of Sanskrit Transliteration. Sanskrit 3 - Learning transliteration. Gabriel Pradiipaka & Andrés Muni. Retrieved on 2006-11-20.
34. ^ ITRANS - Indian Language Transliteration Package. Avinash Chopde. Retrieved on 2006-11-20.
35. ^ Annex-F: Roman Script Transliteration (PDF). Indian Standard: Indian Script Code for Information Interchange - ISCII 32. Bureau of Indian Standards (1 April, 1999). Retrieved on 2006-11-20.
36. ^
37. ^
38. ^
39. ^
40. ^
41. ^
42. ^
43. ^
44. ^
45. ^
46. ^
47. ^
48. ^
49. ^ Bengali. UCLA Language Materials project. University of California, Los Angeles. Retrieved on 2006-11-20.
50. ^
51. ^ Bangla language in
52. ^ Among Bengali speakers brought up in neighbouring linguistic regions (e.g. Hindi), the lost copula may surface in utterances such as she shikkhôk hochchhe. This is viewed as ungrammatical by other speakers, and speakers of this variety are sometimes (humorously) referred as "hochchhe-Bangali".
53. ^ Tatsama in
54. ^ Tatbhava in
55. ^ Byomkes Chakrabarti A Comparative Study of Santali and Bengali, K.P. Bagchi & Co., Kolkata, 1994, ISBN 8170741289


  • Haldar, Gopal (2000), Languages of India, National Book Trust, India, ISBN 81-237-2936-7.
  • Alam, M (2000), Bhasha Shourôbh: Bêkorôn O Rôchona (The Fragrance of Language: Grammar and Rhetoric), S. N. Printers, Dhaka.
  • Chakrabarti, Byomkes, A Comparative Study of Santali and Bengali, K.P. Bagchi & Co., Kolkata, 1994, ISBN 8170741289 Byomkes Chakrabarti
  • Asiatic Society of Bangladesh (2003), Banglapedia, the national encyclopedia of Bangladesh, Asiatic Society of Bangladesh, Dhaka.
  • Cardona, G & D Jain (2003), The Indo-Aryan languages, RoutledgeCurzon, London.
  • Chatterji, SK (1921), "Bengali Phonetics", Bulletin of the School of Oriental and African Studies.
  • Chatterji, SK (1926), The Origin and Development of the Bengali Language.
  • Ferguson, CA & M Chowdhury (1960), "The Phonemes of Bengali", Language, 36(1), Part 1.
  • Hayes, B & A Lahiri (1991), "Bengali intonational phonology", Natural Language & Linguistic Theory.
  • Klaiman, MH (1987), "Bengali", in Bernard Comrie, The World's Major Languages, Croon Helm, London and Sydney, ISBN 0195065115.

  • Masica, C (1991), The Indo-Aryan Languages, Cambridge Univ. Press.
  • Radice, W (1994), Teach Yourself Bengali: A Complete Course for Beginners, NTC/Contemporary Publishing Company, ISBN 0844237523.
  • Ray, P; MA Hai & L Ray (1966), Bengali language handbook, Center for Applied Linguistics, Washington, ISBN ASIN B000B9G89C.
  • Sen, D (1996), Bengali Language and Literature, International Centre for Bengal Studies, Calcutta.
  • Bhattacharya, T (2000), "Bangla (Bengali)", in Gary, J. and Rubino. C., Encyclopedia of World's Languages: Past and Present (Facts About the World's Languages), WW Wilson, New York, ISBN 0824209702.
  • Baxter, C (1997), Bangladesh, From a Nation to a State, Westview Press, ISBN 0813336325.

External links

Bengali people are the ethnic community from Bengal (divided between India and Bangladesh) on the Indian subcontinent with a history dating back four millennia. They speak Bengali (বাংলা Bangla
..... Click the link for more information.
This page is currently protected from editing until disputes have been resolved.
Protection is not an endorsement of the current [ version] ([ protection log]).
..... Click the link for more information.
Amar Shonar Bangla
My Golden Bengal

(and largest city) Dhaka

..... Click the link for more information.
South Asia, also known as Southern Asia, is a southern geopolitical region of the Asian continent comprising territories on and in proximity to the Indian subcontinent. It is surrounded by (from west to east) Western Asia, Central Asia, Eastern Asia, and Southeastern Asia.
..... Click the link for more information.
This is a list of languages, ordered by the number of native-language speakers, with some data for second-language use. Languages are listed for secondary locations only when spoken by more than 1% of the population.
..... Click the link for more information.
A language family is a group of languages related by descent from a common ancestor, called the proto-language. As with biological families, the evidence of relationship is observable shared characteristics.
..... Click the link for more information.
Indo-Iranian language group constitutes the easternmost extant branch of the Indo-European family of languages. It consists of four language groups: the Indo-Aryan, Iranian, Nuristani, and Dardic.
..... Click the link for more information.
Indo-Aryan languages form a subgroup of the Indo-Iranian languages, which belong to the Indo-European family of languages. The term "Indic" refers to the same group without what some see as the negative connotations of "Aryan".
..... Click the link for more information.
Eastern Indo-Aryan languages include some 210 (SIL estimate) languages and dialects spoken by many people in Asia; this language group is a part of the Indo-Aryan language branch of the Indo-European language family.
..... Click the link for more information.
writing system is a type of symbolic system used to represent elements or statements expressible in language.

General properties

Writing systems are distinguished from other possible symbolic communication systems in that one must usually understand something of the
..... Click the link for more information.
Bengali abugida

ISO 15924 Beng

Note: This page may contain IPA phonetic symbols in Unicode.
The Bengali script (Bengali: বাংলা লিপি Bangla lipi
..... Click the link for more information.
This page is currently protected from editing until disputes have been resolved.
Protection is not an endorsement of the current [ version] ([ protection log]).
..... Click the link for more information.
Coordinates: West Bengal (Bengali: পশ্চিমবঙ্গ Poshchimbôŋgo) is a state in eastern India.
..... Click the link for more information.
Tripura   (Bengali script: ত্রিপুরা) is a state in North-East India.
..... Click the link for more information.
This is a list of bodies that regulate standard languages.

Afrikaans Die Taalkommissie, South Africa
Arabic Academy of the Arabic Language (مجمع اللغة العربية, Syria, Egypt, Jordan,
..... Click the link for more information.
Bangla Academy, established on 3 December 1955, is the national academy for promoting Bangla language in Bangladesh. The main office of the organisation is located at the Burdwan House on the campus of the University of Dhaka, beside Ramna Park.
..... Click the link for more information.
Paschimbanga Bangla Akademi (Bengali: পশ্চিমবঙ্গ বাংলা আকাদেমি), or West Bengal Bangla Academy, established on 20 May 1986, is the main academy for
..... Click the link for more information.
ISO 639-1 is the first part of the ISO 639 international-standard language-code family. It consists of 136 two-letter codes used to identify the world's major languages. These codes are a useful international shorthand for indicating languages.
..... Click the link for more information.
ISO 639-2 is the second part of the ISO 639 standard, which lists codes for the representation of the names of languages. The three-letter codes given for each language in this part of the standard are referred to as "Alpha-3" codes. There are 464 language codes in the list.
..... Click the link for more information.
ISO 639-3 is an international standard for language codes. It extends the ISO 639-2 alpha-3 codes with an aim to cover all known natural languages. The standard was published by ISO on 5 February 2007[1].
..... Click the link for more information.
International Phonetic Alphabet

Note: This page may contain IPA phonetic symbols in Unicode.

The International
Phonetic Alphabet
Nonstandard symbols
Extended IPA
Naming conventions
IPA for English The
..... Click the link for more information.
Indo-Aryan languages form a subgroup of the Indo-Iranian languages, which belong to the Indo-European family of languages. The term "Indic" refers to the same group without what some see as the negative connotations of "Aryan".
..... Click the link for more information.

A language is a system of symbols and the rules used to manipulate them. Language can also refer to the use of such systems as a general phenomenon.
..... Click the link for more information.
Indian subcontinent is a large section of the Asian continent consisting of countries lying substantially on the Indian tectonic plate. These include countries on the continental crust— India, Pakistan, Bangladesh and parts of Afghanistan, Nepal and Bhutan, island countries
..... Click the link for more information.
Magadhi Prakrit is of one of the three Dramatic Prakrits, the written languages of Ancient India after the decline of Sanskrit as an official language. Magadhi Prakrit was spoken in the eastern Indian Subcontinent, in a region spanning what is now eastern India, Bangladesh, and
..... Click the link for more information.
Sanskrit}}}  | style="padding-left: 0.5em;" | Writing system: | colspan="2" style="padding-left: 0.5em;" | Devanāgarī and several other Brāhmī-based scripts  ! colspan="3" style="text-align: center; color: black; background-color: lawngreen;"|Official
..... Click the link for more information.
South Asia, also known as Southern Asia, is a southern geopolitical region of the Asian continent comprising territories on and in proximity to the Indian subcontinent. It is surrounded by (from west to east) Western Asia, Central Asia, Eastern Asia, and Southeastern Asia.
..... Click the link for more information.
Bengal (Bengali: বঙ্গ Bôngo, বাংলা Bangla, বঙ্গদেশ Bôngodesh or বাংলাদেশ Bangladesh
..... Click the link for more information.
Amar Shonar Bangla
My Golden Bengal

(and largest city) Dhaka

..... Click the link for more information.
This page is currently protected from editing until disputes have been resolved.
Protection is not an endorsement of the current [ version] ([ protection log]).
..... Click the link for more information.

This article is copied from an article on - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.