Portuguese orthography

From MedBib.com - Medicine & Nature

The orthography of Portuguese is based on the Latin alphabet, and makes use of the acute accent, the circumflex accent, the grave accent, the tilde, and the cedilla, to denote stress, vowel height, nasalization, and other sound changes. Brazilian Portuguese also uses the diaeresis mark. Accented letters and digraphs are not counted as separate characters for collation purposes.

A full list of sounds, diphthongs, and their main spellings, is given at Portuguese phonology. For the main values of each letter and digraph, see Portuguese alphabet. This article addresses the less trivial details of the spelling of Portuguese, as well as other issues of orthography, such as accentuation.

This article contains IPA phonetic symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Unicode characters.

Contents

Overview

The spelling of Portuguese is largely phonemic, but some phonemes can be spelled in more than one way. In ambiguous cases, the correct spelling is determined through a combination of etymology with morphology and common tradition, so there is not a perfect one-to-one correspondence between sounds and letters or digraphs. Knowing the main inflectional paradigms of Portuguese, and being acquainted with the orthography of other languages of Western Europe, can be helpful in this regard.

In the remainder of this article, the phrase "at the end of a syllable" can be understood as "before a consonant, or at the end of a word".

Sounds with more than one spelling

Most consonants have the same values as in the International Phonetic Alphabet, except for the palatals /ʎ/ and /ɲ/, which are spelled lh and nh, respectively, and the following velars, rhotics, and sibilants:

Velar plosives

Phoneme Default Before e or i
Spelling Examples Spelling Examples
/k/ c, q1 casa, quatro qu quente, aqui
/g/ g gato, grão gu guerra, guitarra
1 in some latinisms

Rhotics

The alveolar tap /ɾ/ is always spelled as a single r. The other rhotic phoneme of Portuguese, which may be pronounced as a trill /r/ or as one of the fricatives /x/, /ʁ/, or /h/, according to the idiolect of the speaker, is either written rr or r, as described below. For this phoneme, the phrase "at the start of a syllable" can be understood as "at the beginning of words and after /l/, /n/, /z/, /ʒ/".

Phoneme Start of syllable1 Between vowels End of syllable2
Spelling Examples Spelling Examples Spelling Examples
/ʁ ~ x ~ h ~ r/ r rosa, tenro rr carro r sorte, mar
1 only when it is the first sound in the syllable (in which case it is always followed by a vowel). For instance, a word like prato is pronounced with a tap, /ɾ/
2 in some dialects; in the others, the r is usually a tap at the end of syllables

Sibilants

For the following phonemes, the phrase "at the start of a syllable" can be understood as "at the start of a word, or between a consonant and a vowel, in that order".

Phoneme Start of syllable1 Between vowels End of syllable
Spelling Examples Spelling Examples Spelling Examples
/s/ s, ç2, c3 sapo, psique,
maçã, cedo
ss, ç2, c3, x4 assado, passe,
açorda, alperce
próximo
s, x5, z6 isto,
externo,
paz
/z/ z zumbido,
zero
z, s, x7 prazo, azeite,
rosa, Brasil,
exemplo
s, x8, z8 turismo,
ex-mulher,
felizmente
/ʃ/ ch, x chuva, cherne,
xarope, xisto
ch, x fecho, duche,
caixa, mexilhão
s, x5, z6 isto,
externo,
paz
/ʒ/ j, g3 jogo, jipe,
gente
j, g3 ajuda, pajem,
agenda
s, x8, z8 turismo,
ex-mulher,
felizmente
1 including consonant clusters that belong to a single syllable, like psique
2 before a, o, u
3 before e, i
4 only in a very small number of words derived from Latin, such as trouxe and próximo
5 only in words derived from Latin or Greek, preceded by e and followed by one of the voiceless consonants c, p, s, t
6 only at the end of words and in rare compounds
7 only in a few words derived from Latin or Greek that begin with ex- or hex- followed by a vowel, and in compounds made from such words
8 only in a few compound words

Note that there are two main groups of accents in Portuguese, one in which the sibilants are alveolar at the end of syllables (/s/ or /z/), and another in which they are postalveolar (/ʃ/ or /ʒ/). In this position, the sibilants occur in complementary distribution, voiced before voiced consonants, and voiceless before voiceless consonants or at the end of utterances.

Nasal vowels and diphthongs

Nasal vowels are normally indicated with a tilde before other vowels. Before consonants, they are usually spelled by writing a silent m or n next to the vowel itself. At the end of words, most nasal vowels are indicated by a trailing silent m. The low central vowel /ɐ̃/, however, is spelled ã at the end of words, before word-final s, and in compounds.

Monophthongs Diphthongs
Sound Spelling Sound Spelling
Before a consonant Word final
/ɐ̃/ an, am, ân, âm -ã-1 /ɐ̃ĩ/ ãe, ãi2
/ẽ/ en, em, ên, êm - /ɐ̃ũ/ ão, -am
/ĩ/ in, im, ín, ím -im /ɐ̃ĩ ~ ẽĩ/ -em, -ém, -en-, -én-
/õ/ on, om, ôn, ôm -om /ɐ̃ĩ ~ ẽĩ/ -êm
/ũ/ un, um, ún, úm -um /õĩ/ õe
1 also before word-final s
2 the spelling ãi appears in non-final syllables, in a small number of words such as cãibra

The letter m is conventionally written before b or p or at the end of words (also in a few compound words such as comummente), and n is written before other consonants. In the plural, the ending -m changes into -ns; for example bem, rim, bom, umbens, rins, bons, uns. Some learned words end with -n (which is usually pronounced in European Portuguese).

Nasalization of u is left unmarked in the six words muito, muita, muitos, muitas, mui, ruim (the latter one only in Brazilian Portuguese).

The word endings -am, -em, -en(+s), with or without an accent mark on the vowel, represent nasal diphthongs derived from various Latin endings, often -ant, -unt or -en(t)-. Final -am, which appears in polysyllabic verbs, is always unstressed. The grapheme -en- is also pronounced as a nasal diphthong in a few compound words, such as bendito (bem + dito), homenzinho (homem + zinho), and Benfica.

Semivowels

The semivowel in an oral (falling) diphthong is spelled with i or u (ai, ei, oi, ui; au, eu, iu, ou). Nasal diphthongs are spelled with e or o at the end of words (-ãe, -ão, -õe; -ães, -ãos, -ões).

Morphological considerations

Verbs whose infinitive ends in -jar have j in the whole conjugation. Thus, viagem "voyage" (noun), but viajem (3rd. pers. plur. of the present subjunctive of the verb viajar "to travel").

Verbs whose thematic vowel becomes a stressed i in one of their inflections are spelled with an i in the whole conjugation, as are other words of the same family. Thus, crio (I create) implies criar (to create) and criatura (creature).

Verbs whose thematic vowel becomes a stressed ei in one of their inflections are spelled with an e in the whole conjugation, as are other words of the same family. Thus, nomeio (I nominate) implies nomear (to nominate) and nomeação (nomination).

Etymological considerations

The majority of the Portuguese lexicon is derived from Latin, Greek, or Arabic, so in principle this would require some knowledge of those languages. However, Greek words are Latinized before being incorporated into the language, and many words of Latin or Greek origin have easily recognizable cognates in English and other western European languages which are spelled according to similar principles. For instance, glória, "glory", glorioso, "glorious", herança "inheritance", real "real/royal". Some general guidelines for spelling are given below:

Diacritics

Portuguese makes use of six diacritics.

The cedilha indicates that ç is pronounced /s/ (due to a historical palatalization). By convention, s is written instead of etymological ç at the beginning of words.

The acute accent and the circumflex accent indicate that a vowel is stressed, and also the quality of the accented vowel, more precisely its height: á, é, and ó are low vowels (except in the word endings -ém, -éns), while â, ê, and ô are high vowels. They also distinguish a few homographs: cf. para "for" with pára "it stops", por "by" with pôr "to put".

The tilde marks nasal vowels before other vowels, at the end of words, before final -s, and in some compounds, like romãzeira "pomegranate tree", from romã "pomegranate", and vãmente "vainly", from "vain". It usually coincides with the stressed vowel, unless there is an acute or circumflex accent elsewhere in the word, or if the word is compound, e.g. órgão "organ", irmã + zinha ("sister" + diminutive suffix) = irmãzinha "little sister".

The grave accent marks the contraction of two consecutive vowels in adjacent words (crasis), normally the preposition a and an article or a demonstrative pronoun: a + aquela = àquela "at that", a + a = à "at the", and so on. It does not indicate stress.

The trema indicates that the letter u is pronounced, exceptionally, in the graphemes gue, gui, que, qui; e.g. agüentar "to bear", freqüência "frequency", argüição "questioning", qüinqüelíngüe "in five languages" (conjectured to be the Portuguese word with most diacritics). It is only used in Brazil, and increasingly omitted (some newspapers and many people don't use it).

The graphemes â, ê and ô typically represent oral vowels, but before m or n followed by another consonant they are nasalized. Elsewhere, nasal vowels are indicated with a tilde (ã, õ).

Stress

Below are the general rules for the use of the acute accent and the circumflex in Portuguese. Primary stress may fall on any of the three final syllables of a word, but occurs mainly on the last two. A word is called oxytone if it is stressed on its last syllable, paroxytone if stress falls on the syllable before the last (the penult), and proparoxytone if stress falls on the third syllable from the end (the antepenult). Most words are stressed on the penult.

All words stressed on the antepenult take an accent mark. Words with two or more syllables, stressed on their last or their next-to-last syllable, may or may not be accented, depending on their ending, on whether they are composed of one syllable or more, and on whether their stressed vowel belongs to a rising diphthong or not.

Monosyllables

Monosyllables are typically not accented, but those whose last vowel is a, e, or o, possibly followed by final -s or final -m, may require an accent mark.

Polysyllables

Aside from these cases, there are a few more words that take an accent, usually to disambiguate frequent homographs such as pode (present tense of the verb poder) and pôde (past tense of the same verb), or para (preposition) and pára (verb). Note also that the accentuation rules of Portuguese are somewhat different from those of Spanish.

Syllabification and collation

Portuguese syllabification rules require a syllable break between double letters cc, , mm, nn, rr, ss, or other combinations of letters which may be pronounced as a single sound, e.g. fric-ci-o-nar, pro-ces-so, car-ro, ex-ce(p)-to, ex-su-dar. Only the digraphs ch, lh, nh, gu, qu, and ou are indivisible. All digraphs are however broken down into their constituent letters for the purposes of collation, aloud spelling, and in crossword puzzles.

Other symbols

Apostrophe

The apostrophe (') appears as part of certain phrases, usually to indicate the elision of a vowel in the contraction of a preposition with the word that follows it: de + água = d'água. It is used almost exclusively in poetry.

Hyphen

The hyphen (-) is used to make compound words, especially animal names like papagaio-de-rabo-vermelho "red-tailed parrot". It is also extensively used to append clitic pronouns to the verb, as in quero-o "I want it" (enclisis), or even to embed them within the verb, as in levaria + te + os = levar-tos-ia "I would take them to you" (mesoclisis). Proclitic pronouns are not connected graphically to the verb: não o quero "I do not want it". Each element in such compounds is treated as an individual word for accentuation purposes.

Quotation marks

In European Portuguese, as in many other European languages, angular quotation marks are used for general quotations in literature:

«Isto é um exemplo de como fazer uma citação em português europeu.»
"This is an example of how to make a quotation in European Portuguese."

Although American-style (“…”) or British-style ('...') quotation marks are sometimes used as well, especially in less formal types of writing (due to being more easily produced in keyboards) or inside nested quotations, they are less common in careful writing. In Brazilian Portuguese, only American and British-style quote marks are used.

"Isto é um exemplo de como fazer uma citação em português brasileiro."
"This is an example of how to make a quotation in Brazilian Portuguese."

In both varieties of the language, dashes are normally used for direct speech, rather than quotation marks:

― Aborreço-me tanto ― disse ela.
― Não tenho culpa disso ― retorquiu ele.
“I’m so bored,” she said.
“That’s not my fault,” he shot back.

Brazilian vs. European spelling

Written varieties
Portugal/Africa/Asia Brazil translation
Different pronunciation
anónimo anônimo anonymous
Vénus Vênus Venus
facto fato fact
ideia idéia idea
Silent consonants
acção ação action
direcção direção direction
eléctrico elétrico electric
óptimo ótimo very good
Diacritics
frequente freqüente frequent
voo vôo flight
Further information: Spelling reforms of Portuguese

As of 2005, Portuguese has two orthographic standards:

In East Timor, both orthographies are currently being taught in schools.

The table to the right illustrates typical differences between the two orthographies. Some are due to different pronunciations, but others are merely graphic. The main ones are:

See also

References

External links