Diacritic

From MedBib.com - Medicine & Nature

Example of the letter a with a diacritic
Diacritical marks

accent

acute accent ( ´ )
double acute accent ( ˝ )
grave accent ( ` )
double grave accent (  ̏ )

breve ( ˘ )
caron / háček ( ˇ )
cedilla ( ¸ )
circumflex ( ^ )
diaeresis / umlaut ( ¨ )
dot ( · )

anunaasika ( ˙ )
anusvara (  ̣ )
chandrabindu (   ँ   ঁ   ઁ   ଁ ఁ )

hook / dấu hỏi (  ̉ )
horn / dấu móc (  ̛ )
macron ( ¯ )
ogonek ( ˛ )
ring / kroužek ( ˚, ˳ )
rough breathing / spiritus asper (    )
smooth breathing / spiritus lenis (  ᾿  )

Marks sometimes used as diacritics

apostrophe ( )
bar ( | )
colon ( : )
comma ( , )
hyphen ( ˗ )
tilde ( ~ )
titlo (  ҃ )

A diacritic (/daɪəˈkrɪtɪk/), also called a diacritical mark, point, or sign, is a small sign added to a letter to alter pronunciation or to distinguish between similar words. The term derives from the Greek διακριτικός (diakritikós, "distinguishing"). "Diacritic" is both adjective and noun, whereas "diacritical" is only an adjective. Some diacritical marks are often called accents; that applies to the grave and acute accents, but not the cedilla.

A diacritical mark can appear above or below a letter or in some other position. Its main usage is to change the phonetic value of the letter to which it is added; but it may also be used to modify the pronunciation of a whole word or syllable, like the tone marks of tonal languages, to distinguish between homographs, to make abbreviations, such as the titlo in old Slavic texts, or to change the meaning of a letter, such as denoting numerals in numeral systems like early Greek numerals.

A letter which has been modified by a diacritic may be treated either as a new, distinct letter or as a letter-diacritic combination in orthography and collation. This varies from language to language and, in some cases, from symbol to symbol within a single language.

Contents

Types of diacritic

The types of diacritic include:

Some of these marks are sometimes diacritics, but also have other uses: that applies to the tilde, dot, comma, titlo, apostrophe, bar, and colon.

The dot on the letter i of the Latin alphabet originated as a diacritic to clearly distinguish i from the vertical strokes of adjacent letters. It first appeared in the sequence ii (as in ingeníí) in Latin manuscripts of the 11th century, then spread to i adjacent to m, n, u before being used for all lower-case i. The j, which separated from the i later, inherited the "dot". The shape of the diacritic developed from initially resembling today's acute accent to a long flourish by the 15th century. With the advent of Roman type it was reduced to the round dot we have today.[1]

The ISO basic Latin alphabet
Aa Bb Cc Dd Ee Ff Gg Hh Ii Jj Kk Ll Mm Nn Oo Pp Qq Rr Ss Tt Uu Vv Ww Xx Yy Zz

history palaeography derivations diacritics punctuation numerals Unicode list of letters

Diacritics specific to non-Latin alphabets

Arabic

Further information: Arabic alphabet

Greek

Further information: Greek diacritics

Various combinations of accents with other diacritics and breathings.

Hebrew

Further information: Hebrew alphabet

Non-alphabetic scripts

Some non-alphabetic scripts also employ symbols that function essentially as diacritics.

Alphabetization or collation

Main article: Collation

Different languages use different rules to put diacritic characters in alphabetical order. French treats letters with diacritical marks the same as the underlying letter for purposes of ordering and dictionaries.

The Scandinavian languages, by contrast, treat the characters with diacritics ä, ö and å as new and separate letters of the alphabet, and sort them after z. Usually ä is sorted as equal to æ (ash) and ö is sorted as equal to ø (o-slash). Also, aa, when used as an alternative spelling to å, is sorted as such. Other letters modified by diacritics are treated as variants of the underlying letter, with the exception that ü is frequently sorted as y.

Languages that treat accented letters as variants of the underlying letter usually alphabetize words with such symbols immediately after similar unmarked words. For instance, in German where two words differ only by an umlaut, the word without it is sorted first in German dictionaries (e.g. schon and then schön, or fallen and then fällen). However, when names are concerned (e.g. in phone books or in author catalogues in libraries), umlauts are often treated as combinations of the vowel with a suffixed e; Austrian phone books now treat characters with umlauts as separate letters (immediately following the underlying vowel).

In Spanish, the grapheme ñ is considered a new letter different from n and collated between n and o, as it denotes a different sound from that of a plain n. But the accented vowels á, é, í, ó, ú are not separated from the unaccented vowels a, e, i, o, u as the acute accent in Spanish only modifies stress within the word, not the sound of a letter.

For a comprehensive list of the collating orders in various languages, see Collating sequence.

Generation with computers

Modern computer technology was developed mostly in the English-speaking countries, so data formats, keyboard layouts, etc., were developed with a bias favoring English, a language with a "simple" alphabet, one without diacritical marks. This has led to fears internationally that the marks and accents may be made obsolete to facilitate the worldwide exchange of data.[citation needed] Efforts have been made to create internationalized domain names that further extend the English alphabet (e.g., "pokémon.com").

Depending on the keyboard layout, which differs amongst countries, it is more or less easy to enter letters with diacritics on computers and typewriters. Some have their own keys; some are created by first pressing the key with the diacritic mark followed by the letter to place it on. Such a key is sometimes referred to as a dead key, as it produces no output of its own, but modifies the output of the key pressed after it.

In modern Microsoft Windows operating systems, the keyboard layout US International allows one to type almost all diacritics directly: "+e gives ë, ~+o gives õ, etc. On Apple Macintosh computers, there are keyboard shortcuts for the most common diacritics; Option-e followed by a vowel places an acute accent, Option-u followed by a vowel gives an umlaut, option-c gives a cedilla, etc. Diacritics can be composed in most X Window System keyboard layouts.

On computers, the availability of code pages determines whether one can use certain diacritics. Unicode solves this problem by assigning every known character its own code; if this code is known, most modern computer systems provide a method to input it. With Unicode, it is also possible to combine diacritical marks with most characters.

Languages with letters containing diacritics

The following languages have letters which contain diacritics.

Germanic and Celtic
Romance
Slavic
Baltic
Finno-Ugric
Turkic
Other
Cyrillic alphabets

Diacritics that do not produce new letters

Blackboard used in class at Harvard shows students' efforts at placing the ü and acute accent diacritic used in Spanish orthography.

English is one of the few European languages that does not regularly use diacritical marks. Exceptions are unassimilated foreign loanwords, including borrowings from French and increasingly Spanish; however, the diacritic is also often omitted from such words. Loanwords that frequently appear with the diacritic in English include café, résumé (a usage that helps distinguish it from the verb resume, though the former is often miswritten resumé [sic]), and naïveté (see List of English words with diacritics). In older practice (and even among some orthographically conservative modern writers) one may see examples such as élite and rôle.

English once used the diaeresis more often than not in words such as coöperation (from Fr. coopération) and zoölogy (from Lat. zoologia), but this practice has become far less common (The New Yorker's house style is one of the few major publications to retain this feature, and various individual writers still use it). English has also added a diacritic to a common spelling of maté (from Sp. and Port. mate).

A few English words can only be distinguished from others by a diacritic or modified letter, including animé, exposé, lamé, maté, öre, øre, pâté, piqué, rosé. The same is true of résumé, alternately resumé, but nevertheless it is often spelled resume in the US, and saké, which is more commonly spelled sake.

The acute and grave accents are occasionally used in poetry and lyrics: the acute to indicate stress overtly where it might be ambiguous (rébel vs. rebél) or nonstandard for metrical reasons (caléndar), the grave to indicate that an ordinarily silent or elided syllable is pronounced (warnèd, parlìament). In certain personal names such as Renée and Zoë, the diacritical marks are included more often than omitted.

Other languages

The following languages have letter-diacritic combinations that are not considered independent letters.

Transliteration

Several languages which are not written with the Roman alphabet are transliterated, or romanized, using diacritics. Examples:

See also

External links