C



C is the third letter in the English alphabet and a letter of the alphabets of many other writing systems which inherited it from the Latin alphabet. It is also the third letter of the ISO basic Latin alphabet. It is named cee (pronounced ) in English.

History
"C" comes from the same letter as "G". The Semites named it gimel. The sign is possibly adapted from an Egyptian hieroglyph for a staff sling, which may have been the meaning of the name gimel. Another possibility is that it depicted a camel, the Semitic name for which was gamal. Barry B. Powell, a specialist in the history of writing, states "It is hard to imagine how gimel = "camel" can be derived from the picture of a camel (it may show his hump, or his head and neck!)".

In the Etruscan language, plosive consonants had no contrastive voicing, so the Greek 'Γ' (Gamma) was adopted into the Etruscan alphabet to represent. Already in the Western Greek alphabet, Gamma first took a  form in Early Etruscan, then  in Classical Etruscan. In Latin it eventually took the '' form in Classical Latin. In the earliest Latin inscriptions, the letters '' were used to represent the sounds and  (which were not differentiated in writing). Of these,  was used to represent or  before a rounded vowel,  before , and  elsewhere. During the 3rd century BC, a modified character was introduced for, and '' itself was retained for. The use of  (and its variant ) replaced most usages of  and . Hence, in the classical period and after,  was treated as the equivalent of Greek gamma, and  as the equivalent of kappa; this shows in the romanization of Greek words, as in 'ΚΑΔΜΟΣ', 'ΚΥΡΟΣ', and 'ΦΩΚΙΣ' came into Latin as ,  and '', respectively.

Other alphabets have letters homoglyphic to 'c' but not analogous in use and derivation, like the Cyrillic letter Es (С, с) which derives from the lunate sigma, named due to its resemblance to the crescent moon.

Later use
When the Roman alphabet was introduced into Britain, $⟨c⟩$ represented only, and this value of the letter has been retained in loanwords to all the insular Celtic languages: in Welsh, Irish, Gaelic, $⟨c⟩$ represents only. The Old English Latin-based writing system was learned from the Celts, apparently of Ireland; hence $⟨c⟩$ in Old English also originally represented ; the Modern English words kin, break, broken, thick, and seek, all come from Old English words written with $⟨c⟩$: cyn, brecan, brocen, þicc, and séoc. But during the course of the Old English period, before front vowels ( and ) were palatalized, having changed by the tenth century to, though $⟨c⟩$ was still used, as in cir(i)ce, wrecc(e)a. On the continent, meanwhile, a similar phonetic change had also been going on (for example, in Italian).

In Vulgar Latin, became palatalized to  in Italy and Dalmatia; in France and the Iberian peninsula, it became. Yet for these new sounds $⟨⟩$ was still used before the letters $⟨e⟩$ and $⟨i⟩$. The letter thus represented two distinct values. Subsequently, the Latin phoneme (spelled $⟨⟩$) de-labialized to  meaning that the various Romance languages had  before front vowels. In addition, Norman used the letter $⟨k⟩$ so that the sound could be represented by either $⟨k⟩$ or $⟨c⟩$, the latter of which could represent either  or  depending on whether it preceded a front vowel letter or not. The convention of using both $⟨c⟩$ and $⟨k⟩$ was applied to the writing of English after the Norman Conquest, causing a considerable re-spelling of the Old English words. Thus while Old English candel, clif, corn, crop, cú, remained unchanged, Cent, cæ´ᵹ (cé´ᵹ), cyng, brece, séoce, were now (without any change of sound) spelled 'Kent', 'keȝ', 'kyng', 'breke', and 'seoke'; even cniht ('knight') was subsequently changed to 'kniht' and þic ('thick') changed to 'thik' or 'thikk'. The Old English 'cw' was also at length displaced by the French 'qu' so that the Old English cwén ('queen') and cwic ('quick') became Middle English 'quen' 'quik', respectively. The sound, to which Old English palatalized had advanced, also occurred in French, chiefly from Latin  before 'a'. In French it was represented by the digraph $⟨ch⟩$, as in champ (from Latin camp-um) and this spelling was introduced into English: the Hatton Gospels, written about 1160, have in Matt. i-iii, child, chyld, riche, mychel, for the cild, rice, mycel, of the Old English version whence they were copied. In these cases, the Old English $⟨c⟩$ gave place to $⟨k qu ch⟩$ but, on the other hand, $⟨c⟩$ in its new value of came in largely in French words like processiun, emperice, grace, and was also substituted for 'ts' in a few Old English words, as miltse, bletsien, in early Middle English milce, blecien. By the end of the thirteenth century both in France and England, this sound de-affricated to ; and from that time $⟨c⟩$ has represented  before front vowels either for etymological reasons, as in lance, cent, or to avoid the ambiguity due to the "etymological" use of $⟨s⟩$ for, as in ace, mice, once, pence, defence.

Thus, to show etymology, English spelling has advise, devise (instead of advize, devize), while advice, device, dice, ice, mice, twice, etc., do not reflect etymology; example has extended this to hence, pence, defence, etc., where there is no etymological reason for using $⟨c⟩$. Former generations also wrote sence for sense. Hence, today the Romance languages and English have a common feature inherited from Vulgar Latin spelling conventions where $⟨c⟩$ takes on either a "hard" or "soft" value depending on the following letter.

English
In English orthography, $⟨c⟩$ generally represents the "soft" value of before the letters $⟨e⟩$ (including the Latin-derived digraphs $⟨ae⟩$ and $⟨oe⟩$, or the corresponding ligatures $⟨æ⟩$ and $⟨œ⟩$), $⟨i⟩$, and $⟨y⟩$, and a "hard" value of  before any other letters or at the end of a word. However, there are a number of exceptions in English: "soccer" and "Celt" are words that have where  would be expected.

The "soft" $⟨c⟩$ may represent the sound in the digraph $⟨ci⟩$ when this precedes a vowel, as in the words 'delicious' and 'appreciate'.

The digraph $⟨ch⟩$ most commonly represents, but can also represent (mainly in words of Greek origin) or  (mainly in words of French origin). For some dialects of English, it may also represent in words like loch, while other speakers pronounce the final sound as. The trigraph $⟨tch⟩$ always represents.

The digraph $⟨ck⟩$ is often used to represent the sound  after short vowels.

Other languages
In the Romance languages French, Spanish, Italian, Romanian and Portuguese, $⟨c⟩$ generally has a "hard" value of and a "soft" value whose pronunciation varies by language. In French, Portuguese, Catalan and Spanish from Latin America and southern Spain, the soft $⟨c⟩$ value is as it is in English. In the Spanish spoken in northern and central Spain, the soft $⟨c⟩$ is a voiceless dental fricative. In Italian and Romanian, the soft $⟨c⟩$ is.

All Balto-Slavic languages that use the Latin alphabet, as well as Albanian, Hungarian, Pashto, several Sami languages, Esperanto, Ido, Interlingua, and Americanist phonetic notation (and those aboriginal languages of North America whose practical orthography derives from it) use $⟨c⟩$ to represent, the voiceless alveolar or voiceless dental sibilant affricate. In romanized Mandarin Chinese, the letter represents an aspirated version of this sound,.

Among non-European languages that have adopted the Latin alphabet, $⟨c⟩$ represents a variety of sounds. Yup'ik, Indonesian, Malay, and a number of African languages such as Hausa, Fula, and Manding share the soft Italian value of. In Azeri, Crimean Tatar, Kurmanji Kurdish, and Turkish $⟨c⟩$ stands for the voiced counterpart of this sound, the voiced postalveolar affricate. In Yabem and similar languages, such as Bukawa, $⟨c⟩$ stands for a glottal stop. Xhosa and Zulu use this letter to represent the click. In some other African languages, such as Beninese Yoruba, $⟨c⟩$ is used for. In Fijian, $⟨c⟩$ stands for a voiced dental fricative, while in Somali it has the value of.

The letter $⟨c⟩$ is also used as a transliteration of Cyrillic $⟨ц⟩$ in the Latin forms of Serbian, Macedonian, and sometimes Ukrainian, along with the digraph $⟨ts⟩$.

Other systems
As a phonetic symbol, lowercase $⟨⟩$ is the International Phonetic Alphabet (IPA) and X-SAMPA symbol for the voiceless palatal plosive, and capital $⟨C⟩$ is the X-SAMPA symbol for the voiceless palatal fricative.

Digraphs
There are several common digraphs with $⟨c⟩$, the most common being $⟨ch⟩$, which in some languages (such as German) is far more common than $⟨c⟩$ alone. $⟨ch⟩$ takes various values in other languages.

As in English, $⟨ck⟩$, with the value, is often used after short vowels in other Germanic languages such as German and Swedish (but some other Germanic languages use $⟨kk⟩$ instead, such as Dutch and Norwegian). The digraph $⟨cz⟩$ is found in Polish and $⟨cs⟩$ in Hungarian, both representing. The digraph $⟨sc⟩$ represents in Old English, Italian, and a few languages related to Italian (where this only happens before front vowels, while otherwise it represents ). The trigraph $⟨sch⟩$ represents in German.

Ancestors, descendants and siblings

 * 𐤂 : Semitic letter Gimel, from which the following symbols originally derive
 * Γ γ : Greek letter Gamma, from which C derives
 * G g : Latin letter G, which is derived from Latin C
 * Phonetic alphabet symbols related to C:
 * : Small c with curl
 * ʗ : ʗ|stretched C
 * ᶜ : Modifier letter small c
 * ᶝ : Modifier letter small c with curl
 * ᴄ : Small capital c is used in the Uralic Phonetic Alphabet.
 * C with diacritics: Ć|Ć ć Ĉ|Ĉ ĉ Č|Č č Ċ|Ċ ċ Ḉ|Ḉ ḉ Ƈ|Ƈ ƈ C̈ c̈ Ȼ|Ȼ ȼ Ç|Ç ç ꞔ Ꞓ|Ꞓ ꞓ
 * Ↄ ↄ : Claudian letters

Derived ligatures, abbreviations, signs and symbols

 * © : ©|copyright symbol
 * ℃ : ℃|degree Celsius
 * ¢ : ¢|cent
 * ₡ : ₡|colón (currency)
 * ₢ : ₢|Brazilian cruzeiro (currency)
 * ₵ : ₵|Ghana cedi (currency)
 * ₠ : European Currency Unit CE
 * ℂ : ℂ|double struck C
 * ℭ : ℭ|blackletter C
 * Ꜿ ꜿ : Medieval abbreviation for Latin syllables con- and com-, Portuguese -us and -os

Computing codes

 * 1 Also for encodings based on ASCII, including the DOS, Windows, ISO-8859 and Macintosh families of encodings.