Ġ

Ġ (minuscule: ġ) is a letter of the Latin script, formed from G with the addition of a dot above the letter.

Ġ ġ
Doulos SIL glyphs for majuscule and minuscule ġ.

Usage

Arabic

Ġ is used in some Arabic transliteration schemes, such as DIN 31635 and ISO 233, to represent the letter غ (ġain).

Armenian

Ġ is used in the romanization of Classical or Eastern Armenian to represent the letter Ղ/ղ (ġat]]).

Chechen

Ġ is present in the Chechen Latin alphabet, created in the 1990s. The Cyrillic equivalent is гI, which represents the sound /ɣ/.[1]

Inupiat

Ġ is used in some dialects of Inupiat to represent the voiced uvular fricative /ʁ/.

Irish

Ġ was formerly used in Irish to represent the lenited form of G. The digraph gh is now used.[2]

Maltese

Ġ is the 7th letter of the Maltese alphabet, preceded by F and followed by G. It represents the voiced postalveolar affricate [dʒ].[3]

Old Czech

ġ is sometimes (about 16th century) used to represent real [g], to distinguish it from the letter g which represented the consonant [j].

Old English

Ġ is sometimes used in modern scholarly transcripts of Old English to represent [j] or [dʒ] (after n), to distinguish it from g pronounced as /ɣ/, which is otherwise spelled identically. The digraph cg was also used to represent [dʒ].[4]

Ukrainian

Ġ is used in some Ukrainian transliteration schemes, mainly ISO 9:1995, as the letter Ґ.

Phonetic transcription

ġ is sometimes used as a phonetic symbol transcribing [ɣ] or [ŋ].

Georgian

Ġ is used in the transliteration of Georgian to represent the letter .

Computer encoding

ISO 8859-3 (Latin-3) includes Ġ at D5 and ġ at F5 for use in Maltese, and ISO 8859-14 (Latin-8) includes Ġ at B2 and ġ at B3 for use in Irish.

Precomposed characters for Ġ and ġ have been present in Unicode since version 1.0. As part of WGL4, it can be expected to display correctly on most computer systems.

Appearance Code points Name
Ġ U+0120
U+0047, U+0307
LATIN CAPITAL LETTER G WITH DOT ABOVE
LATIN CAPITAL LETTER G + COMBINING DOT ABOVE
ġ U+0121
U+0067, U+0307
LATIN SMALL LETTER G WITH DOT ABOVE
LATIN SMALL LETTER G + COMBINING DOT ABOVE

OpenAI's GPT-2 uses 0xC4 0xA0 (Ġ) as the start of a word in its tokens.[5]

References

  1. Koryakov, Yuri B. (2002). Atlas of Caucasian Languages (PDF). Moscow: Institute of Linguistics, Russian Academy of Sciences. pp. 6–7.
  2. "Symbol Codes | Irish, Old Irish and Manx". Pennsylvania State University. 21 April 2017. Retrieved 10 January 2023.
  3. Robert D. Hoberman (2007). Kaye, Alan S. (ed.). "Chapter 13. Maltese Morphology" (PDF). Morphologies of Asia and Africa. Winona Lake, Ind.: Eisenbrauns: 258. ISBN 978-1-57506-109-2. Retrieved 10 January 2023.
  4. Daniel Paul O'Donnell. "The Pronunciation of Old English". University of Lethbridge Personal Web Sites. Retrieved 26 October 2022.
  5. "Why \u0120 (Ġ) is in so many pairs? · Issue #80 · openai/GPT-2". GitHub.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.