International Computer Archive of Modern and Medieval English

The International Computer Archive of Modern and Medieval English (ICAME) is an international group of linguists and data scientists working in corpus linguistics to digitise English texts.[1] The organisation was founded in Oslo, Norway in 1977 as the International Computer Archive of Modern English, before being renamed to its current title.[2]

The portal to their materials is hosted at the University of Bergen, where they have set out the aim of the organization to "collect and distribute information on English language material available for computer processing and on linguistic research to compile an archive of English text corpora in machine-readable form, and to make material available to research institutions."[3] Creating computer corpora, i.e. collections of texts in machine-readable form, is the most accessible way to study both transcribed spoken language and various genres of written texts for modern scholars, including both "descriptive and more theoretically-minded linguists".[4]

The ICAME group hosts academic conferences that focus on corpus linguistic studies of historical changes and contemporary grammatical descriptions of English, and makes corpora of different varieties of English available to scholars, starting with editions of the 1960s Brown Corpus. Their first academic conference was held in Bergen, Norway in 1979, and scholars who were interested in corpus linguistics continued to meet each spring in different European and English-speaking countries. At these meetings, the compilation and distribution of corpora they enabled played a key role in the creation of the field of corpus linguistics in the 20th century, a precursor to current big data analytics. In summarizing the field, Kennedy's Introduction to Corpus Linguistics notes that "for corpus linguists with an interest in the description of English, the International Computer Archive of Modern and Medieval English has been the major resource".[5] The influence of ICAME on the field has also be laid out in Facchinetti's history, Corpus Linguistics Twenty-five Years On.[6]

One influential resource that ICAME made available was a CD of 20 different corpora, including those covering different regional Englishes (such as the Australian Corpus of English, the Wellington Corpus of Spoken New Zealand English, the Kolhapur Corpus of Indian English, the Bergen Corpus of London Teenage Language (COLT), the Helsinki Corpus of Older Scots, and the International Corpus of English—East-African component), as well as versions of the Brown Corpus and the Lancaster-Bergen-Oslo (LOB) corpus tagged for part of speech.[7]

ICAME also published an annual journal, the ICAME Journal, formerly ICAME News,[8] that contains articles, conference reports, reviews and notices related to corpus linguistics.[9] The current editors of the ICAME Journal are Merja Kytö and Anna-Brita Stenström.[10]

References

  1. Corpus Linguistics and Beyond: Proceedings of the Seventh International Conference on English Language Research on Computerized Corpora. Vol. 59. Rodopi. 1987. p. vi. ISBN 978-9-062-03569-4.
  2. Kennedy, Graeme (19 September 2014). An Introduction to Corpus Linguistics. Routledge. p. 85. ISBN 978-1-317-89258-8.
  3. "ICAME". Retrieved March 28, 2015.
  4. Johansson, Stig (1994). "ICAME-Quo Vadis? Reflections on the use of computer corpora in linguistics". Computers and the Humanities. 28 (4–5): 243–252. doi:10.1007/BF01830271. S2CID 20568137.
  5. Kennedy, Graeme (2014). Introduction to Corpus Linguistics. Routledge. pp. ch. 2.
  6. Facchinetti, Roberta (2007). Corpus Linguistics Twenty-Five Years On. Brill / Rodopi.
  7. Hofland, K.; et al. (1999). ICAME collection of English language corpora [CD].
  8. "degruyter ICAME supplement" (PDF). Retrieved March 28, 2015.
  9. "The LinguistList--ICAME Journal". Retrieved March 28, 2015.
  10. "ICAME Journal". Retrieved March 28, 2015.

Further reading

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.