Index Thomisticus
The Index Thomisticus was a digital humanities project begun in the 1940s that created a concordance to 179 texts centred on Thomas Aquinas. Led by Roberto Busa, the project indexed 10,631,980 words over the course of 34 years, initially recording the texts on punched cards. It is considered a pioneering project in the field of digital humanities.
Project
Busa began the project in 1946.[1] In 1949, IBM agreed to sponsor the project until its completion[2] and assigned Paul Tasman, an executive at the company, to work with Busa.[3] Busa selected 179 texts centred on Thomas Aquinas to be converted into machine-readable form. Of these, 118 were written by Aquinas; the remaining 61 had either been misattributed to him at some point or were attempts to complete a work he had left unfinished.[2] Between 1950 and 1966, the project's staff punched the texts onto cards, working in Gallarate, Italy.[4][5] The workforce peaked at 70 in 1962.[6] After the punching was complete, the data was lemmatised in a semi-automatic process.[4]
The completed project indexed a total of 10,631,980 words in fifty-six volumes running to over 70,000 pages: ten volumes of indexes, thirty-one volumes of concordances of Aquinas's works, eight volumes of concordances of related authors, and seven volumes that reprinted the source texts.[2][7] The seven volumes reprinting the source texts were sold separately.[2] The first volume was published in 1974,[8] and publication was completed in 1980. The project used a total of 1,500 kilometres (930 mi) of tape,[9] and it took an estimated 10,000 hours of computer work and 1 million hours of human work to complete.[3] The Index was released on CD-ROM in 1992, and a website was launched in 2005.[9]
Reception, impact, and legacy
A review of the project published in Computers and the Humanities described it as "as innovative and fascinating a reference work as the technology that made it possible."[10] A 1993 review described the project as the "second largest printed work of this century", but also called it "excessive", questioned its purpose, and described it as "the most pedantic work ever written".[7] In 2020, The Economist described it as "the creation story of the digital humanities."[9] An article in Umanistica Digitale stated that "the project developed for the first time, methods for dealing with unstructured language".[11] The Index influenced projects such as Key Word in Context.[11] It is also sometimes listed as one of the earliest instances of an e-book.[12]
References
1. Busa, R. (1980). "The Annals of Humanities Computing: The Index Thomisticus". Computers and the Humanities. 14 (2): 83–90. doi:10.1007/BF02403798. ISSN 0010-4817. JSTOR 30207304. S2CID 38602853.
2. Burton 1984, pp. 109–110.
3. "Paul Tasman, Executive, 74". The New York Times. 1988-03-07. ISSN 0362-4331. Retrieved 2020-12-27.
4. Gouws, Rufus; Heid, Ulrich; Schweickard, Wolfgang; Wiegand, Herbert Ernst (2013-12-18). Dictionaries. An International Encyclopedia of Lexicography: Supplementary Volume: Recent Developments with Focus on Electronic and Computational Lexicography. Walter de Gruyter. p. 972. ISBN 978-3-11-023813-6.
5. Sprokel, Nico (1978). "The 'Index Thomisticus'". Gregorianum. 59 (4): 739–750. ISSN 0017-4114. JSTOR 23576117.
6. Rockwell & Passarotti 2019, p. 13.
7. Guietti, Paolo (1993). "Hermeneutic of Aquinas's Texts: Notes on the Index Thomisticus". The Thomist: A Speculative Quarterly Review. 57 (4): 667–686. doi:10.1353/tho.1993.0006. ISSN 2473-3725. S2CID 171327330.
8. Hockey, Susan (2006-01-01). Dawson, Andy; Brown, David (eds.). "The rendering of humanities information in a digital context: Current trends and future developments". ASLIB Proceedings. 58 (1/2): 89–101. doi:10.1108/00012530610648699. ISSN 0001-253X.
9. "How data analysis can enrich the liberal arts". The Economist. 2020-12-19. ISSN 0013-0613. Retrieved 2020-12-27.
10. Burton 1984, p. 109.
11. Rockwell & Passarotti 2019, p. 15.
12. Anderson, Craig; Pham, Jeanie (March 2013). "Practical overlap: The possibility of replacing print books with e-books". Australian Academic & Research Libraries. 44 (1): 40–49. doi:10.1080/00048623.2013.773866. ISSN 0004-8623.
Bibliography
- Burton, Dolores (1984). "Review of Index Thomisticus: Sancti Thomae Aquinatis operum omnium indices et concordantiae; Sancti Thomae Aquinatis opera omnia". Computers and the Humanities. 18 (2): 109–120. doi:10.1007/BF02274166. ISSN 0010-4817. JSTOR 30200002. S2CID 29640298.
- IBM Data Processing Division (1973). "Jesuit Father Uses Computer to Analyze Works of St. Thomas Aquinas" (PDF). Modern Data. 6 (9): 39–40. ISSN 0026-7678.
- Rockwell, Geoffrey; Passarotti, Marco (2019-05-27). "The Index Thomisticus as a Big Data Project". Umanistica Digitale (5). doi:10.6092/issn.2532-8816/8575. ISSN 2532-8816.