Non-English-based programming languages
Non-English-based programming languages are programming languages that do not use keywords taken from or inspired by English vocabulary.
Prevalence of English-based programming languages
The use of the English language in the inspiration for the choice of elements, in particular for keywords in computer programming languages and code libraries, represents a significant trend in the history of language design. According to the HOPL online database of languages,[1] out of the 8,500+ programming languages recorded, roughly 2,400 of them were developed in the United States, 600 in the United Kingdom, 160 in Canada, and 75 in Australia.
Thus, over a third of all programming languages have been developed in countries where English is the primary language. This does not take into account the usage share of each programming language, situations where a language was developed in a non-English-speaking country but used English to appeal to an international audience (see the case of Python from the Netherlands, Ruby from Japan, and Lua from Brazil), and situations where it was based on another programming language which used English.
International programming languages
The concept of international style programming languages was inspired by the work of British computer scientists Christopher Strachey, Peter Landin, and others. It represents a class of languages of which the line of the algorithmic languages ALGOL was exemplary.
ALGOL 68
ALGOL 68's standard document was published in numerous natural languages. The standard allowed the internationalization of the programming language. On December 20, 1968, the "Final Report" (MR 101) was adopted by the Working Group, then subsequently approved by the General Assembly of UNESCO's IFIP for publication. Translations of the standard were made for Russian, German, French, Bulgarian, and then later Japanese. The standard was also available in Braille. ALGOL 68 went on to become the GOST/ГОСТ-27974-88 standard in the Soviet Union.
- GOST 27974-88 Programming language ALGOL 68 – Язык программирования АЛГОЛ 68[2]
- GOST 27975-88 Programming language ALGOL 68 extended – Язык программирования АЛГОЛ 68 расширенный[3]
In English, Algol68's case statement reads case ~ in ~ out ~ esac. In Russian, this reads выб ~ в ~ либо ~ быв.
Citrine
Localization is the core feature of the Citrine Programming Language. Citrine is designed to be translatable to every written human language. For instance the West Frisian language version is called Citrine/FY. Citrine features localized keywords, localized numbers and localized punctuation. Users can translate code files from one language into another using a string-based approach. At the time of writing, Citrine supports 111 human languages. Support is not limited to well-known languages; all natural human languages are being accepted for inclusion, up to EGIDS-6.
Hedy
Hedy is a open-source programming language which was developed for programming education. It was designed to be as instructive as possible and as accessible as possible with a few unique features. It supports over 30 different languages, meaning its keywords can be typed in any of those. It supports languages which do not use the Latin alphabet for its keywords and variable names and it also supports more numbering systems than Arabic numerals, like Eastern Arabic numerals. All of these can be used interchangeably. The error messages are as verbose as possible, explaining what is wrong and what might be a fix, just like the Rust compiler.
Scheme
While Internationalization is not a part of any Scheme standard, the expressiveness and flexibility of the language allows for the addition of internationalization as a library. International Scheme is an open source project to which anyone can contribute a translation. Since translations of Scheme can be loaded as libraries, Scheme programs can be multilingual.
Scratch
Scratch is a block-based educational language. The text of the blocks is translated into many languages, and users can select different translations. Unicode characters are supported in variable and list names. (Scratch lists are not stored inside variables the way arrays or lists are handled in most languages. Variables only store strings, numbers, and, with workarounds, boolean values, while lists are a separate data type that store sequences of these values.) Projects can be "translated" by simply changing the language of the editor, although this does not translate the variable names.
Based on non-English languages
Basis | Name | Description | Link |
---|---|---|---|
Amharic | AxumLight / Geez# | Amharic-based programming language on the .NET platform | |
BunnaScript | A multilingual JavaScript and TypeScript programming language that enables writing computer code in the user's native language. Currently supports Amharic language. | ||
Arabic | Al-Khawarizm | Middle-level language for system programming with custom IDE. Intended for professional purpose. supports Windows and Linux platforms. Firsts design and developments in 2008. Development status: active. | |
Jeem | Arabic procedural programming language that uses features from both C and Pascal. Supports simple graphics. Intended for educational purposes. Developed by Mohammad Ammar Alsalka. Released in 2006. | ||
ARLOGO | An open-source language based on the UCB Logo interpreter. | ||
Ebda3 | A multi-paradigm high-level programming language. | ||
Qalb | A Scheme-like programming language.[4] | ||
Kalimat | A programming language that aims to help Arab children learn about programming.[5] | ||
Ammoria | Object-oriented language | ||
Loughaty (MyProLang) | A general-purpose natural Arabic programming language based on a proprietary syntax.[6] | ||
Phoenix | A C-like high-level imperative procedural language. | ||
Seen | A Bilingual English/Arabic based language that transpiles to Rust. | ||
Bengali | ChaScript | Built using ECMAScript grammar.[7] | |
Chinese | Chinese BASIC | Chinese-localized BASIC dialects based on Applesoft BASIC; for Taiwanese Apple II clones and the Multitech Microprofessor II. | |
ChinesePython (ZhPy) | A version of Python with keywords and built-in identifiers all translated to Chinese. | ||
Easy Programming Language | A Chinese rapid application development language. | ||
Wenyan | An esoteric language that closely follows the grammar and sentence structure of Classical Chinese; it compiles to JavaScript and Python (with C, etc. forthcoming). | wenyan-lang on GitHub | |
Czech and Slovak | Karel | An educational programming language with Czech and Slovak versions. | |
Dutch | Superlogo | Created for computer-aided instruction, based on Logo. | |
Drama | An assembly language for educational purposes. | ||
Finnish | sampo | An educational Forth-like programming language used in the 1980s by a high school in the city of Oulu. | |
Tampio | Made to look like a natural language. Object-oriented. | tampio on GitHub | |
French | GOTO++ | An esoteric programming language loosely based on French and English. | |
Linotte | |||
LSE (Language Symbolique d'Enseignement) | A pedagogical programming language designed in the 1970s at the École Supérieure d'Électricité. A kind of BASIC, but with procedures, functions, and local variables, like in Pascal. | ||
BASICOIS | BASIC with French keywords[8] | ||
SPIP | Templating language with French syntax[9] | ||
WLangage | WinDev programming language | ||
Rouille | Rust translated into French[10] | rouille on GitHub | |
German | Teuton | A German implementation of Python. | |
Greek | Glossa | Based on Pascal, used for educational purposes in secondary education. | |
M2000 | Programming Environment using Greek and or English Keywords. Basic like language (interpreter) with over 500 Keywords, including OOP, Threads, Events, Lambda Functions. | List of BASIC dialects, | |
Hebrew | TamliLogo | A Hebrew implementation of Logo. | |
Genesis | An esoteric programming language based on "Paleo-Hebrew". | ||
Hindi | OM Lang | OM Lang is a multilingual, statically typed programming language that supports more than 9+ Indian languages. OM Lang is very similar to GO, C and Rust. It has its own interpreter and compiler. | |
Bharat programming language | A Hindi programming language with all the modern basic functionalities like loops, function, classes, structure. Its interpreter is written in Python. You can install in your local system or you can use online interpreter. | ||
Kalaam.io | Kalaam is the programming language in Hindi and has all the basic functionalities of a modern programming language from for loops to while loops and from functions to conditional statements. Kalaam has its own parser, interpreter, Virtual memory stack and executes source code from parse tree generated by the parser. | ||
Bhailang | Bhailang is dynamically typed toy programming language, based on an inside joke, written in Typescript. | ||
Hindi Programming Language | A Hindi programming language for the .NET Framework. | ||
W language | A Hindi translated HTML language | w language on GitHub | |
Hindi, Bengali, Gujarati | Hindawi Programming System | A set of variants of C, C++, lex, yacc, assembly, BASIC, Logo and Ada | |
Icelandic | Fjölnir | An imperative programming language from the 1980s. | |
Indonesian | BAIK | C with Indonesian keywords. | BAIK on SourceForge |
Irish | Setanta | High level general purpose language that runs in the browser. Contains a graphics API that can be used to make games and animations | |
Italian | CadregaLisp | A Lisp dialect based a comedian sketch of Aldo Giovanni and Giacomo | CadregaLisp on GitHub |
GomorraSQL | A SQL dialect based upon the Italian series Gomorrah | GomrraSQL on GitHub | |
Monicelli | An esoteric programming language based upon the so-called "Supercazzola" of the movie My Friends | monicelli on GitHub | |
Japanese | Dolittle | Developed for educational purposes. Freeware. | |
Himawari (ひまわり) | Succeeded by Nadeshiko. Freeware. | ||
Kotodama (ことだま) on Squeak | Based on Squeak. For educational purposes. | ||
Mind | A variant of FORTH.[11] Commercial. | ||
Nadeshiko (なでしこ) | Successor of Himawari. Open source freeware. Chosen (along with MIT Scratch) in one of the 3 official textbooks of technical arts for junior high schools in Japan, from 2022.[12][13] | ||
Produire | An object-oriented Japanese programming language. Successor of TTSneo. Freeware. | ||
TTSneo | A Japanese programming language. Succeeded by Produire. Freeware. | ||
Unchiku (うんちく) | A Japanese programming language. Has limited functions/operations. Developed for educational purposes. | ||
Javanese | sawa | A Javanese translated Python language. | sawa on GitHub |
Korean | Ssiat | A language using Hangul (Korean). The name Ssiat (씨앗) means "seed" in Korean. | |
Aheui | An esoteric programming language similar to Befunge but using Hangul[14] | ||
Changjo | A language using Hangul. It is used for multimedia and game programming. | ||
hForth | A Forth system with an optional Korean keyword set. | ||
Latin | Lusus | ||
Lingua::Romana::Perligata | Alternative Syntax for Perl 5 that allows programming in Latin. | ||
Latvian | Dzintars | Ruby translated into Latvian. | dzintars on GitHub |
Lithuanian | Setonas | Python translated into Lithuanian. | setonas on GitHub |
Malay | Malaybasic | BASIC with Malay keywords | |
Malayalam | Thengascript | A dialect of JavaScript in the Malayalam language. | |
Malluscript | An esoteric scripting language made with Rust based on Manglish and Malayalam memes. | ||
Persian | Farsinet | An object-oriented programming language for .NET Framework. It is similar to C# and Delphi. | |
Kharazmi | A Persian programming language and IDE designed to teach programming to middle school students, similar to Pascal | ||
Zangar | Zangar (Persian for Rust) allows you to write Rust programs in Persian, using Persian keywords and function names. It is adapted from Rouille, the original French implementation of Rust. | zangar on GitHub | |
Kati (زبان کاتی) | An online programming language in Persian language. | ||
Hashemi (آقای هاشمی) | forked from "SimpleLanguage" that is a simple example language built using the Truffle API. | mr-hashemi on GitHub | |
Simorgh (SPL) | SPL has been invented with the aim that non-technical people would be able to produce their own portable applications which can be written in their own native language. | ||
Polish | AC-Logo | Logo with Polish commands IDŹ, OTO, NAPRZÓD etc. created in 1992 | |
SAKO | A language created in the 1950s and nicknamed the "Polish Fortran" | ||
EOL | Expression Oriented Language allowed for using Polish or English keywords. | ||
Zdzich | Amateur programming language for DOS. | ||
Portuguese | VisuAlg | A language designed to teach programming, based on Pascal. | |
potigol | A functional programming language in Portuguese for beginners. | ||
Portugol Studio | based in C and PHP for learning purposes. | ||
G-Portugol | A programming language with Portuguese keywords. | gpt on GitHub | |
NPortugol | A .NET Programming language for embedded scripts | ||
Russian | 1C:Enterprise | A framework and language for business applications. English keywords can also be used. | |
РАПИРА | An interpreted procedural programming language with strong dynamic type system. | ||
Аналитик | A language for symbolic manipulations with algebraic expressions used in the Soviet series of MIR computers.[15] | ||
Эль-76 | A language for symbolic manipulations with algebraic expressions used in the Soviet series of Elbrus computers.[16] | ||
ЯМБ | ЯМБ (язык машин бухгалтерских) (machine language for accounting machines) – A Russian programming language for Iskra-554, Iskra-555, and Neva-501 specialized accounting computers. | ||
КуМир | Similar to Pascal and IDE, mainly intended for educational usage in schools. The name is an acronym, which means Комплект ученический 'Мир' ('Mir' student's environment). | ||
Робик | A simple language for teaching the basics of programming to children. | ||
ЯАП | A language used on Nairi | ||
АЛМИР-65 | A language used on MIR | ||
УАЯ | A language to study programming | ||
Sanskrit | OM Lang | OM Lang is a statically typed, multilingual programming language with first-class support for Sanskrit. It claims to be the world's first Sanskrit programming language, offering Sanskrit keywords and syntax. The language aims to experiment with using Sanskrit in programming. | |
Serbian | Ћ плус плус | Ћ плус плус (Serbian, "C plus plus") is a JavaScript like programming language based on Cyrillic script | |
Sinhala | Ceylonicus | Ceylonicus is an open source, interpreted, functional and an bilingual programming language. It is the first Sinhala Programming Language (first to support Sinhala-based syntax). Capable of executing both English and Sinhala based syntax in a single code-base. Language is written in Python and web IDE is developed with Brython. | Run Online (WEB IDE), Source Code Ceylonicus Source Code on GitHub |
Spanish | GarGar | A procedural programming language based on Pascal for learning purposes.[17] | |
Vainilla | A pseudocode interpreter for Spanish that runs in the browser. | ||
PSeInt | A pseudocode interpreter for Spanish, like Pascal, with a completely Spanish-based syntax. PSeInt is an abbreviation for Pseudocode Interpreter. | ||
Qriollo | An impure strict functional programming language that compiles to C, Python, and JVM Bytecode, with keywords in Rioplatense Spanish, spoken in Buenos Aires. | ||
Latino | A language with a completely Spanish-based syntax. | ||
Sí | A direct translation pseudo-language for coding in C and C++ with Spanish keywords. | ||
Pauscal | A language with a completely Spanish-based syntax; compiler for 32-bit Windows. | ||
InformATE | A translation of Inform, used for creating text-based games. | ||
EsJS | An interpreted programming language with Spanish syntax, based on JavaScript. | WebsiteGitHubPlayground | |
Swahili | swap | Swahili Programming Language; it is an educational programming language built over Nodejs, It was built as way to help Swahili speakers to easily grasp the concept of coding and programming in general. | |
Swedish | Enkelt | A language with a completely Swedish-based syntax. Transpiled to Python. | |
Tagalog | Bato | Ruby translated into Tagalog. | bato on GitHub |
Tamil | Ezhil | Developed for educational purposes. | |
Swaram | A simple, general-purpose and procedural language.[18] | ||
W language | HTML translation in Tamil language | w language on GitHub | |
Turkish | Yakamoz | A Turkish implementation similar to that of Python. | |
Urdu | UrduScript | A dialect of JavaScript. | |
Yoruba | Yorlang | Built on top of Node.js. | |
Various | Logo | In one of its Apple II editions, it was available in French. A version used in Bulgarian education with Bulgarian Apple II clones was translated in Bulgarian. LOGO for the Commodore 64 had an Italian localization. | |
4th Dimension | On local versions, its internal language uses French or German keywords. | ||
FOCAL | Keywords were originally in English, but DEC produced versions of FOCAL in several European languages. | ||
Mama | An educational programming language and development environment, designed to help young students start programming by building 3D animations and games. It is currently available in English, Hebrew, Yiddish, and Chinese. | ||
MS Word and MS Excel | Their macro languages used to be localized in non-English languages. | ||
ML4 | A language for client/server database programming, with keywords in English or German.[19] | ||
RoboMind | An educational programming language available in Arabic, Chinese, Dutch, English, French, German, Greek, Polish, Brazilian Portuguese, Spanish, Swedish, and Turkish. | ||
Scratch | An introductory visual programming language from MIT's Media Lab with support for programming in multiple languages included as standard. | ||
Catrobat | A visual programming language targeted at teenagers with support for more than 60 languages like Slovenian, Polish, Hindi, German, Romanian, Turkish, Norwegian, Akan, Twi, Hebrew, Chinese Simplified, Indonesian, Swedish, Chinese Traditional, Portuguese, Brazilian, Croatian, Bosnian, Thai, Japanese, Sindhi, Tamil, Swahili, Arabic, Persian, Korean, Serbian (Cyrillic), Serbian (Latin), Bengali, French, Slovak, Vietnamese, Italian, Macedonian, Russian, Portuguese, English, Canada, Czech, Sinhala, Azerbaijani, Kazakh, Dutch, Catalan, Albanian, Pashto, Gujarati, Finnish, Malay, Lithuanian, Malayalam, Danish, Hungarian, Afrikaans, Kannada, Greek, Bulgarian, Galician, Urdu (Pakistan), English, Australia, English, United Kingdom, Telugu, Ukrainian, Hausa and Igbo. | on GitHub | |
TECO | The programming language used to drive the actions of the TECO text editor. The MIT dialect of this language was used to program early versions of Emacs before its 1985 rewrite in Emacs Lisp. | ||
TI-Calculator BASIC | The 68000 version is localized. Various configuration strings are localized too, preventing direct binary compatibility. | ||
Citrine | Programming language that can be used to code in any native human language, includes a translator module to translate code from one human language to another. At the time of writing it supports English, Dutch, Lithuanian and Romanian. |
Based on symbols instead of keywords
- APL – A language based on mathematical notation and abstractions.
- Brainfuck – A minimalist esoteric programming language, created for the purpose of having the smallest possible compiler.[20]
- Emojicode – An esoteric language based on Emojis.
- G – Graphical language used in LabVIEW (not to be confused with G-code).
- Hoon – A systems programming language for Urbit, compiling to Nock.
- J – An APL-like language which uses only ASCII special characters and adds function-level programming.
- Light Pattern – A language which uses a series of photographs rather than text as source code.[21]
- Mouse – A minimalist language created by Dr. Peter Grogono which uses ASCII characters for keywords.
- Piet – An art-based esoteric programming language.
- Plankalkül – The first high-level non-von Neumann programming language, designed by Konrad Zuse during World War II in Germany.
- Whitespace – An esoteric language based on whitespace characters (spaces, tabs, and line breaks).
Modifiable parser syntax
- Babylscript – A multilingual version of JavaScript which uses multiple tokenizers to support localized keywords in different languages and which allows objects and functions to have different names in different languages.
- Component Pascal – A preprocessor that translates native-language keywords into English in an educational version of the BlackBox Component Builder available as open source. The translation is controlled via a modifiable vocabulary and supported by modifiable compiler error messages. A complete Russian version is used in education, and it should be possible to accommodate other left-to-right languages (e.g., the Kabardian language has been tried as a proof of concept).
- HyperTalk – A programming language, which allows translation via custom resources, used in Apple's HyperCard.
- IronPerunis – An IronPython 2.7 localization to Lithuanian and Russian.
- AppleScript – A language which once allowed for different "dialects"[22] including French and Japanese; however, these were removed in later versions.
- Maude – Completely user-definable syntax and semantics, within the bounds of the ASCII character set.[23]
- Perl – While Perl's keywords and function names are generally in English, it allows modification of its parser to modify the input language, such as in Damian Conway's Lingua::Romana::Perligata module, which allows programs to be written in Latin or Michael G. Schwern's Lingua::tlhInganHol::yIghun Perl language in Klingon. They do not just change the keywords but also the grammar to match the language.
- Perunis – Python 2.6 localization to Lithuanian and Russian.
- Ioke – Ioke is a folding language. It allows writing highly expressive code that writes code. Examples of same program in Chinese, Danish, Hindi and Spanish
References
- In HOPL (History of Programming Languages), advanced search finds languages by country.
- "GOST 27974-88 Programming language ALGOL 68 – Язык программирования АЛГОЛ 68" (PDF) (in Russian). GOST. 1988. Archived from the original (PDF) on March 24, 2017. Retrieved November 15, 2008.
- "GOST 27975-88 Programming language ALGOL 68 extended – Язык программирования АЛГОЛ 68 расширенный" (PDF) (in Russian). GOST. 1988. Retrieved November 15, 2008.
- "Hello world: Meet قلب, a programming language based on Arabic script". 25 January 2013.
- "Arabnet | Kalimat: First Arabic Programming Language for Kids".
- IA eng (PDF).
- ChaScript: Breaking the language barrier using Bengali programming system, IEEE, doi:10.1109/ICECE.2014.7026875, S2CID 39503002.
- Labelle, Marcel, Les langages de programmation (PDF).
- Spip documentation team, Language glossary.
- Rouille language homepage, 6 June 2022.
- Katagiri, Akira. 1 Mindのしくみと特徴 [Mind – its mechanism and features]. Scripts Lab (in Japanese). Retrieved 2021-07-31.
(rough translation): Mind uses FORTH's mechanism. Even though FORTH was born in America, its grammar is similar to Japanese grammar.
- Tarui, Hideto (2020-09-17). 日本語で書けちゃうプログラミング言語「なでしこ」が中学の教科書に採用! [Japanese programming language "Nadeshiko" adopted in a junior high textbook!]. WINDOWS FOREST (IT news site) (in Japanese). Impress Corporation. Retrieved 2021-07-31.
- 令和3年度中学校教科書 [2021 technical arts textbook for junior high school]. Kyoiku tosho corporate website (in Japanese). Archived from the original on 2021-06-25. Retrieved 2021-07-31.
- "Aheui", Esolang (wiki).
- Husberg, Nisse; Seppänen, Jouko (1974), "Analitik", ACM Sigsam Bulletin, ACM, 8 (3): 24–25, doi:10.1145/1086837.1086841, S2CID 510819.
- Эль-76, Кірыліца ў сеціве.
- Manual del lenguaje GarGar [GarGar Manual] (in Spanish), archived from the original on November 5, 2016.
- Ganesh (PDF), Infitt, 2003.
- C/S Entwicklungsumgebung ML4, ML-Software.
- "The Brainfuck Programming Language". Muppetlabs.com. Retrieved 2013-10-30.
- Temkin (August 2015). "Light Pattern: Writing Code with Photographs". Leonardo. 48 (4): 375–381. doi:10.1162/LEON_a_01091. S2CID 57571969.
- Cook, William R. (2007). "Apple Script". Proceedings of the third ACM SIGPLAN conference on History of programming languages. doi:10.1145/1238844.1238845. ISBN 9781595937667. S2CID 220938191 – via ResearchGate.
- Language Design in Maude, by Matthias, 2006/06/05, LShift Ltd.
Sources
- Pigott, Diarmuid (2006). "HOPL, the History of Programming Languages". Archived from the original on 2011-02-20. Retrieved 2008-04-14.
External links
- SAKO information page at HOPL – By Diarmuid Pigott