top | item 46661897

Kip: A programming language based on grammatical cases of Turkish

254 points| nhatcher | 1 month ago |github.com

65 comments

order

joomy|1 month ago

Hi all, Kip's developer here! I was going to wait until we had finished the playground and landing page before posting about the project more, but here's the browser-based playground we have so far (thanks to Alperen Keles) for anyone who wants to play with the language: https://alpaylan.github.io/kip/

(The work on JavaScript transpilation just started today and currently doesn't work, but running the language should mostly work, though it probably has bugs, which I'd love to hear about in the repo's issues!)

yowayb|1 month ago

I studied Turkish for a few years and remember thinking it could make an interesting programming language (due to the grammatical/agglutinative features). I was gonna call it Ç, but I was never seriously going to make it. Happy to see someone went for it!

gerdesj|1 month ago

"Kip (meaning "grammatical mood" in Turkish)"

I'm not sure what a grammatical mood is, so I tried a couple of well known translation services and got: kip == "mode". However big G did also manage "modal", "paradigm", "tense" and "module".

For my money: "tense". Just to confuse the issue, tense has several meanings in english! Here I think we are talking about a verbal tense:

https://www.bbc.co.uk/bitesize/articles/zh4thbk#zyh2s82

Tense can also be synonymous with emotion: dangerous/exciting and also as a measure: tension/tight.

araes|1 month ago

Question on some of the syntax. It's neat, and think the idea's cool. Would definitely be something if nothing else for security through obscurity. Is it even code?

However, for some of the number stuff, if you write something like:

  (5'le 3'ün farkını) yaz.
  (3'ün 5'le farkını) yaz.
How does it tell whether it is:

  5 - 3 =  2, or
  3 - 5 = -2  ?
Does it always just return 2 because of the meaning of "farkını" and the placement of 'le and 'ün? Like:

  (5 first, 3 second, difference) write, vs
  (3 second, 5 first, difference) write  ?
Google just gave back:

  Write (the difference between 5 and 3).
  Write (the difference between 3 and 5).
Not especially familiar with Turkish, and mostly had to use translation, yet it looks like a language for defining math theorems? Number following "zero" shall be called "one", number following "one" shall be called "two". Or is that more just a feature of using natural language for the writing syntax?

sedatk|1 month ago

Fantastic work, an area I’ve always wanted to explore.

nhatcher|1 month ago

Sorry! It was too beautiful not to share it. But I'm sure you can do a show HN once you think is ready.

celaleddin|1 month ago

Got especially excited, since I also experimented on a similar idea a few years ago:

https://github.com/celaleddin/sembolik-fikir

Will check this out further in the following days. Thanks for sharing!

joomy|1 month ago

That's pretty cool! From what I can tell, it does a morphological guess based on the suffix. If you didn't have the apostrophe, it'd have issues with ambiguity (say "aşı", does it mean vaccine or does it mean "aş" in accusative case?) but the apostrophe solves that problem too.

My solution for this problem in Kip was to go all the way with the morphological analysis using TRmorph (https://github.com/coltekin/TRmorph) for it, and then resolve the ambiguities in type checking / elaboration. (Therefore Kip almost never needs apostrophes.) Whether it was worth it, I don't know, but it was a fun problem to solve. :)

arikana|1 month ago

It seems the order of the parent structure is still following the Anglo-saxon grammar even though lines and pharisees are Turkish , I would like to see something:

Fibonacci-Dizisi (n):

  n 0'a eşitse: 
    dur.

  değilse:

    bu-sayıyı yazdır.
    şu-sayıyı, (bu-sayı + şu-sayı), (n - 1) ile Fibonacci-Dizisi'ne devam et.
Ana-Akış:

  "Bir sayı girin: " yazdır.
  Girdiyi oku -> girdi.

  girdi'nin tam-sayı-hali:

    Yokluksa: "Geçersiz sayı" yazdır.
    Varlıksa (n): 0, 1, n ile Fibonacci-Dizisi'ni başlat.
Başlat: Ana-Akış.

nevertheless, that is a good start, congratulations.

tristenharr|1 month ago

Woah this is AWESOME! I am going to have to take a look! Love the thoughts, doing something similar at https://logicaffeine.com/studio

Check out Logos lang, would love to chat sometime. love that you chose Haskell!

lolc|1 month ago

Haha I can read some casual Turkish and this made my day!

Funny how the case system of Turkish is both strong and standardized enough for this to work well. I don't know any other language where flexible argument order would work so well.

inkyoto|1 month ago

> I don't know any other language where flexible argument order would work so well.

Any highly inflected language has such a property. Slavic languages, Sanskrit (or, more broadly, Indo-Aryan languages) are prime examples.

Speakers of Finnish and Hungarian will likely chime in and state something similar.

thaumasiotes|1 month ago

> I don't know any other language where flexible argument order would work so well.

What kind of sample size is that? A case system and flexible argument order are largely the same thing.

Note also that flexible argument order is a robust phenomenon in English:

1. Colonel Mustard killed him in the study at 5:00 with his own knife.

2. Colonel Mustard killed him at 5:00 in the study with his own knife.

3. Colonel Mustard killed him in the study with his own knife at 5:00.

4. Colonel Mustard killed him with his own knife at 5:00 in the study.

5. Colonel Mustard killed him at 5:00 with his own knife in the study.

6. Colonel Mustard killed him with his own knife in the study at 5:00.

But if you insist on looking in other languages, there's a famous Latin poem beginning Quis multa gracilis te puer in rosa perfusus liquidis urget odoribus grato, Pyrrha, sub antro?

Translating this as closely as possible to a one-word-for-one-word standard, it says What slender boy soaked [in] liquid odors presses you among many rose[s], Pyrrha, beneath [a] pleasant cave?

(Notes: rosa is singular for unclear reasons. There is nothing corresponding to the in of "in liquid odors"; the relationship between the odors and the soaking is expressed purely by case. There is also nothing corresponding to the article in "a pleasant cave"; Latin does not mark definiteness in this way. Location inside a cave is expressed with "beneath"; compare English underwater.)

Anyway, the actual word ordering, using this translation, is: What many slender you boy among rose[s] soaked liquid presses [in-]odors pleasant, Pyrrha, beneath [a-]cave?

I've heard that Russian poetry is given to similarly intricate word orderings.

ugurs|1 month ago

Clicking the link with a prejudice in my mind, I found the definitions cleverly clean and easy to understand. I would be pleased to see a German version of it, just to have a good laugh.

abdusco|1 month ago

Yep. I was expecting to see an existing language (usually Rust) with keywords swapped with Turkish translations, but no, this is actually really nice.

Ruidy|1 month ago

I have been learning the language because my girlfriend is Turkish.

Now I can use my programming brain to fast forward the learning.

Amazing

biddit|1 month ago

Yep, this was a skillset combination I never envisioned. (am married to a Turk). Maybe a good reason to push past the intermediate plateau.

SuperNinKenDo|1 month ago

I love this kind of stuff - non-English programming languages, particularly when they utilise language features in unexpected ways like this.

My Turkish is pretty rusty - and was never any good anyway, but really cool stuff.

chuckadams|1 month ago

Reminds me a bit of Lingua::Romana::Perligata.

joomy|1 month ago

Yes, that's one of my inspirations! I'm writing a short paper about Kip and I'm citing Perligata there for sure. The closest modern non-English programming language I know that also uses grammar features is Tampio, for Finnish. https://github.com/fergusq/tampio

octoberfranklin|1 month ago

It would be really helpful if this page showed side-by-side comparisons of the same program written in Kip and some other language, like say Haskell.

I'm having a hard time seeing how this is much different from record types, except that you're limited to only eight fixed record field names (one for each grammatical case).

joomy|1 month ago

Cases essentially act like named arguments, except the names are inferred from the case of an argument, which is inferred through morphological analysis. And that analysis can be ambiguous, so the ambiguities are solved by the type checker / elaborator. It's different from record types in the sense that you can provide the arguments in any order to a function, and the system will figure it out because of the cases.

Aerbil313|1 month ago

Neat idea. I’m a native Turkish speaker, and when I was first learning English as a kid I remember always being baffled at the extreme chaos and irregularity in English. In the end I learned it mostly by exposure and not from school.

Twey|1 month ago

I love when the ‘language’ part of programming languages doesn't get left behind :) Very neat!

hahahahhaah|1 month ago

A language an LLM can choke on!

nacozarina|1 month ago

with AI every country can migrate development to a programming language that reflects the diction of their unique ethnolinguistic traditions, it's very empowering

hasanabi|1 month ago

Better call it "Kanak".

rafohy12|1 month ago

[deleted]

alpaylan|1 month ago

The Maryland one (that would be me, although I haven’t really done anything except the WASM bindings, this is really all Joomy’s work, kudos to him) is vacationing in Izmir right now, why would that even be important though?

BigTTYGothGF|1 month ago

I don't even live in England but still write in English, heck of a thing if you think about it.