top | item 17629694

(no title)

snowpanda | 7 years ago

What is katakana?

discuss

order

jcranmer|7 years ago

The Japanese first adapted writing from Chinese; this adapted system is called kanji and is highly similar to, and often homographic with, Chinese hanzi. However, Japanese and Chinese have very different phonological structure: Chinese is mostly comprised of monosyllabic words, whereas Japanese has lots of polysyllabic words. This makes it more difficult to do phonetic transcription in Japanese kanji than in Chinese hanzi.

The Japanese got around this by simplifying the script into a syllabary (every character represents roughly a syllable, or more often, a consonant-vowel pair). They did this twice: one of these syllabaries is hiragana, and the other is katakana. In modern usage, katakana is used largely for phonetic transcription, such as transliteration of foreign words and names, or onomatopoeia, whereas hiragana is used for writing out Japanese words.

As others have pointed out, Japanese has a rather constrained phonological system, so a word like strengths cannot be represented directly but rather more like "su-to-re-n-ge-tsu." Of course, this feature isn't limited to Japanese; it's how an island called "Christmas" gets transliterated to "Kiritimati", just like its parent "Kiribati" is the local pronunciation of "Gilbert." While people think it's annoying, it's largely because they haven't faced languages with challenging transciptions into Latin script. There's a reason why there's a plethora of transcriptions of "مُحمّد‎" after all.

elgenie|7 years ago

> There's a reason why there's a plethora of transcriptions of "مُحمّد‎" after all.

There's a plethora of transcriptions into (more) phonetic alphabets because there are a plethora of regionalized pronunciations [1]. And there are a plethora of pronunciations because Arabic uses an impure abjad [2]. Since the vowels are not always, exactly or uniformly specified in writing, unspecified behavior leads to varying results in each compiler.

[1] https://en.wikipedia.org/wiki/Muhammad_(name)#Transliteratio... [2] https://en.wikipedia.org/wiki/Abjad#Impure_abjads

snowpanda|7 years ago

Thank you for explaining this to me! (And thank you to everyone else's comments too of course). This is fascinating.

elgenie|7 years ago

Katakana [1] is one of the two syllabary writing systems in Japanese, with Hiragana [2] being the other. IIRC katakana is the one where every syllable starts with a consonant and ends with a vowel, so transliterated loan words with consecutive consonants end up growing vowels in the middle. They also have kanji (characters), and various romanization systems (https://en.wikipedia.org/wiki/Romanization_of_Japanese). It's really amazingly inefficient.

[1] https://en.wikipedia.org/wiki/Katakana [2] https://en.wikipedia.org/wiki/Hiragana

nneonneo|7 years ago

Japanese uses three distinct character sets in writing: Hiragana (a syllabary), Katakana (another syllabary) and Kanji (Chinese characters). Hiragana and Kanji are usually used together to write native Japanese words, whereas Katakana is used almost exclusively for loanwords.

Katakana syllables are either bare vowels, or consonant+vowel (with few exceptions), meaning that transliterated words must be contorted to fit the syllabary (the "lossy algo" referred to by the parent comment). So, something like "kernel" in English becomes kaa-ne-ru, and "subsystem" becomes "sa-bu-shi-su-te-mu".

Forkus|7 years ago

Why is everyone posting squares?

jacoblambda|7 years ago

Unicode characters probably aren't all rendering on your end due to your font.