Japanese is mora-timed: every beat has equal length, and length itself carries meaning. A long vowel is one full extra beat (おばさん aunt vs おばあさん grandmother), and the small っ is a silent beat that doubles the next consonant (きて come vs きって stamp). To pronounce Japanese correctly you must count beats, not just sounds.
Here is a mistake almost every learner makes at least once. You meet a friend’s grandmother, you want to be polite, and you greet her with a word that actually means “aunt.” The room goes briefly quiet. You did nothing rude on purpose. You simply dropped one beat of vowel length, and in Japanese that beat is the entire difference between two words.
English speakers are trained to treat vowel length as emphasis or mood. In Japanese, length is structural. It is part of the spelling, part of the meaning, and part of the rhythm. Until you internalize that, you will keep producing words that are almost right and therefore wrong.
The Core Idea: Mora Timing
Japanese is built from morae, equal-length beats. か is one beat. きゃ is one beat. ん is one beat. And crucially, a long vowel is two beats, and the small っ is a beat of silence. Native rhythm comes from giving each beat its full, equal value, like a metronome.
とうきょう
と・う・きょ・う — four beats
Tokyo (often mis-said as a rushed two beats)
Tokyo is not “TOH-kyo.” It is four even beats: to-o-kyo-o. Mishandle the long vowels and even the name of the capital comes out wrong.
Long Vowels: One Beat Changes the Word
These are real minimal pairs. The only difference is vowel length, and the meaning flips completely.
おばさん vs おばあさん
obasan vs obāsan
aunt / middle-aged woman vs grandmother / elderly woman
おじさん vs おじいさん
ojisan vs ojīsan
uncle / middle-aged man vs grandfather / elderly man
主人 vs 囚人
しゅじん vs しゅうじん
husband vs prisoner
That last pair is a famous one for a reason. Introducing your “husband” and accidentally introducing your “prisoner” is exactly the kind of error a single short vowel produces. The fix is not speaking louder. It is holding the long vowel for its full extra beat.
The Small っ: A Beat of Silence That Means Something
The small っ (sokuon) is written smaller than a normal つ and is never pronounced as “tsu.” Instead it creates a short stop and doubles the following consonant. That tiny pause is a full mora, and it too distinguishes real words.
きて vs きって
kite vs kitte
come (te-form of 来る) vs stamp / postage
かこ vs かっこ
kako vs kakko
past vs parentheses / brackets
いた vs いった
ita vs itta
was (existed) vs went / said
Say きて with no pause and you have said “come.” Insert the small っ pause before the t and you have said “stamp.” The silence is the sound. Beginners tend to skip it because silence does not feel like pronunciation, but in Japanese it is.
How to Hear It (Before You Can Say It)
Production follows perception. You cannot reliably produce a contrast you cannot hear, so the first job is ear training with minimal pairs. The method that works is short, repeated listening with the meaning attached.
| Step | What you do | Why it works |
|---|---|---|
| 1. Pair it | Listen to the two words back to back | Contrast makes the length obvious |
| 2. Tap it | Tap one beat per mora as you listen | Turns timing into a physical sense |
| 3. Shadow it | Repeat immediately, matching length | Trains your mouth to hold the beat |
| 4. Use it | Hear the word inside a full sentence | Locks length to real meaning |
Tapping is underrated. When you physically tap four beats for とうきょう, your body learns the rhythm in a way that silent reading never delivers.
Seeing It in a Real Sentence
切手を買いに郵便局へ行きました。
きってをかいにゆうびんきょくへいきました。
I went to the post office to buy stamps.
This one sentence contains a small っ (切手), a long vowel (郵便局, ゆうびんきょく), and a verb in plain past. Read it with even, full-length beats and it sounds Japanese. Rush the small っ or clip the long vowel and a native listener has to work to understand you, even though every “sound” was technically there.
Why This Quietly Limits Your Listening Too
Length errors are not only a speaking problem. If your brain does not register the extra beat, you will mishear words in listening sections and conversation, confusing 主人 with 囚人 or おじさん with おじいさん. Training the length distinction sharpens comprehension and production at the same time, which is why it pays off twice.
How to Build the Beat Into Your Ear
Mora timing is a perception-and-habit skill, and habits form through frequent, contextual contact rather than a single lesson. Kanjijo is set up to give you exactly that. Every vocabulary item is paired with listening practice and example sentences, so you hear long vowels and the small っ in natural speech, not in isolation. SRS resurfaces the tricky minimal pairs right before you forget the distinction, exclusive mnemonics help you remember which word carries the long vowel, and reading practice shows the spelling so your eyes connect the small character to the sound. Home and lock screen widgets surface a word during the day for a quick listen, OCR lets you scan a real sign and hear how it should sound, and the mock JLPT listening sections test the contrast under realistic pressure. Beat by beat, the tiny sounds stop being tiny.
Stop Losing Meaning to a Missing Beat
Kanjijo trains long vowels and the small っ through listening practice, example sentences, SRS, exclusive mnemonics, reading, OCR scanning, widgets, and mock JLPT listening.
Download Kanjijo FreeFrequently Asked Questions
No. As a small っ it is silent and doubles the following consonant. It is only “tsu” when written full-size as つ.
Exactly one extra beat, the same length as any other mora. Think of it as two equal beats of the same vowel, not one stretched-out sound.
Often yes, especially with minimal pairs like きて / きって or 主人 / 囚人. Context helps, but correct length removes the guesswork and makes you sound far more natural.