Yuri N. Klimov. Rank distribution of word frequencies and word lengths in Russian translations of selected Japanese verses - Japanese love lyrics (manjesu)
Submitted on: Feb 14, 2013, 07:42:46
Description: The ~200 Japanese verses [1] contain 1637 distinct words, or 4236 word occurrences when frequency is taken into account, as counted by the technique of [2]. The following dependences are investigated: word frequency versus rank; the logarithm of word frequency versus the logarithm of rank; the logarithm of cumulative word frequency versus the logarithm of rank; word length versus rank; the logarithm of word length versus rank; cumulative word length versus rank; cumulative word length versus the logarithm of rank; the logarithm of cumulative word length versus the logarithm of rank; the product of word frequency and rank versus rank; and the product of the logarithm of word frequency and rank versus the logarithm of rank. It is shown for the first time that superimposing a straight line on the logarithmic curve reveals three non-uniform zones in the distribution of cumulative word frequency versus the logarithm of rank: zone I (the nuclear zone), ranks 1 to 206, with a cumulative word frequency of 2276; zone II, ranks 207 to 1231, with a cumulative word frequency of 1447; and zone III, ranks 1232 to 1638, with a cumulative word frequency of 407. The ratio of cumulative word frequencies across the zones is therefore 1:0.64:0.18. The distribution of cumulative word length versus the logarithm of rank likewise shows three zones: zone I (the nuclear zone), ranks 1 to 237, with a cumulative word length of 2298; zone II, ranks 238 to 1177, with a cumulative word length of 5887; and zone III, ranks 1178 to 1638, with a cumulative word length of 1637. The ratio of cumulative word lengths across the zones is 1:2.56:0.71, which is larger than for the preceding Bradford distribution. The results obtained effectively bring together research on the rank distributions of articles across thematic sections of documentary information flows and research in linguistic synergetics, on the basis of the Zipf and Bradford distributions. According to G. K. Zipf, the product of word frequencies in texts...
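The quantities named in the description can be illustrated with a minimal Python sketch. It is not the author's technique [2], which is not given here. The function names `rank_table` and `zone_ratio`, the alphabetical tie-breaking between equally frequent words, the base-10 logarithm, and the choice to add each distinct word's length once per rank are all assumptions made for illustration. The zone boundaries are not computed; the zone totals passed to `zone_ratio` are the figures quoted in the description.

```python
import math
from collections import Counter


def rank_table(words):
    """Build a rank table from a list of word tokens.

    Words are ranked by descending frequency; ties are broken
    alphabetically (an assumption, the source does not say how).
    """
    counts = Counter(words)
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))

    rows = []
    cum_freq = 0  # running sum of frequencies over ranks
    cum_len = 0   # running sum of word lengths, one word per rank (assumption)
    for rank, (word, freq) in enumerate(ordered, start=1):
        cum_freq += freq
        cum_len += len(word)
        rows.append({
            "rank": rank,
            "word": word,
            "freq": freq,
            "freq_x_rank": freq * rank,     # product of frequency and rank
            "log_rank": math.log10(rank),   # base 10 is an assumption
            "cum_freq": cum_freq,
            "length": len(word),
            "cum_length": cum_len,
        })
    return rows


def zone_ratio(zone_totals):
    """Express each zone total relative to the first (nuclear) zone."""
    first = zone_totals[0]
    return [round(total / first, 2) for total in zone_totals]


if __name__ == "__main__":
    # Toy input, not data from the study.
    for row in rank_table("a b a c b a".split()):
        print(row)

    # Zone totals as quoted in the description.
    print(zone_ratio([2276, 1447, 407]))   # [1.0, 0.64, 0.18]
    print(zone_ratio([2298, 5887, 1637]))  # [1.0, 2.56, 0.71]
```

With the zone totals from the description, `zone_ratio` reproduces the stated ratios 1:0.64:0.18 for cumulative word frequency and 1:2.56:0.71 for cumulative word length.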