Word Frequency Studies by Ioan-Iovitz Popescu

By Ioan-Iovitz Popescu

Word frequency plays a prominent role in many scientific and applied fields. The book presents innovative research methods and new results important for language and text characterization. Based on a general theory, surprising interrelations are shown between word frequency and other linguistic properties. Interrelations between previously known methods and new characteristics, such as the h-point and other measures developed in the book, are investigated. In addition, new statistical tests are introduced.

Similar foreign language study & reference books

Language Change and Variation

The study of language variation in social context continues to hold the attention of a large number of linguists. This research is promoted by the annual colloquia on 'New Ways of Analyzing Variation in English' (NWAVE). This volume is a selection of revised papers from NWAVE XI, held at Georgetown University.

Diachronic problems in phonosymbolism

Phonosymbolism, or sound symbolism (Lautsymbolik), is an important component of language development. Many serious scholars, however, have regarded it with embarrassment or indifference. A cautious reintroduction of phonosymbolism as a factor responsible for changes undergone, in varying degrees, by most languages would now seem to be in order.

Additional info for Word Frequency Studies

Example text

We add the relative frequencies up to the given r to get F(r) = P(R ≤ r). Though the symbol P is commonly used for probability, we ‘misuse’ it here for relative frequencies. [...] i.e. r_rel = r/r_max. Since r_max = V, we obtain r_rel = r/V. Let us illustrate this kind of counting using Goethe’s Erlkönig (cf. Popescu & Altmann 2006). [...] 14 presents the cumulative distribution of ranked word forms of Goethe’s Erlkönig; both the relative ranks r_rel, obtained by dividing each r by V = 124, and the cumulative relative frequencies (F_r,rel) are given.
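The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `cumulative_rank_distribution` and the toy sentence are hypothetical and not taken from the book, and tokens are counted as raw word forms with no normalization.

```python
from collections import Counter


def cumulative_rank_distribution(tokens):
    """Return rows (r, f, r_rel, F) for ranked word forms.

    r      rank (1 = most frequent word form)
    f      absolute frequency at that rank
    r_rel  relative rank r / V, where V = r_max is the vocabulary size
    F      cumulative relative frequency up to and including rank r
    """
    counts = Counter(tokens)
    freqs = sorted(counts.values(), reverse=True)
    n = sum(freqs)  # text length N
    v = len(freqs)  # vocabulary size V = r_max
    rows = []
    running = 0
    for r, f in enumerate(freqs, start=1):
        running += f
        rows.append((r, f, r / v, running / n))
    return rows


# Hypothetical toy text, not the Erlkönig data from the book.
toy = "the king and the child and the night".split()
for r, f, r_rel, F in cumulative_rank_distribution(toy):
    print(r, f, round(r_rel, 3), round(F, 3))
```

The last row always has r_rel = 1 and F = 1, since the final rank equals V and all N tokens have been accumulated.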

[...] 33% of the text. One can say that this is the coverage by auxiliaries. But as already mentioned, there can be some autosemantics among them and, vice versa, some auxiliaries are located after the h-point. [...] 1 (p. 17). Half of this area, relativized by N (the full area under the rank-frequency curve), will be subtracted from F(h), yielding a new index F*(h) = F(h) − h²/(2N). [...] 1333, which represents the corrected h-coverage of Goethe’s text (G-17). This number is simply a proportion to which a constant is added, hence the construction of an asymptotic test does not present difficulties.
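A rough sketch of the corrected h-coverage follows. It rests on two assumptions that the excerpt does not spell out: the h-point is simplified to the largest rank r whose frequency is at least r (an integer approximation, with no interpolation between ranks), and the subtracted term is taken to be half of an h-by-h square relativized by N, that is h²/(2N). The function names and the toy frequencies are hypothetical.

```python
def h_point(freqs):
    """Largest rank r with f(r) >= r (simplified, integer-valued h)."""
    ranked = sorted(freqs, reverse=True)
    h = 0
    for r, f in enumerate(ranked, start=1):
        if f >= r:
            h = r
        else:
            break
    return h


def corrected_h_coverage(freqs):
    """Return (h, F(h), F(h) - h**2 / (2 * N)) for a list of frequencies.

    Assumption: the correction term is half of the h-by-h square
    divided by the text length N.
    """
    ranked = sorted(freqs, reverse=True)
    n = sum(ranked)  # text length N
    h = h_point(ranked)
    coverage = sum(ranked[:h]) / n  # F(h): share of text up to rank h
    return h, coverage, coverage - h ** 2 / (2 * n)


# Hypothetical toy frequencies, not data from the book.
h, f_h, f_h_corrected = corrected_h_coverage([5, 4, 3, 2, 1, 1])
print(h, f_h, f_h_corrected)
```

Because the correction only subtracts a constant from a proportion, the corrected value behaves like F(h) shifted downward, which is why the text can treat it with a standard test for proportions.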

[...] that point where y = 0. Inserting this value in the above formula, we obtain x = 4 = k, where k denotes both coordinates of the k-point. [...] 5. It is a specific text characteristic, but it depends strongly on the vocabulary V, which in turn depends on text length N. Nevertheless, we know that words from x = 1 to x = k are just the autosemantics making up the vocabulary richness. But before we define a new index, some notes on vocabulary richness are in order.

2 A second look at vocabulary richness

The vocabulary richness of a given text is a problematic concept which suffers from the dependence of the majority of procedures on text length N.
