lybrary wrote:Bill Mullins wrote:I'd welcome the specific references to the stylometry literature. Particularly since, again, Olsson has a table of the frequency of use of phrases containing personal pronouns on p. 51.
I am not seeing that table. On page 51 is a table for synonyms of 'learn', 'study', etc. No personal pronouns I can see.
I am referring to Table 5.3 on p. 51 of the 2008 edition of his book Forensic Linguistics (2nd edition, 2008). It includes statistics on phrases including "I", "he," "she," "we," and "they." I should have made it clear that I was referring to his textbook.
Read: D. L. Hoover, “Delta prime?,” Literary and Linguistic Computing, vol. 19, no. 4, pp. 477–495, 2004, and D. L. Hoover, “Testing Burrows’s Delta,” Literary and Linguistic Computing, vol. 19, no. 4, pp. 453–475, 2004.
Thanks for the references. But 24 hour rental on these papers is $42 each, so I'll take some time to get them through ILL.
"For example, if the document of interest is a novel written in third person, the distribution of pronouns will be radically different than that of a novel written in first person, not by virtue of an authorship difference, but simply from genre."
But we aren't comparing two novels written in different voices. We are comparing two books designed to give instruction to the reader (as I mentioned in the original discussion.) Erdnase uses the third person to refer to the reader, and Gallaway uses the second person. This isn't an issue of stylometry, it is an observation that the two writers, who are doing the same thing, do so in different linguistic styles.
However, the more fundamental problem is that function word analysis doesn't work for texts of such different subject and 25-30 years apart. Some authors do modify their style over time.
This is an argument against the analysis that Olsson did as well. If the author's style stays the same, you should be able to compare the works based on style. If it evolves, how do you account for it? As near as I can tell, Olsson's conclusions are based on the style being constant between 1902 and 1927.