Relating Lexical Items to Sociolinguistic Features in a Spontaneous Speech Corpus of Spanish
This paper shows the application of statistical tests to a spontaneous speech corpus of Spanish. Our goal is to find representative differences between different parts of the corpus. To this end, we tagged n-grams in the corpus with features related to the speaker (age, gender, etc), or the context (dialogue, monologue, media, etc), and applied
Read more