The Handbook of Discourse Analysis
The Handbook of Discourse Analysis The Handbook of Discourse Analysis
186 Douglas Biber and Susan Conrad first and second person pronouns, questions, “private” verbs (such as think or know), and contractions. The bottom group has fewer features, including nouns, attributive adjectives, and prepositional phrases. The statistical analysis shows that these two groups have a complementary relationship and thus constitute a single dimension: when a text has frequent occurrences of the top group of features, it will tend to have few occurrences of the bottom group, and vice versa. When dimension scores are computed for English Dimension 1, conversation texts are identified as the register that makes the most frequent use of the top group of features. Figure 9.1 plots the Dimension 1 score for several English registers, providing a graphic representation of the relations among registers with respect to this group of linguistic features. Conversation texts, with the largest positive Dimension 1 score, tend to have frequent occurrences of first and second person pronouns, questions, stance verbs, hedges, and the other features above the dashed line; at the same time, relative to the other registers, conversation texts have notably few occurrences of nouns, adjectives, prepositional phrases, and long words. At the other extreme, registers such as official documents and academic prose have the largest negative score, showing that they are marked for the opposite linguistic characteristics: very frequent occurrences of nouns, adjectives, prepositional phrases, and long words, combined with notably few occurrences of first and second person pronouns, questions, stance verbs, etc. Considering both the defining linguistic features together with the distribution of registers, each dimension can be interpreted in functional terms. Thus, the top group of linguistic features on English Dimension 1, associated most notably with conversation, is interpreted as reflecting interactiveness, high involvement, and on-line production. For example, interactiveness and involvement are reflected in the frequent use of you and I, and the private verbs that convey the thoughts and feelings of the participants, as well as many other features. The reduced and vague forms – such as contractions, that deletions, and general emphatics and hedges – are typical of language produced under real-time constraints. The bottom group of linguistic features, associated most notably with informational exposition, is interpreted as reflecting careful production and an informational focus. That is, as exemplified below, nouns, prepositional phrases, and attributive adjectives all function to convey densely packed information, and the higher type–token ratio and longer words reflect a precise and often specialized choice of words. Such densely informational and precise text is nearly impossible to produce without time for planning and revision. As noted earlier, one of the advantages of a comparative register perspective is to understand the linguistic characteristics of a particular register relative to a representative range of registers in the language. This advantage can be illustrated with respect to the specific register of research articles in biology (in the subdiscipline of ecology). Figure 9.1 shows that this register is extremely marked on Dimension 1, with a considerably larger negative score than academic prose generally. Even a short extract from an article shows the high density of informational features from Dimension 1 (nouns are underlined, prepositions italicized, and attributive adjectives capitalized): There were MARKED differences in root growth into regrowth cores among the three communities, both in the distribution of roots through the cores and in the
Register Variation: A Corpus Approach 187 response to ELEVATED CO 2 . In the Scirpus community, root growth was evenly distributed throughout the 15-cm profile, with no SIGNIFICANT differences in root biomass among the 5-cm sampling intervals within a treatment. All three of these features serve the purpose of densely packing the text with information about specific referents. Nouns refer to entities or concepts, and are then further specified by prepositional phrases, attributive adjectives, or other nouns which function as premodifiers (e.g. root growth). Clearly, the emphasis in this text is on transmitting information precisely and concisely, not on interactive or affective concerns. Furthermore, by considering the scores of other registers on Dimension 1, we can see that such densely packed informational features are not typical in more colloquial registers of English. For this reason, it is not surprising that many novices experience difficulty when asked to read biology research articles or write up research reports like a professional (cf. Walvoord and McCarthy 1990; Wilkinson 1985). Even with this very brief examination of just one dimension in the MD model of English, we can see why, linguistically, these texts are challenging and why students are unlikely to have had practice with such densely informational prose. 2.2 Comparison of the major oral/literate dimensions in English, Korean, and Somali The MD methodological approach outlined in the last section has been applied to the analysis of register variation in English, Korean, and Somali. Biber (1995) provides a full description of the corpora, computational and statistical techniques, linguistic features analyzed, and multidimensional patterns of register variation for each of these languages. That book synthesizes these studies to focus on typological comparisons across languages. Here we present only a summary of some of the more striking cross-linguistic comparisons. Table 9.4 presents a summary of the major “oral/literate” dimensions in English, Korean, and Somali. Oral/literate dimensions distinguish between stereotypical speech – i.e. conversation – at one pole, versus stereotypical writing – i.e. informational exposition – at the other pole. However, as discussed below, each of these dimensions is composed of a different set of linguistic features, each has different functional associations, and each defines a different set of relations among the full range of spoken and written registers. The first column in table 9.4 lists the co-occurring linguistic features that define each dimension. Most dimensions comprise two groups of features, separated by a dashed line on table 9.4. As discussed above for Dimension 1 in English, these two groups represent sets of features that occur in a complementary pattern. That is, when the features in one group occur together frequently in a text, the features in the other group are markedly less frequent in that text, and vice versa. To interpret the dimensions, it is important to consider likely reasons for the complementary distribution of these two groups of features as well as the reasons for the co-occurrence pattern within each group. It should be emphasized that the co-occurrence patterns underlying dimensions are determined empirically (by a statistical factor analysis) and not on any a priori
- Page 158 and 159: 136 Gregory Ward and Betty J. Birne
- Page 160 and 161: 138 Laurel J. Brinton 7 Historical
- Page 162 and 163: 140 Laurel J. Brinton of particular
- Page 164 and 165: 142 Laurel J. Brinton Shakespeare,
- Page 166 and 167: 144 Laurel J. Brinton argues that i
- Page 168 and 169: 146 Laurel J. Brinton typical of th
- Page 170 and 171: 148 Laurel J. Brinton 1 What is the
- Page 172 and 173: 150 Laurel J. Brinton politeness. 1
- Page 174 and 175: 152 Laurel J. Brinton that this are
- Page 176 and 177: 154 Laurel J. Brinton functions; -s
- Page 178 and 179: 156 Laurel J. Brinton Linguistics,
- Page 180 and 181: 158 Laurel J. Brinton Kastovsky, Di
- Page 182 and 183: 160 Laurel J. Brinton semantic-prag
- Page 184 and 185: 162 John Myhill verb, and direct ob
- Page 186 and 187: 164 John Myhill refer to a supposed
- Page 188 and 189: 166 John Myhill (2) Binasa ng lalak
- Page 190 and 191: 168 John Myhill These data have bee
- Page 192 and 193: 170 John Myhill Sometimes, translat
- Page 194 and 195: 172 John Myhill text-count methodol
- Page 196 and 197: 174 John Myhill Sun, Chaofen and Ta
- Page 198 and 199: 176 Douglas Biber and Susan Conrad
- Page 200 and 201: 178 Douglas Biber and Susan Conrad
- Page 202 and 203: 180 Douglas Biber and Susan Conrad
- Page 204 and 205: 182 Douglas Biber and Susan Conrad
- Page 206 and 207: 184 Douglas Biber and Susan Conrad
- Page 210 and 211: 188 Douglas Biber and Susan Conrad
- Page 212 and 213: 190 Douglas Biber and Susan Conrad
- Page 214 and 215: 192 Douglas Biber and Susan Conrad
- Page 216 and 217: 194 Douglas Biber and Susan Conrad
- Page 218 and 219: 196 Douglas Biber and Susan Conrad
- Page 221 and 222: Nine Ways of Looking at Apologies 1
- Page 223 and 224: Nine Ways of Looking at Apologies 2
- Page 225 and 226: Nine Ways of Looking at Apologies 2
- Page 227 and 228: Nine Ways of Looking at Apologies 2
- Page 229 and 230: Nine Ways of Looking at Apologies 2
- Page 231 and 232: Nine Ways of Looking at Apologies 2
- Page 233 and 234: Nine Ways of Looking at Apologies 2
- Page 235 and 236: Nine Ways of Looking at Apologies 2
- Page 237 and 238: Interactional Sociolinguistics 215
- Page 239 and 240: Interactional Sociolinguistics 217
- Page 241 and 242: Interactional Sociolinguistics 219
- Page 243 and 244: Interactional Sociolinguistics 221
- Page 245 and 246: Interactional Sociolinguistics 223
- Page 247 and 248: Interactional Sociolinguistics 225
- Page 249 and 250: Interactional Sociolinguistics 227
- Page 251 and 252: Discourse as an Interactional Achie
- Page 253 and 254: Discourse as an Interactional Achie
- Page 255 and 256: Discourse as an Interactional Achie
- Page 257 and 258: Discourse as an Interactional Achie
Register Variation: A Corpus Approach 187<br />
response to ELEVATED CO 2 . In the Scirpus community, root growth was evenly<br />
distributed throughout the 15-cm pr<strong>of</strong>ile, with no SIGNIFICANT differences in root<br />
biomass among the 5-cm sampling intervals within a treatment.<br />
All three <strong>of</strong> these features serve the purpose <strong>of</strong> densely packing the text with information<br />
about specific referents. Nouns refer to entities or concepts, and are then further<br />
specified by prepositional phrases, attributive adjectives, or other nouns which function<br />
as premodifiers (e.g. root growth). Clearly, the emphasis in this text is on transmitting<br />
information precisely and concisely, not on interactive or affective concerns.<br />
Furthermore, by considering the scores <strong>of</strong> other registers on Dimension 1, we can<br />
see that such densely packed informational features are not typical in more colloquial<br />
registers <strong>of</strong> English. For this reason, it is not surprising that many novices experience<br />
difficulty when asked to read biology research articles or write up research reports<br />
like a pr<strong>of</strong>essional (cf. Walvoord and McCarthy 1990; Wilkinson 1985). Even with this<br />
very brief examination <strong>of</strong> just one dimension in the MD model <strong>of</strong> English, we can see<br />
why, linguistically, these texts are challenging and why students are unlikely to have<br />
had practice with such densely informational prose.<br />
2.2 Comparison <strong>of</strong> the major oral/literate dimensions in<br />
English, Korean, and Somali<br />
<strong>The</strong> MD methodological approach outlined in the last section has been applied to the<br />
analysis <strong>of</strong> register variation in English, Korean, and Somali. Biber (1995) provides a<br />
full description <strong>of</strong> the corpora, computational and statistical techniques, linguistic<br />
features analyzed, and multidimensional patterns <strong>of</strong> register variation for each <strong>of</strong><br />
these languages. That book synthesizes these studies to focus on typological comparisons<br />
across languages. Here we present only a summary <strong>of</strong> some <strong>of</strong> the more striking<br />
cross-linguistic comparisons.<br />
Table 9.4 presents a summary <strong>of</strong> the major “oral/literate” dimensions in English,<br />
Korean, and Somali. Oral/literate dimensions distinguish between stereotypical speech<br />
– i.e. conversation – at one pole, versus stereotypical writing – i.e. informational<br />
exposition – at the other pole. However, as discussed below, each <strong>of</strong> these dimensions<br />
is composed <strong>of</strong> a different set <strong>of</strong> linguistic features, each has different functional<br />
associations, and each defines a different set <strong>of</strong> relations among the full range <strong>of</strong><br />
spoken and written registers.<br />
<strong>The</strong> first column in table 9.4 lists the co-occurring linguistic features that define<br />
each dimension. Most dimensions comprise two groups <strong>of</strong> features, separated by a<br />
dashed line on table 9.4. As discussed above for Dimension 1 in English, these two<br />
groups represent sets <strong>of</strong> features that occur in a complementary pattern. That is,<br />
when the features in one group occur together frequently in a text, the features in the<br />
other group are markedly less frequent in that text, and vice versa. To interpret the<br />
dimensions, it is important to consider likely reasons for the complementary distribution<br />
<strong>of</strong> these two groups <strong>of</strong> features as well as the reasons for the co-occurrence<br />
pattern within each group.<br />
It should be emphasized that the co-occurrence patterns underlying dimensions<br />
are determined empirically (by a statistical factor analysis) and not on any a priori