Corpus Linguistics and the WebMarianne Hundt, Nadja Nesselhauf, Carolin Biewer Using the Web as Corpus is one of the recent challenges for corpus linguistics. This volume presents a current state-of-the-arts discussion of the topic. The articles address practical problems such as suitable linguistic search tools for accessing the www, the question of register variation, or they probe into methods for culling data from the web. The book also offers a wide range of case studies, covering morphology, syntax, lexis, as well as synchronic and diachronic variation in English. These case studies make use of the two approaches to the www in corpus linguistics - web-as-corpus and web-for-corpus-building. The case studies demonstrate that web data can provide useful additional evidence for a broad range of research questions. |
Contents
1 | |
25 | |
an integrated system for web text search | 47 |
Compiling corpora from the internet | 69 |
message boards | 87 |
a multi | 109 |
Language variation and change | 110 |
Critical voices | 133 |
a case study | 167 |
Determinants of grammatical variation in English and the for | 191 |
Recalcitrant problems of comparative alternation and new | 211 |
integrating the | 233 |
The dynamics of inner and outer circle varieties in the South | 249 |
He rung the bell and she drunk ale nonstandard past tense | 271 |
Diachronic analysis with the internet? Will and shall in | 287 |
using the BNC for exploring the | 151 |
Other editions - View all
Corpus Linguistics and the Web Marianne Hundt,Nadja Nesselhauf,Carolin Biewer No preview available - 2007 |
Common terms and phrases
abstract adjectives adverbials aequi American English analysis analytic comparative animacy approach Biber British English British National Corpus Brown Corpus Bybee Canada online Cluster CNN transcripts collocations compiled Computational Concordancing contexts corpora corpus linguistics corpus-based database diachronic dimension scores discourse documents domain downloaded example extract factors fiction Figure Fletcher forms forums genres Google categories I-language identify inanimate modifiers included instances internet data investigation lexical linguistic research matching million words Mondorf N+N constructions newspapers non-standard nouns occur outer circle varieties paper Parallel Structures past tense patterns plural prepositional present perfect preterite problems pronouns query registers relative frequency relevant Renouf retrieval Rohdenburg S-ARCHER s-genitives Science search engine search term semantic SPEAC-1 speakers statistical Table text categories text samples text types textual universe TVAus users verbs volume WebCorp WebFict world wide world wide web