In corpus and computational linguistics, email is a distinctive and important text type. Its usage spans many purposes and borrows linguistic features from both written and spoken text. Today, email touches the lives of most Australians and is a primary communication medium for both workplace and personal conversation, often replacing letter writing, postcards, phone calls, and memoranda. As such a widely used communication channel, it is crucial that email data be included in the Australian National Corpus. Current research on the language and use of email is limited by the scarcity of available real-world email data. Including email in the Australian National Corpus would provide researchers the opportunity to expand beyond a single dominant corpus. In particular this would make available a collection of email that reflects Australian language and culture to linguists, language technologists, sociologists, and other researchers.
Selected Proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus: Mustering Languages
edited by Michael Haugh, Kate Burridge, Jean Mulder, and Pam Peters
Table of contents