Cascadilla Proceedings Project: Paper 2284 Abstract


List of proceedings

Enter a document #:
Enter search terms:




Info for readers

Info for authors

Info for editors

Info for libraries



Order form

Shopping cart

Towards the Design of the Australian National Corpus
Phuong Dzung Pho
25-29 (complete pdf)
Bookmark and Share

Although there are currently several corpora of Australian English, they have not been widely used due to their small size or scope in comparison with well-known corpora such as the British National Corpus (BNC) and the American National Corpus (ANC). In order to compile a corpus that will be widely used, it is necessary to make it comparable to those large corpora. This paper thus reviews the designs of current widely used corpora in the world and proposes a design for the Australian National Corpus (AusNC). In doing so, this paper outlines what needs to be taken into consideration to compile an Australian corpus, such as timeline, various genres or categories to be included in the corpus, and selection criteria for texts to be included in each genre or subgenre. A careful design of the corpus before actual data are collected or accepted for inclusion in the corpus will help avoid waste of resources.



Published in:
Selected Proceedings of the 2008 HCSNet Workshop on Designing the Australian National Corpus: Mustering Languages
edited by Michael Haugh, Kate Burridge, Jean Mulder, and Pam Peters

Table of contents

ISBN 978-1-57473-435-5 library binding
vi+113 pages
publication date: 2009
published by Cascadilla Proceedings Project, Somerville, MA, USA

Printed edition: $190.00



Copyright © 2009 Cascadilla Proceedings Project. All rights reserved. To request permission to copy any elements from our pages, or to send comments or questions about our pages, please write to webmaster@cascadilla.com and make sure to provide the URL of the particular page.