基本説明
A range of electronic corpora has become increasingly accessible via the WWW and CD-ROM. This development coincided with improvements in the standards governing the collecting, encoding and archiving of such data. Less attention, however, has been paid to making other types of digital data available. This is especially true of that which one might describe as 'unconventional', namely, dialects, child language and bilingual databases. This book is a first step toward developing similar standards for enriching and preserving these neglected resources.
Full Description
A range of electronic corpora is increasingly accessible via the WWW and CD-ROM. This development coincided with improved standards governing the collecting, encoding and archiving of such data. This book looks at developing similar standards for enriching and preserving unconventional data: dialects, child language and bilingual databases.
Contents
Notes on Contributors Foreword: S.Poplack Taming Digital Voices and Texts: Models and Methods for Handling Unconventional Synchronic Corpora; J.C.Beal, K.P.Corrigan & H.L.Moisl SCOTS: Scottish Corpus of Texts and Speech; J.Anderson, D.Beavan & C.Kay FRED - The Freiburg English Dialect Corpus; L.Anderwald & S.Wagner The Syntactic Atlas of the Dutch Dialects (SAND): A Corpus of Elicited Speech as an On-line Dynamic Atlas; S.Barbiers, L.Cornips & J-P.Kunst Coding and Analyzing Multilingual Data: The LIDES Project; P.Gardner-Chloros, M.G.Moyer & M.Sebba ICE-Ireland: Local Variations on Global Standards; J.Kallen & J.Kirk The Talkbank Project; B.MacWhinney Developing and Using a Corpus of Written Creole; M.Sebba & S.Dray Representing Real Language: Consistency, trade-offs and thinking ahead!; S.A.Tagliamonte Index



