URL: Link To Resource

Access: No Restrictions

As part of the Language Archives Project (National Digital Archive Project) the main aim of the "Mandarin spoken corpora project" is to collect a wide variety of speech data of Taiwan Mandarin and to digitally archive the use of Taiwan Mandarin in audio and video data formats. The project consists of (1) speech data collection and processing, (2) toolkit and database development, (3) metadata management, (4) speech annotation design and (5) web query system construction. Funded by the Institute of Linguistics, the National Science Council and the National Digital Archives Project, this database contains three Mandarin spoken corpora, “Mandarin Topic-oriented Conversation Corpus” (MTCC), “Mandarin Conversational Dialogue Corpus” (MCDC) and “Mandarin Map Task Corpus” (MMTC). The annotation systems include “discourse annotation,” “detailed spontaneous speech phenomena” and “particular phonetic phenomena”. Web users can also use the web query system to search for keywords and annotations marked in the corpora mentioned above. Creator: Academia Sinica
English Title: Archives and Linguistic Representations of Spoken Taiwan Mandarin
Romanized Title: xin shi ji yu liao ku - duo mei ti de yu yan cheng xian han dian cang
Vernacular Title: 新世紀語料庫—多媒體的語言呈現和典藏
Vernacular Title 2: 新世纪语料库—多媒体的语言呈现和典藏


Posted September 2nd, 2008 by Chiun Chau

Back to top