OLAC Record oai:www.ldc.upenn.edu:LDC2000S88 |
Metadata | ||
Title: | 1999 HUB4 Broadcast News Evaluation English Test Material | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Linguistic Data Consortium. 1999 HUB4 Broadcast News Evaluation English Test Material LDC2000S88. Web Download. Philadelphia: Linguistic Data Consortium, 2000 | |
Contributor: | Linguistic Data Consortium | |
Date (W3CDTF): | 2000 | |
Description: | *Introduction* This publication contains the English evaluation test material used in the 1999 NIST Broadcast News Transcription Evaluation, administered by the NIST Spoken Natural Language Processing Group and produced by the Linguistic Data Consortium. Catalog number LDC2000S88, ISBN 1-58563-176-0. *Data* The test material is contained in two SPHERE-formatted waveform files. The file bn99en_1.sph (set1) contains 1.5 hours of Broadcast News excerpts from last year's set2 epoch. The file bn99en_2.sph (set2) contains 1.5 hours of Broadcast News excerpts from the summer of 1998. Each file should be recognized separately, per the Broadcast News English Evaluation Specification. Additional test material for each set is also included: evaluation map files (bn99en_1.uem), automatically generated segmentation files (bn99en_1.seg), transcripts from the evaluation (bn99en_1.utf) together with the utf.dtd used to validate them, reference STM files (bn99en_1.stm), and transcript orthography mapping files (en981118.glm). For more complete information, see the 1998 HUB4 Website. *Updates* There are no updates at this time. Note that the waveform and transcript data on this disc are licensed through the Linguistic Data Consortium (LDC) and are subject to usage restrictions. Contact the LDC for license agreement information. *Additional Licensing Instructions* This 'members-only' corpus is available to current members, who can request the data at the listed reduced-license fee. Contact ldc@ldc.upenn.edu for information about becoming a member. | |
Extent: | Corpus size: 352339 KB | |
Identifier: | LDC2000S88 | |
https://catalog.ldc.upenn.edu/LDC2000S88 | ||
ISBN: 1-58563-176-0 | ||
ISLRN: 691-755-940-811-0 | ||
DOI: 10.35111/r4e7-nb71 | ||
Language: | English | |
Language (ISO639): | eng | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2000S88 | |
Rights Holder: | Portions Copyright 1998 PRI-Public Radio International. Portions Copyright 1997-1998 ABC News. Portions Copyright 1998 NBC News. Portions Copyright 1997-1998 Cable News Network, Inc. All Rights Reserved. Note that the waveform and transcript data on this disc are licensed through the Linguistic Data Consortium (LDC, http://www.ldc.upenn.edu) and are subject to usage restrictions. Contact the LDC for license agreement information. | |
Type (DCMI): | Sound | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2000S88 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Linguistic Data Consortium. 2000. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text |