OLAC Record: Multi-Channel WSJ Audio

OLAC Record
oai:www.ldc.upenn.edu:LDC2014S03

Metadata

Title: Multi-Channel WSJ Audio

Access Rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining

Bibliographic Citation: Lincoln, Mike, Erich Zwyssig, and Iain McCowan. Multi-Channel WSJ Audio LDC2014S03. Web Download. Philadelphia: Linguistic Data Consortium, 2014

Contributor: Lincoln, Mike

Zwyssig, Erich

McCowan, Iain

Date (W3CDTF): 2014

Date Issued (W3CDTF): 2014-04-15

Description: *Introduction* Multi-Channel WSJ Audio (MCWSJ) was developed by the Centre for Speech Technology Research at The University of Edinburgh and contains approximately 100 hours of recorded speech from 45 British English speakers. Participants read Wall Street Journal texts published in 1987-1989 in three recording scenarios: a single stationary speaker, two stationary overlapping speakers and one single moving speaker. This corpus was designed to address the challenges of speech recognition in meetings, which often occur in rooms with non-ideal acoustic conditions and significant background noise, and may contain large sections of overlapping speech. Using headset microphones represents one approach, but meeting participants may be reluctant to wear them. Microphone arrays are another option. MCWSJ supports research in large vocabulary tasks using microphone arrays. The news sentences read by speakers are taken from WSJCAM0 Cambridge Read News, a corpus originally developed for large vocabulary continuous speech recognition experiments, which in turn was based on CSR-1 (WSJ0) Complete, made available by LDC to support large vocabulary continuous speech recognition initiatives. *Data* Speakers reading news text from prompts were recorded using a headset microphone, a lapel microphone and an eight-channel microphone array. In the single speaker scenario, participants read from six fixed positions. Fixed positions were assigned for the entire recording in the overlapping scenario. For the moving scenario, participants moved from one position to the next while reading. Fifteen speakers were recorded for the single scenario, nine pairs for the overlapping scenario and nine individuals for the moving scenario. Each read approximately 90 sentences. The audio data are presented as single channel 16kHz flac compressed wav files. *Samples* Please listen to the below samples. * Overlapping Sample * Stationary Sample * Moving Sample *Updates* None at this time.

Extent: Corpus size: 5145608 KB

Format: Sampling Rate: 16000

Sampling Format: pcm

Identifier: LDC2014S03

https://catalog.ldc.upenn.edu/LDC2014S03

ISBN: 1-58563-674-6

ISLRN: 766-428-479-143-5

DOI: 10.35111/zd7f-qr83

Language: English

Language (ISO639): eng

License: Multi-Channel WSJ Audio: https://catalog.ldc.upenn.edu/license/multi-channel-wsj-audio.pdf

Medium: Distribution: Web Download

Publisher: Linguistic Data Consortium

Publisher (URI): https://www.ldc.upenn.edu

Relation (URI): https://catalog.ldc.upenn.edu/docs/LDC2014S03

Rights Holder: Portions © 1987-1989 Dow Jones & Company, Inc., © 2014 University Court of the University of Edinburgh, © 1992-1995, 2014 Trustees of the University of Pennsylvania

Type (DCMI): Sound

Type (OLAC): primary_text

OLAC Info

Archive: The LDC Corpus Catalog

Description: http://www.language-archives.org/archive/www.ldc.upenn.edu

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:www.ldc.upenn.edu:LDC2014S03

DateStamp: 2020-11-30

GetRecord: OAI-PMH request for simple DC format

Search Info
Citation: Lincoln, Mike; Zwyssig, Erich; McCowan, Iain. 2014. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text

http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2014S03
Up-to-date as of: Wed Feb 26 18:31:49 EST 2025

Metadata
Title:		Multi-Channel WSJ Audio
Access Rights:		Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:		Lincoln, Mike, Erich Zwyssig, and Iain McCowan. Multi-Channel WSJ Audio LDC2014S03. Web Download. Philadelphia: Linguistic Data Consortium, 2014
Contributor:		Lincoln, Mike
		Zwyssig, Erich
		McCowan, Iain
Date (W3CDTF):		2014
Date Issued (W3CDTF):		2014-04-15
Description:		Introduction Multi-Channel WSJ Audio (MCWSJ) was developed by the Centre for Speech Technology Research at The University of Edinburgh and contains approximately 100 hours of recorded speech from 45 British English speakers. Participants read Wall Street Journal texts published in 1987-1989 in three recording scenarios: a single stationary speaker, two stationary overlapping speakers and one single moving speaker. This corpus was designed to address the challenges of speech recognition in meetings, which often occur in rooms with non-ideal acoustic conditions and significant background noise, and may contain large sections of overlapping speech. Using headset microphones represents one approach, but meeting participants may be reluctant to wear them. Microphone arrays are another option. MCWSJ supports research in large vocabulary tasks using microphone arrays. The news sentences read by speakers are taken from WSJCAM0 Cambridge Read News, a corpus originally developed for large vocabulary continuous speech recognition experiments, which in turn was based on CSR-1 (WSJ0) Complete, made available by LDC to support large vocabulary continuous speech recognition initiatives. Data Speakers reading news text from prompts were recorded using a headset microphone, a lapel microphone and an eight-channel microphone array. In the single speaker scenario, participants read from six fixed positions. Fixed positions were assigned for the entire recording in the overlapping scenario. For the moving scenario, participants moved from one position to the next while reading. Fifteen speakers were recorded for the single scenario, nine pairs for the overlapping scenario and nine individuals for the moving scenario. Each read approximately 90 sentences. The audio data are presented as single channel 16kHz flac compressed wav files. Samples Please listen to the below samples. * Overlapping Sample * Stationary Sample * Moving Sample Updates None at this time.
Extent:		Corpus size: 5145608 KB
Format:		Sampling Rate: 16000
Format:		Sampling Format: pcm
Identifier:		LDC2014S03
		https://catalog.ldc.upenn.edu/LDC2014S03
		ISBN: 1-58563-674-6
		ISLRN: 766-428-479-143-5
		DOI: 10.35111/zd7f-qr83
Language:		English
Language (ISO639):		eng
License:		Multi-Channel WSJ Audio: https://catalog.ldc.upenn.edu/license/multi-channel-wsj-audio.pdf
Medium:		Distribution: Web Download
Publisher:		Linguistic Data Consortium
Publisher (URI):		https://www.ldc.upenn.edu
Relation (URI):		https://catalog.ldc.upenn.edu/docs/LDC2014S03
Rights Holder:		Portions © 1987-1989 Dow Jones & Company, Inc., © 2014 University Court of the University of Edinburgh, © 1992-1995, 2014 Trustees of the University of Pennsylvania
Type (DCMI):		Sound
Type (OLAC):		primary_text
OLAC Info
Archive:		The LDC Corpus Catalog
Description:		http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:		OAI-PMH request for OLAC format
GetRecord:		Pre-generated XML file
OAI Info
OaiIdentifier:		oai:www.ldc.upenn.edu:LDC2014S03
DateStamp:		2020-11-30
GetRecord:		OAI-PMH request for simple DC format
Search Info
Citation:		Lincoln, Mike; Zwyssig, Erich; McCowan, Iain. 2014. Linguistic Data Consortium.
Terms:		area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text