OLAC Record
oai:catalogue.elra.info:ELRA-S0486

Metadata
Title:ALLIES Corpus
Access Rights: Rights available for: nonCommercialUse, commercialUse
Date Available (W3CDTF):2023-05-05
Date Issued (W3CDTF):2023-05-05
Description:The ALLIES Corpus was produced within the European CHIST-ERA project ALLIES. The project made it possible to run a campaign evaluating diarization systems on broadcast news across time, using French data. It extends the earlier ESTER, REPERE and ETAPE evaluation campaigns carried out for French in this field. The corpus is based on the material used for the ESTER, REPERE and ETAPE evaluation packages (see the ELRA Catalogue, http://catalogue.elra.info, for the respective packages) and was built as an extension of those previously produced corpora. It contains corrected annotations from the previous evaluation materials as well as new audio data with corresponding transcriptions. Corrections include corrected speaker names and re-segmentation. The segmentation tasks consist of sound event segmentation, speaker tracking and speaker segmentation, detailed as follows:
- Sound event segmentation consists of tracking the parts that contain music (with or without speech) and the parts that contain speech (with or without music).
- Speaker tracking consists of detecting the parts of the document that correspond to a given speaker.
- Speaker segmentation consists of segmenting the document by speaker and grouping the parts spoken by the same speaker.
Overall, the ALLIES Corpus contains about 900 hours of broadcast news, including orthographic transcriptions, speaker annotations and segmentation.
Identifier:ELRA-S0486
ISLRN: 397-116-696-859-2
Identifier (URI):https://catalog.elra.info/en-us/repository/browse/ELRA-S0486/
Language:French
Language (ISO639):fra
Medium:Not specified
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0486
DateStamp:  2023-05-05
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2023. ELRA (European Language Resources Association).
Terms: area_Europe country_FR dcmi_Sound iso639_fra olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0486
Up-to-date as of: Thu Apr 3 2:08:24 EDT 2025