OLAC Record oai:www.ldc.upenn.edu:LDC2023T10 |
Metadata | ||
Title: | AIDA Scenario 1 and 2 Reference Knowledge Base | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Tracey, Jennifer, et al. AIDA Scenario 1 and 2 Reference Knowledge Base LDC2023T10. Web Download. Philadelphia: Linguistic Data Consortium, 2023 | |
Contributor: | Tracey, Jennifer | |
Strassel, Stephanie | ||
Getman, Jeremy | ||
Bies, Ann | ||
Griffitt, Kira | ||
Graff, David | ||
Caruso, Christopher | ||
Date (W3CDTF): | 2023 | |
Date Issued (W3CDTF): | 2023-10-16 | |
Description: | *Introduction* AIDA Scenario 1 and 2 Reference Knowledge Base was developed by the Linguistic Data Consortium (LDC) and contains the English knowledge base (KB) used for all AIDA entity linking annotation in Scenario 1 (Russia-Ukraine Relations) and Scenario 2 (Crisis in Venezuela). The KB content was drawn from GeoNames, the CIA World Leaders List and the CIA World Factbook and was supplemented with manually created KB entries developed specifically for AIDA data. The DARPA AIDA (Active Interpretation of Disparate Alternatives) program aimed to develop a multi-hypothesis semantic engine to generate explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supported AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages. Each phase of the AIDA program focused on a specific scenario, or broad topic area, with related subtopics designated as either practice subtopics or evaluation subtopics. The Phase 1 scenario focused on political relations between Russia and Ukraine in the 2010s. The socioeconomic and political crisis in Venezuela since 2010 was the scenario in Phase 2. *Data* This knowledge base supported the AIDA entity detection and linking task for 13 entity types: GPE (Geo-Political Entity), LOC (Location), PER (Person), ORG (Organization), FAC (Facility), MHI (Medical/Health Issue), WEA (Weapon), SID (Side), COM (Commodity), CRM (Crime), LAW (Law), VEH (Vehicle), and BAL (Ballot). There are four inputs to the KB: GPE and LOC entities from GeoNames (GEO), PER entities from the CIA World Leaders List (WLL), ORG entities from Appendix B of the CIA World Factbook (APB), and additional entities manually created by LDC. The GEO, WLL and APB entries are also found in LORELEI Entity Detection and Linking Knowledge Base (LDC2020T10).
*Acknowledgement* This material is based upon work supported by Air Force Research Laboratory (AFRL) and the Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-18-C-0013. *Samples* Please view the following samples: * Alternate Names Sample * Entities Sample * Member States Sample *Updates* None at this time. | |
Extent: | Corpus size: 805034 KB | |
Identifier: | LDC2023T10 | |
https://catalog.ldc.upenn.edu/LDC2023T10 | ||
ISLRN: 644-411-403-964-6 | ||
DOI: 10.35111/3wzr-h616 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2023T10 | |
Rights Holder: | Portions © 2023 Trustees of the University of Pennsylvania | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2023T10 | |
DateStamp: | 2024-01-01 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher. 2023. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Text iso639_eng olac_primary_text |