1
Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track
2
What do we really know about State of the Art NER?
3
Call-Sign Recognition and Understanding for Noisy Air-Traffic Transcripts Using Surveillance Information
4
How Does Pre-Trained Wav2Vec 2.0 Perform on Domain-Shifted Asr? an Extensive Benchmark on Air Traffic Control Communications
5
A Two-Step Approach to Leverage Contextual Data: Speech Recognition in Air-Traffic Communications
6
Automatic Processing Pipeline for Collecting and Annotating Air-Traffic Voice Communication Data
7
Bertraffic: Bert-Based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications
8
Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning
9
Datasets: A Community Library for Natural Language Processing
10
Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances
11
Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition
12
Improving callsign recognition with air-surveillance data in air-traffic communication
13
A comprehensive survey on sentiment analysis: Approaches, challenges and trends
14
Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems
15
Detecting English Speech in the Air Traffic Control Voice Communication
16
Spoken Instruction Understanding in Air Traffic Control: Challenge, Technique, and Application
17
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks
18
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications
19
A Survey on Recent Advances in Sequence Labeling from Deep Learning Models
20
Transformers: State-of-the-Art Natural Language Processing
21
Automatic Speech Recognition Benchmark for Air-Traffic Communications
22
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
23
Common Voice: A Massively-Multilingual Speech Corpus
24
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
25
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26
Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development
27
The Airbus Air Traffic Control speech recognition 2018 challenge: towards ATC automatic transcription and call sign detection
28
Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks
29
Semi-supervised Adaptation of Assistant Based Speech Recognition Models for different Approach Areas
30
Ontology for Transcription of ATC Speech Commands of SESAR 2020 Solution PJ.16-04
31
A Survey on Recent Advances in Named Entity Recognition from Deep Learning models
32
Analysis of BUT-PT Submission for NIST LRE 2017
33
Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces
34
A Real-life, French-accented Corpus of Air Traffic Control Communications
35
Decoupled Weight Decay Regularization
36
Semi-Supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control
37
Increasing ATM Efficiency withAssistant Based Speech Recognition
38
The First Cross-Lingual Challenge on Recognition, Normalization, and Matching of Named Entities in Slavic Languages
39
Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI
40
Reducing controller workload with automatic speech recognition
41
Gaussian Error Linear Units (GELUs)
42
Pattern Based Sequence Classification
43
Librispeech: An ASR corpus based on public domain audio books
44
Speaker adaptation of neural network acoustic models using i-vectors
45
On the difficulty of training recurrent neural networks
46
A speech interface for air traffic control terminals
47
Automated speech recognition in ATC environment
48
TED-LIUM: an Automatic Speech Recognition dedicated corpus
49
Linguistic Analysis of English Phraseology and Plain Language in Air-Ground Communication
50
Natural Language Processing (Almost) from Scratch
51
AN AUTOMATED SIMULATION PILOT CAPABILITY TO SUPPORT ADVANCED AIR TRAFFIC CONTROLLER TRAINING
52
The ATCOSIM Corpus of Non-Prompted Clean Air Traffic Control Speech
53
Vocalise: assessing the impact of data link technology on the R/T channel
55
Message Understanding Conference- 6: A Brief History
56
Connectionist Speech Recognition: A Hybrid Approach
57
Hybrid Neural Network/Hidden Markov Model Systems for Continuous Speech Recognition
58
SWITCHBOARD: telephone speech corpus for research and development
59
The ATIS Spoken Language Systems Pilot Corpus
60
Adapting probability-transitions in DP matching processing for an oral task-oriented dialogue
61
Microcomputer System Integration for Air Control Training
62
An assessment of the technology of automatic speech recognition for military applications
63
International Civil Aviation Organization
64
Legal and Ethical Challenges in Recording Air Traffic Control Speech
65
Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety
66
Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR
67
of the Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021)
68
“ICAO phraseology reference guide,”
69
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
70
Dropout: a simple way to prevent neural networks from overfitting
71
The Kaldi Speech Recognition Toolkit
72
Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis
73
Design and characterization of the non-native military air traffic communications database (nnMATC)
74
The HIWIRE database, a noisy and non-native English speech corpus for cockpit communication
75
The Air Traffic Control Corpus (ATC0) - LDC94S14A
76
</tags>: extra metadata
77
</text>: ground truth transcripts with high-level entities annotations (callsigns, commands and values)
78
</speaker>: speaker information to identifiy whether the segment is from an ATCO or pilot. Unknown cases are tagged with <UNK> • <text>
79
</segment>: one sample of data. One recording may have one or more segments
80
</end>: timing information with speech activity by the speakers