Motivation
Linguistic diversity is one of the fundamental rights of the European Union and we are optimistic that cross-lingual AI models can play an important role in facilitating it. Cross-lingual models based on neural networks are trained on terabytes of data and have recently reached large performance gains. As the computationally expensive training of such models can only be afforded by large companies, the evaluation of cross-lingual models is driven by commercial incentives and focuses on the average quantitative performance across more than a hundred languages. The intricacies of application scenarios for low-resource languages or economically insignificant purposes are largely being overlooked and individual differences between users are underestimated. When we want to use cross-lingual models for human-centered scenarios such as cognitive modeling, language education, or for use cases in the digital humanities, we quickly encounter their limitations.
In this workshop, we want to bring together leading scholars from linguistics, cognitive science, and computer science to develop a more diverse and human-centered perspective on cross-lingual models. We want to integrate typological theories about differences between language families, cognitive models of multilingual processing, and computational approaches towards increasing diversity in language technology.
Participation in the workshop is free of charge but the number of participants is limited. Please register using this form.
Program
Day 1: March 29 (Wednesday), 2pm-6pm
Location: Room Truss, Volkshotel
14.00 | Lisa Beinborn: Human-Centered Challenges for Cross-Lingual Language Models |
14.45 | Piek Vossen: Cross-lingual Reference for Learning Languages with Robots |
15.30 | Coffee break |
16.00 | Antal v.d. Bosch: Implicit and Explicit Multilingual Computational Language Models |
16.45 | Poster Session |
Day 2, March 30 (Thursday), 9 am-5pm
Location: Interactive workspace 3D@VU
09.00 | Anne Lauscher: Fostering Cultural and Subcultural Inclusion in NLP |
10.00 | Coffee break |
10.30 | Lonneke van der Plas: Language Specificity and Low-Resource Languages |
11.30 | Discussion: Challenges for Applying Cross-Lingual Models to Human-Centered Use Cases Panel: Anne Lauscher, Lonneke van der Plas, Andrew Caines |
12.30 | Lunch break |
14.00 | Yevgen Matusevych: Targeted Evaluation of Bilingual Models |
15.00 | Coffee break |
15.30 | Discussion: Human-Centered Aspects in the Evaluation of Cross-Lingual Models Panel: Yevgen Matusevych, Stefan Frank, Rochelle Choenni |
Confirmed Posters:
- Rochelle Choenni: Data-Efficient Cross-Lingual Transfer with Language-Specific Subnetworks
- Charlotte Pouw: Cross-Lingual Transfer of Cognitive Complexity
- Ece Takmaz: Transformer Adapters for the Multi- and Cross-Lingual Prediction of Human Reading Behavior
- Marcell Fekete: Gradual Language Model Adaptation Using Fine-Grained Typology
- Yiyi Chen: Improving Cross-Lingual Models with Semantic Typology
- Antonia Karamolegkou: Investigation of Transfer Languages for Parsing Latin
- Ester Ploeger: Machine Translation between Languages with Varying Semantic Typology
- Philipp Rust: Language Modeling with Pixels
- Richard Diehl Martinez: Framing Multi-Lingual Language Modeling as a Meta-Learning Task
- Gabriele Sarti: DivEMT: Neural Machine Translation Post Editing Effort Across Typologically Diverse Languages
- Andrew Caines: Cross-Lingual Word Segmentation
- Adrielli Lopes Rego: Towards a Cross-Lingual Computational Model of Reading
- Mark Breuker: Educational NLP for Language Learning
- Lisa Beinborn: Modeling Disagreement in Cross-Lingual Eye-Tracking Signals
- Plus a few more TBA
Contact
The workshop is sponsored by an Early Career Partnership awarded to Lisa Beinborn by the Royal Netherlands Academy of Arts and Sciences (KNAW). If you have questions about the program, you can send an e-mail to humanclaim@googlegroups.com.