HumanCLAIM Workshop

Motivation

Linguistic diversity is one of the fundamental rights of the European Union and we are optimistic that cross-lingual AI models can play an important role in facilitating it. Cross-lingual models based on neural networks are trained on terabytes of data and have recently reached large performance gains. As the computationally expensive training of such models can only be afforded by large companies, the evaluation of cross-lingual models is driven by commercial incentives and focuses on the average quantitative performance across more than a hundred languages. The intricacies of application scenarios for low-resource languages or economically insignificant purposes are largely being overlooked and individual differences between users are underestimated. When we want to use cross-lingual models for human-centered scenarios such as cognitive modeling, language education, or for use cases in the digital humanities, we quickly encounter their limitations.

In this workshop, we want to bring together leading scholars from linguistics, cognitive science, and computer science to develop a more diverse and human-centered perspective on cross-lingual models. We want to integrate typological theories about differences between language families, cognitive models of multilingual processing, and computational approaches towards increasing diversity in language technology.

Participation in the workshop is free of charge but the number of participants is limited. Please register using this form.

Program

Day 1: March 29 (Wednesday), 2pm-6pm
Location: Room Truss, Volkshotel

14.00	Lisa Beinborn: Human-Centered Challenges for Cross-Lingual Language Models
14.45	Piek Vossen: Cross-lingual Reference for Learning Languages with Robots
15.30	Coffee break
16.00	Antal v.d. Bosch: Implicit and Explicit Multilingual Computational Language Models
16.45	Poster Session

Day 2, March 30 (Thursday), 9 am-5pm
Location: Interactive workspace 3D@VU

09.00	Anne Lauscher: Fostering Cultural and Subcultural Inclusion in NLP
10.00	Coffee break
10.30	Lonneke van der Plas: Language Specificity and Low-Resource Languages
11.30	Discussion: Challenges for Applying Cross-Lingual Models to Human-Centered Use Cases Panel: Anne Lauscher, Lonneke van der Plas, Andrew Caines
12.30	Lunch break
14.00	Yevgen Matusevych: Targeted Evaluation of Bilingual Models
15.00	Coffee break
15.30	Discussion: Human-Centered Aspects in the Evaluation of Cross-Lingual Models Panel: Yevgen Matusevych, Stefan Frank, Rochelle Choenni

Confirmed Posters:

Rochelle Choenni: Data-Efficient Cross-Lingual Transfer with Language-Specific Subnetworks
Charlotte Pouw: Cross-Lingual Transfer of Cognitive Complexity
Ece Takmaz: Transformer Adapters for the Multi- and Cross-Lingual Prediction of Human Reading Behavior
Marcell Fekete: Gradual Language Model Adaptation Using Fine-Grained Typology
Yiyi Chen: Improving Cross-Lingual Models with Semantic Typology
Antonia Karamolegkou: Investigation of Transfer Languages for Parsing Latin
Ester Ploeger: Machine Translation between Languages with Varying Semantic Typology
Philipp Rust: Language Modeling with Pixels
Richard Diehl Martinez: Framing Multi-Lingual Language Modeling as a Meta-Learning Task
Gabriele Sarti: DivEMT: Neural Machine Translation Post Editing Effort Across Typologically Diverse Languages
Andrew Caines: Cross-Lingual Word Segmentation
Adrielli Lopes Rego: Towards a Cross-Lingual Computational Model of Reading
Mark Breuker: Educational NLP for Language Learning
Lisa Beinborn: Modeling Disagreement in Cross-Lingual Eye-Tracking Signals
Plus a few more TBA

Contact

The workshop is sponsored by an Early Career Partnership awarded to Lisa Beinborn by the Royal Netherlands Academy of Arts and Sciences (KNAW). If you have questions about the program, you can send an e-mail to humanclaim@googlegroups.com.