The lack of publicly available evaluation data
for low-resource languages limits progress in
Spoken Language Understanding (SLU). As
key tasks like intent classification and slot filling require abundant training data, it is desirable to reuse existing data in high-resource
languages to develop models for low-resource
scenarios. We introduce XSID, a new benchmark for cross-lingual (X) Slot and Intent Detection in 13 languages from 6 language families, including a very low-resource dialect.
To tackle this challenge, we propose a joint
learning approach that combines English SLU training
data with non-English auxiliary tasks from raw
text, syntax, and translation for transfer. We
study two setups that differ in the type and language coverage of the pre-trained embeddings.
Our results show that jointly learning the main
tasks with masked language modeling is effective for slots, while machine translation transfer works best for intent classification.