MaChAmp at SemEval-2023 Tasks 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Interm

Phản hồi
Báo xấu

16 Lượt xem Premium12/06/2023

To improve the ability of language models to handle Natural Language Processing (NLP) tasks and intermediate step of pre-training has recently been introduced. In this setup, one takes a pre-trained language model, trains it on a (set of) NLP dataset(s), and then finetunes it for a target task. It is known that the selection of relevant transfer tasks is important, but recently some work has shown substantial performance gains by doing intermediate training on a very large set of datasets. Most previous work uses generative language models or only focuses on one or a couple of tasks and uses a carefully curated setup. We compare intermediate training with one or many tasks in a setup where the choice of datasets is more arbitrary; we use all SemEval 2023 text-based tasks. We reach performance improvements for most tasks when using intermediate training. Gains are higher when doing intermediate training on single tasks than all tasks if the right transfer task is identified. Dataset smoothing and heterogeneous batching did not lead to robust gains in our setup.

Không được đăng tải lại nội dung khi chưa có sự cho phép của nhà sáng tạo

0 Người theo dõi · 11 Videos

Đề xuất cho bạn

Tất cả
Anime

BAO MINH- KHANH HONG JSC: Introduction Clip - 国际营商环境创新论坛暨京津冀企业‘走出去・链通全球’全球合作大会

3:02

BAO MINH- KHANH HONG JSC: Introduction Clip - 国际营商环境创新论坛暨京津冀企业‘走出去・链通全球’全球合作大会

宝明 - 庆鸿股份公司

3 Lượt xem

Sensationelle Filmaufnahmen von Berlin nach der Kapitulation (3. Mai 1945)

5:27

Sensationelle Filmaufnahmen von Berlin nach der Kapitulation (3. Mai 1945)

0 Lượt xem

handmade

0:59

0 Lượt xem

Lục Địa Kiện Tiên Tập 120 Vietsub Miễn Phí - HHPANDA

9:11

Lục Địa Kiện Tiên Tập 120 Vietsub Miễn Phí - HHPANDA

0 Lượt xem

Where are we Still Split on Tokenization?

4:46

Where are we Still Split on Tokenization?

5 Lượt xem

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?(teaser)

0:39

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?(teaser)

9 Lượt xem

Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embe

1:55

Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embe

19 Lượt xem

MaChAmp at SemEval-2023 Tasks 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Interm

10:00

MaChAmp at SemEval-2023 Tasks 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Interm

16 Lượt xem

We Need to Talk About train-dev-test Splits

8:00

We Need to Talk About train-dev-test Splits

18 Lượt xem

MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pr

6:32

MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pr

27 Lượt xem

Lexical Normalization for Code-switched Data and its Effect on POS Tagging

12:15

Lexical Normalization for Code-switched Data and its Effect on POS Tagging

8 Lượt xem

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken L

10:00

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken L

13 Lượt xem

Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data

5:45

Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data

26 Lượt xem

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get? full

6:03

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get? full

11 Lượt xem

Enough is Enough! A Case Study on the Effect of Data Size for Evaluation Using Universal Dependencie

4:31

Enough is Enough! A Case Study on the Effect of Data Size for Evaluation Using Universal Dependencie

8 Lượt xem

Sukatan ng vital idol part 1

1:20

Sukatan ng vital idol part 1

431.1K Lượt xem

Fallen in love with A High School Girl

0:52

Fallen in love with A High School Girl

253.0K Lượt xem

boys will be boys

0:46

boys will be boys

318.5K Lượt xem

Sonic the Hedgehog 3 (2023) | 5 Pitches for the Sequel

5:27

Sonic the Hedgehog 3 (2023) | 5 Pitches for the Sequel

285.0K Lượt xem

10 Anime where OP MC hides his Power at School

8:09

10 Anime where OP MC hides his Power at School

Shinobi Update Channel

236.8K Lượt xem