MaChAmp at SemEval-2023 Tasks 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Intermediate Training

To improve the ability of language models to handle Natural Language Processing (NLP) tasks, an intermediate training step has recently been introduced. In this setup, one takes a pre-trained language model, trains it on a (set of) NLP dataset(s), and then fine-tunes it for a target task. It is known that the selection of relevant transfer tasks is important, but some recent work has shown substantial performance gains from intermediate training on a very large set of datasets. Most previous work uses generative language models or focuses on only one or a few tasks in a carefully curated setup. We compare intermediate training with one or many tasks in a setup where the choice of datasets is more arbitrary; we use all SemEval 2023 text-based tasks. We observe performance improvements for most tasks when using intermediate training. Gains are higher when doing intermediate training on a single task than on all tasks, provided the right transfer task is identified. Dataset smoothing and heterogeneous batching did not lead to robust gains in our setup.
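The two-stage recipe described above can be sketched as follows. This is a minimal, illustrative sketch using Hugging Face Transformers, not the MaChAmp toolkit used in the paper; the base model (xlm-roberta-base), the transfer and target datasets (SST-2 and CoLA), and the checkpoint directory names are assumptions chosen for the example, not the paper's actual setup.

```python
# Minimal sketch of intermediate training followed by target fine-tuning.
# Assumptions (not from the paper): Hugging Face Transformers/Datasets,
# xlm-roberta-base as the pre-trained encoder, SST-2 as the transfer task,
# CoLA as the target task, and local checkpoint directory names.
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

base = "xlm-roberta-base"
tok = AutoTokenizer.from_pretrained(base)

def encode(batch):
    # Both example tasks are single-sentence classification with a "sentence" column.
    return tok(batch["sentence"], truncation=True, padding="max_length", max_length=128)

# Stage 1: intermediate training on the transfer task.
transfer = load_dataset("glue", "sst2").map(encode, batched=True)
model = AutoModelForSequenceClassification.from_pretrained(base, num_labels=2)
Trainer(
    model=model,
    args=TrainingArguments("intermediate-ckpt", num_train_epochs=1,
                           per_device_train_batch_size=16),
    train_dataset=transfer["train"],
).train()
model.save_pretrained("intermediate-ckpt")

# Stage 2: fine-tune the intermediately trained checkpoint on the target task.
# For a target task with a different label set, pass the new num_labels and
# ignore_mismatched_sizes=True so the classification head is re-initialized.
target = load_dataset("glue", "cola").map(encode, batched=True)
model = AutoModelForSequenceClassification.from_pretrained("intermediate-ckpt",
                                                           num_labels=2)
Trainer(
    model=model,
    args=TrainingArguments("target-ckpt", num_train_epochs=3,
                           per_device_train_batch_size=16),
    train_dataset=target["train"],
    eval_dataset=target["validation"],
).train()
```

When many transfer datasets of very different sizes are combined in stage 1, dataset smoothing is typically implemented by sampling from each dataset with probability proportional to its size raised to a power below one rather than to its raw size, which down-weights the largest datasets; heterogeneous batching mixes instances from several datasets within a single batch instead of building each batch from one dataset.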