Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?(teaser)

ข้อเสนอแนะ
รายงาน

9 วิว Premium29/09/2022

Because of globalization, it is becoming more and more common to use multiple languages in a single utterance, also called codeswitching. This results in special linguistic structures and, therefore, poses many challenges for Natural Language Processing. Existing models for language identification in code-switched data are all supervised, requiring annotated training data which is only available for a limited number of language pairs. In this paper, we explore semi-supervised approaches, that exploit out-of-domain monolingual training data. We experiment with word uni-grams, word n-grams, character ngrams, Viterbi Decoding, Latent Dirichlet Allocation, Support Vector Machine and Logistic Regression. The Viterbi model was the best semi-supervised model, scoring a weighted F1 score of 92.23%, whereas a fully supervised state-of-the-art BERT-based model scored 98.43%

ห้ามทำซ้ำหรือดัดแปลงโดยไม่ได้รับอนุญาตจากครีเอเตอร์

0 แฟนคลับ · 11 วิดีโอ

วีดีโอแนะนำสำหรับคุณ

ทั้งหมด
อนิเมะ

Teaser ตัวอย่าง MorchangTv Epพิเศษดวงชะตาประจำปี 2568

0:39

Teaser ตัวอย่าง MorchangTv Epพิเศษดวงชะตาประจำปี 2568

1 วิว

เดชาวัดสัปโบ่

1:47:29

เดชาวัดสัปโบ่

1 วิว

English Sentence Repetition Pattern #1

5:51

English Sentence Repetition Pattern #1

1 วิว

We Need to Talk About train-dev-test Splits

8:00

We Need to Talk About train-dev-test Splits

16 วิว

Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embe

1:55

Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embe

18 วิว

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?(teaser)

0:39

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?(teaser)

9 วิว

Lexical Normalization for Code-switched Data and its Effect on POS Tagging

12:15

Lexical Normalization for Code-switched Data and its Effect on POS Tagging

8 วิว

Where are we Still Split on Tokenization?

4:46

Where are we Still Split on Tokenization?

5 วิว

Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data

5:45

Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data

26 วิว

MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pr

6:32

MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pr

27 วิว

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get? full

6:03

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get? full

11 วิว

Enough is Enough! A Case Study on the Effect of Data Size for Evaluation Using Universal Dependencie

4:31

Enough is Enough! A Case Study on the Effect of Data Size for Evaluation Using Universal Dependencie

3 วิว

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken L

10:00

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken L

13 วิว

MaChAmp at SemEval-2023 Tasks 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Interm

10:00

MaChAmp at SemEval-2023 Tasks 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Interm

16 วิว

0:27

292.2K วิว

Dark Continent Episode 1 | Hunter x Hunter - Tagalog Dubbed

11:27

Dark Continent Episode 1 | Hunter x Hunter - Tagalog Dubbed

216.6K วิว

MAASIM NA JOWA feat. JohnBill (PINOY ANIMATION MEME)

4:26

MAASIM NA JOWA feat. JohnBill (PINOY ANIMATION MEME)

362.0K วิว

its too big

0:32

411.5K วิว

I AM LEGEND 2: LAST MAN ON EARTH - Teaser Trailer 2023 | Will Smith

2:30

I AM LEGEND 2: LAST MAN ON EARTH - Teaser Trailer 2023 | Will Smith

280.2K วิว

Kim Chiu tries to get the Coaches to spin their red chairs | The Voice Kids Philippines 2023

1:56

Kim Chiu tries to get the Coaches to spin their red chairs | The Voice Kids Philippines 2023

256.3K วิว