Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get? full

Phản hồi
Báo xấu

11 Lượt xem Premium29/09/2022

Because of globalization, it is becoming more and more common to use multiple languages in a single utterance, also called codeswitching. This results in special linguistic structures and, therefore, poses many challenges for Natural Language Processing. Existing models for language identification in code-switched data are all supervised, requiring annotated training data which is only available for a limited number of language pairs. In this paper, we explore semi-supervised approaches, that exploit out-of-domain monolingual training data. We experiment with word uni-grams, word n-grams, character ngrams, Viterbi Decoding, Latent Dirichlet Allocation, Support Vector Machine and Logistic Regression. The Viterbi model was the best semi-supervised model, scoring a weighted F1 score of 92.23%, whereas a fully supervised state-of-the-art BERT-based model scored 98.43%.

Không được đăng tải lại nội dung khi chưa có sự cho phép của nhà sáng tạo

0 Người theo dõi · 11 Videos

Đề xuất cho bạn

Tất cả
Anime

BAO MINH- KHANH HONG JSC: Introduction Clip - 国际营商环境创新论坛暨京津冀企业‘走出去・链通全球’全球合作大会

3:02

BAO MINH- KHANH HONG JSC: Introduction Clip - 国际营商环境创新论坛暨京津冀企业‘走出去・链通全球’全球合作大会

宝明 - 庆鸿股份公司

3 Lượt xem

Sensationelle Filmaufnahmen von Berlin nach der Kapitulation (3. Mai 1945)

5:27

Sensationelle Filmaufnahmen von Berlin nach der Kapitulation (3. Mai 1945)

0 Lượt xem

handmade

0:59

0 Lượt xem

Lục Địa Kiện Tiên Tập 120 Vietsub Miễn Phí - HHPANDA

9:11

Lục Địa Kiện Tiên Tập 120 Vietsub Miễn Phí - HHPANDA

0 Lượt xem

We Need to Talk About train-dev-test Splits

8:00

We Need to Talk About train-dev-test Splits

18 Lượt xem

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get? full

6:03

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get? full

11 Lượt xem

MaChAmp at SemEval-2023 Tasks 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Interm

10:00

MaChAmp at SemEval-2023 Tasks 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Interm

16 Lượt xem

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?(teaser)

0:39

Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?(teaser)

9 Lượt xem

Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embe

1:55

Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embe

19 Lượt xem

Where are we Still Split on Tokenization?

4:46

Where are we Still Split on Tokenization?

5 Lượt xem

Enough is Enough! A Case Study on the Effect of Data Size for Evaluation Using Universal Dependencie

4:31

Enough is Enough! A Case Study on the Effect of Data Size for Evaluation Using Universal Dependencie

8 Lượt xem

Lexical Normalization for Code-switched Data and its Effect on POS Tagging

12:15

Lexical Normalization for Code-switched Data and its Effect on POS Tagging

8 Lượt xem

Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data

5:45

Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data

26 Lượt xem

MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pr

6:32

MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pr

27 Lượt xem

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken L

10:00

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken L

13 Lượt xem

Hunter X Hunter Episode 38 Tagalog Dubbed 720P

19:55

Hunter X Hunter Episode 38 Tagalog Dubbed 720P

AnimeForeverAko

270.2K Lượt xem

Boy Called Weak was Transferred to Another World and Discovered He Had the Powers of a Demon King

8:03

Boy Called Weak was Transferred to Another World and Discovered He Had the Powers of a Demon King

244.3K Lượt xem

Ang masalimoot na sinapit ng mag-ina sa baliw na lalaking ito, ginahàsa sila at sapilitan at...

11:32

Ang masalimoot na sinapit ng mag-ina sa baliw na lalaking ito, ginahàsa sila at sapilitan at...

349.6K Lượt xem

solo

0:10

230.3K Lượt xem

Hunter X Hunter Episode 51 Tagalog Dubbed 720P

19:55

Hunter X Hunter Episode 51 Tagalog Dubbed 720P

AnimeForeverAko

315.9K Lượt xem