This paper presents our work on linking language tools for Tunisian Arabic, focusing on a lexicographic database and a corpus of informal written texts. This work on Tunisian Arabic is an ongoing pilot study, while our wider goal is to create resources for various under-resourced languages. We outline a methodology that emphasises open science principles, leveraging existing language resources and NLP tools for standardisation and annotation. Our approach ensures reproducibility and benefits other researchers. We share annotated data on a digital platform and release NLP tools on a dedicated repository. Our work aligns with FAIR principles, facilitating open and effective research on under-resourced languages.

Towards a Unified Digital Resource for Tunisian Arabic Lexicography / Gugliotta, Elisa; Mallia, Michele; Panascì, Livia. - (2023), pp. 579-590. [10.34619/srmk-injj]

Towards a Unified Digital Resource for Tunisian Arabic Lexicography

Gugliotta, Elisa
Conceptualization
;
2023-01-01

Abstract

This paper presents our work on linking language tools for Tunisian Arabic, focusing on a lexicographic database and a corpus of informal written texts. This work on Tunisian Arabic is an ongoing pilot study, while our wider goal is to create resources for various under-resourced languages. We outline a methodology that emphasises open science principles, leveraging existing language resources and NLP tools for standardisation and annotation. Our approach ensures reproducibility and benefits other researchers. We share annotated data on a digital platform and release NLP tools on a dedicated repository. Our work aligns with FAIR principles, facilitating open and effective research on under-resourced languages.
2023
Inglese
Gugliotta, Elisa ; Mallia, Michele; Panascì, Livia
Carvalho, Sara ; Khan, Anas Fahad ; Anić, Ana Ostroški ; Spahiu, Blerina ; Gracia, Jorge ; McCrae, John P. ; Gromann, Dagmar ; Heinisch, Barbara ; Salgado, Ana
Proceedings of the 4th Conference on Language, Data and Knowledge
579
590
12
978-989-54081-5-3
https://aclanthology.org/2023.ldk-1.14/
NOVA CLUNL
Portugal
PORTOGALLO
Tunisian Arabic, lexicographic database, informal written texts, under-resourced languages, language resources, NLP tools, standardisation, annotation, open science, reproducibility, FAIR principles, digital platform
Internazionale
No
info:eu-repo/semantics/bookPart
Gugliotta, Elisa; Mallia, Michele; Panascì, Livia
2 Contributo in Volume::2.1 Contributo in volume (Capitolo o Saggio)
3
268
Towards a Unified Digital Resource for Tunisian Arabic Lexicography / Gugliotta, Elisa; Mallia, Michele; Panascì, Livia. - (2023), pp. 579-590. [10.34619/srmk-injj]
none
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11388/361756
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact