Online first articles

Articles not assigned to an issue (58 articles)

ArEntail: manually-curated Arabic natural language inference dataset from news headlines

Authors (first, second and last of 4)
- Rasha Obeidat
- Yara Al-Harahsheh
- Maram Gharaibeh
- Content type: Original Paper
- Published: 22 April 2024
Faux Hate: unravelling the web of fake narratives in spreading hateful stories: a multi-label and multi-class dataset in cross-lingual Hindi-English code-mixed text

Authors
- Shankar Biradar
- Sunil Saumya
- Arun Chauhan
- Content type: Original Paper
- Published: 16 April 2024
Depression symptoms modelling from social media text: an LLM driven semi-supervised learning approach

Authors (first, second and last of 4)
- Nawshad Farruque
- Randy Goebel
- Osmar R. Zaïane
- Content type: Original Paper
- Open Access
- Published: 04 April 2024
A morphologically annotated longitudinal corpus of spoken Czech child–adult interactions

Authors (first, second and last of 4)
- Anna Chromá
- Jakub Sláma
- Jolana Treichelová
- Content type: Original Paper
- Published: 30 March 2024
TCMeta: a multilingual dataset of COVID tweets for relation-level metaphor analysis

Authors
- Mojca Brglez
- Omnia Zayed
- Paul Buitelaar
- Content type: Original Paper
- Open Access
- Published: 30 March 2024
A longitudinal multi-modal dataset for dementia monitoring and diagnosis

Authors (first, second and last of 7)
- Dimitris Gkoumas
- Bo Wang
- Maria Liakata
- Content type: Original Paper
- Open Access
- Published: 30 March 2024
DILLo: an Italian lexical database for speech-language pathologists

Authors (first, second and last of 9)
- Federica Beccaria
- Angela Cristiano
- Gloria Gagliardi
- Content type: Original Paper
- Open Access
- Published: 23 March 2024
Approaches to sentiment analysis of Hungarian political news at the sentence level

Authors (first, second and last of 5)
- Orsolya Ring
- Martina Katalin Szabó
- István Üveges
- Content type: Original Paper
- Open Access
- Published: 23 March 2024
Introducing the 3MT_French dataset to investigate the timing of public speaking judgements

Authors
- Beatrice Biancardi
- Mathieu Chollet
- Chloé Clavel
- Content type: Original Paper
- Open Access
- Published: 23 March 2024
VeLeRo: an inflected verbal lexicon of standard Romanian and a quantitative analysis of morphological predictability

Authors
- Borja Herce
- Bogdan Pricop
- Content type: Project Notes
- Open Access
- Published: 23 March 2024
An aligned corpus of Spanish bibles

Authors (first, second and last of 5)
- Gerardo Sierra
- Gemma Bel-Enguix
- Núria Bel
- Content type: Original Paper
- Open Access
- Published: 15 March 2024
SOLD: Sinhala offensive language dataset

Authors (first, second and last of 7)
- Tharindu Ranasinghe
- Isuri Anuradha
- Marcos Zampieri
- Content type: Original Paper
- Open Access
- Published: 06 March 2024
Infectious risk events and their novelty in event-based surveillance: new definitions and annotated corpus

Authors (first, second and last of 8)
- François Delon
- Gabriel Bédubourg
- Marc Tanti
- Content type: Original Paper
- Published: 05 March 2024
Semantic search as extractive paraphrase span detection

Authors (first, second and last of 6)
- Jenna Kanerva
- Hanna Kitti
- Filip Ginter
- Content type: Original Paper
- Open Access
- Published: 01 February 2024
A new methodology for automatic creation of concept maps of Turkish texts

Authors
- Merve Bayrak
- Deniz Dal
- Content type: Original Paper
- Published: 28 January 2024
Large scale annotated dataset for code-mix abusive short noisy text

Authors
- Paras Tiwari
- Sawan Rai
- C. Ravindranath Chowdary
- Content type: Original Paper
- Published: 25 January 2024
A flexible tool for a qualia-enriched FrameNet: the FrameNet Brasil WebTool

Authors (first, second and last of 6)
- Tiago Timponi Torrent
- Ely Edison da Silva Matos
- Vanessa Maria Ramos Lopes Paiva
- Content type: Original Paper
- Published: 22 January 2024
NewsCom-TOX: a corpus of comments on news articles annotated for toxicity in Spanish

Authors (first, second and last of 4)
- Mariona Taulé
- Montserrat Nofre
- Xavier Bonet
- Content type: Original Paper
- Open Access
- Published: 17 January 2024
Toxic comment classification and rationale extraction in code-mixed text leveraging co-attentive multi-task learning

Authors
- Kiran Babu Nelatoori
- Hima Bindu Kommanti
- Content type: Original Paper
- Published: 13 January 2024
Multi-layered semantic annotation and the formalisation of annotation schemas for the investigation of modality in a Latin corpus

Authors
- Helena Bermúdez-Sabel
- Francesca Dell’Oro
- Paola Marongiu
- Content type: Project Notes
- Published: 06 January 2024
AC-IQuAD: Automatically Constructed Indonesian Question Answering Dataset by Leveraging Wikidata

Authors
- Kerenza Doxolodeo
- Adila Alfa Krisnadhi
- Content type: Original Paper
- Open Access
- Published: 03 January 2024
KurdiSent: a corpus for Kurdish sentiment analysis

Authors
- Soran Badawi
- Arefeh Kazemi
- Vali Rezaie
- Content type: Original Paper
- Published: 02 January 2024
Linguistic annotation of Byzantine book epigrams

Authors
- Colin Swaelens
- Ilse De Vos
- Els Lefever
- Content type: Original Paper
- Published: 13 December 2023
Democratizing neural machine translation with OPUS-MT

Authors (first, second and last of 10)
- Jörg Tiedemann
- Mikko Aulamo
- Sami Virpioja
- Content type: Original Paper
- Open Access
- Published: 13 December 2023
When MIPVU goes to no man’s land: a new language resource for hybrid, morpheme-based metaphor identification in Hungarian

Authors (first, second and last of 6)
- Gábor Simon
- Tímea Bajzát
- Eszter Szlávich
- Content type: Original Paper
- Open Access
- Published: 09 December 2023
EmoTwiCS: a corpus for modelling emotion trajectories in Dutch customer service dialogues on Twitter

Authors
- Sofie Labat
- Thomas Demeester
- Véronique Hoste
- Content type: Original Paper
- Open Access
- Published: 08 December 2023
Resources building for sentiment analysis of content disseminated by Tunisian medias in social networks

Authors
- Emna Fsih
- Rahma Boujelbane
- Lamia Hadrich Belguith
- Content type: Original Paper
- Published: 02 December 2023
A corpus of Persian literary text

Authors (first, second and last of 4)
- Shahab Raji
- Malihe Alikhani
- Matthew Stone
- Content type: Original Paper
- Open Access
- Published: 23 November 2023
A corpus of English learners with Arabic and Hebrew backgrounds

Authors (first, second and last of 5)
- Omaima Abboud
- Batia Laufer
- Shuly Wintner
- Content type: Project Notes
- Published: 20 November 2023
The Reading Everyday Emotion Database (REED): a set of audio-visual recordings of emotions in music and language

Authors
- Jia Hoong Ong
- Florence Yik Nam Leung
- Fang Liu
- Content type: Original Paper
- Open Access
- Published: 20 November 2023
Automatic genre identification: a survey

Authors
- Taja Kuzman
- Nikola Ljubešić
- Content type: Survey
- Open Access
- Published: 16 November 2023
A multilingual, multimodal dataset of aggression and bias: the ComMA dataset

Authors (first, second and last of 9)
- Ritesh Kumar
- Shyam Ratan
- Akanksha Bansal
- Content type: Original Paper
- Published: 16 November 2023
Correction: The DELAD initiative for sharing language resources on speech disorders

Authors (first, second and last of 5)
- Alice Lee
- Nicola Bessell
- Satu Saalasti
- Content type: Correction
- Open Access
- Published: 06 November 2023
LoNLI: An Extensible Framework for Testing Diverse Logical Reasoning Capabilities for NLI

Authors
- Ishan Tarunesh
- Somak Aditya
- Monojit Choudhury
- Content type: Original Paper
- Published: 04 November 2023
Building the VisSE Corpus of Spanish SignWriting

Authors
- Antonio F. G. Sevilla
- Alberto Díaz Esteban
- José María Lahoz-Bengoechea
- Content type: Original Paper
- Published: 26 October 2023
Text augmentation for semantic frame induction and parsing

Authors (first, second and last of 5)
- Saba Anwar
- Artem Shelmanov
- Chris Biemann
- Content type: Original Paper
- Open Access
- Published: 21 October 2023
A new corpus of geolocated ASR transcripts from Germany

Authors
- Steven Coats
- Content type: Project Notes
- Open Access
- Published: 21 October 2023
Beyond plain toxic: building datasets for detection of flammable topics and inappropriate statements

Authors
- Nikolay Babakov
- Varvara Logacheva
- Alexander Panchenko
- Content type: Original Paper
- Published: 21 October 2023
A semi-supervised method to generate a Persian dataset for suggestion classification

Authors
- Leila Safari
- Zanyar Mohammady
- Content type: Original Paper
- Published: 29 September 2023
NEREL: a Russian information extraction dataset with rich annotation for nested entities, relations, and wikidata entity links

Authors (first, second and last of 11)
- Natalia Loukachevitch
- Ekaterina Artemova
- Alexey Yandutov
- Content type: Original Paper
- Published: 21 September 2023
An eye-tracking-with-EEG coregistration corpus of narrative sentences

Authors
- Stefan L. Frank
- Anna Aumeistere
- Content type: Original Paper
- Open Access
- Published: 29 August 2023
Data augmentation strategies to improve text classification: a use case in smart cities

Authors
- Luciana Bencke
- Viviane Pereira Moreira
- Content type: Original Paper
- Published: 23 August 2023
The development of a labelled te reo Māori–English bilingual database for language technology

Authors (first, second and last of 7)
- Jesin James
- Isabella Shields
- Keoni Mahelona
- Content type: Original Paper
- Published: 20 August 2023
Comparative performance of ensemble machine learning for Arabic cyberbullying and offensive language detection

Authors (first, second and last of 4)
- Marwa Khairy
- Tarek M. Mahmoud
- Tarek Abd El-Hafeez
- Content type: Original Paper
- Open Access
- Published: 13 August 2023
RUN-AS: a novel approach to annotate news reliability for disinformation detection

Authors (first, second and last of 5)
- Alba Bonet-Jover
- Robiert Sepúlveda-Torres
- Mario Nieto-Pérez
- Content type: Original Paper
- Open Access
- Published: 06 August 2023
Assessment of pragmatic abilities and cognitive substrates (APACS) brief remote: a novel tool for the rapid and tele-evaluation of pragmatic skills in Italian

Authors (first, second and last of 7)
- Luca Bischetti
- Chiara Pompei
- Valentina Bambini
- Content type: Original Paper
- Published: 23 July 2023
The limitations of irony detection in Dutch social media

Authors (first, second and last of 4)
- Aaron Maladry
- Els Lefever
- Véronique Hoste
- Content type: Original Paper
- Open Access
- Published: 23 July 2023
MarIA and BETO are sexist: evaluating gender bias in large language models for Spanish

Authors
- Ismael Garrido-Muñoz
- Fernando Martínez-Santiago
- Arturo Montejo-Ráez
- Content type: Original Paper
- Open Access
- Published: 23 July 2023
FullStop: punctuation and segmentation prediction for Dutch with transformers

Authors
- Vincent Vandeghinste
- Oliver Guhr
- Content type: Original Paper
- Published: 14 July 2023
The C-ORAL-ESQ project: a corpus for the study of spontaneous speech of individuals with schizophrenia

Authors (first, second and last of 6)
- Tommaso Raso
- Bruno Neves Rati de Melo Rocha
- Heliana Mello
- Content type: Original Paper
- Published: 27 June 2023
