Online first articles
Articles not assigned to an issue 58 articles
-
-
Faux Hate: unravelling the web of fake narratives in spreading hateful stories: a multi-label and multi-class dataset in cross-lingual Hindi-English code-mixed text
Authors
- Shankar Biradar
- Sunil Saumya
- Arun Chauhan
- Content type: Original Paper
- Published: 16 April 2024
-
Depression symptoms modelling from social media text: an LLM driven semi-supervised learning approach
Authors (first, second and last of 4)
- Nawshad Farruque
- Randy Goebel
- Osmar R. Zaïane
- Content type: Original Paper
- Open Access
- Published: 04 April 2024
-
A morphologically annotated longitudinal corpus of spoken Czech child–adult interactions
Authors (first, second and last of 4)
- Anna Chromá
- Jakub Sláma
- Jolana Treichelová
- Content type: OriginalPaper
- Published: 30 March 2024
-
TCMeta: a multilingual dataset of COVID tweets for relation-level metaphor analysis
Authors
- Mojca Brglez
- Omnia Zayed
- Paul Buitelaar
- Content type: Original Paper
- Open Access
- Published: 30 March 2024
-
A longitudinal multi-modal dataset for dementia monitoring and diagnosis
Authors (first, second and last of 7)
- Dimitris Gkoumas
- Bo Wang
- Maria Liakata
- Content type: Original Paper
- Open Access
- Published: 30 March 2024
-
DILLo: an Italian lexical database for speech-language pathologists
Authors (first, second and last of 9)
- Federica Beccaria
- Angela Cristiano
- Gloria Gagliardi
- Content type: Original Paper
- Open Access
- Published: 23 March 2024
-
"Approaches to sentiment analysis of Hungarian political news at the sentence level"
Authors (first, second and last of 5)
- Orsolya Ring
- Martina Katalin Szabó
- István Üveges
- Content type: Original Paper
- Open Access
- Published: 23 March 2024
-
Introducing the 3MT_French dataset to investigate the timing of public speaking judgements
Authors
- Beatrice Biancardi
- Mathieu Chollet
- Chloé Clavel
- Content type: OriginalPaper
- Open Access
- Published: 23 March 2024
-
VeLeRo: an inflected verbal lexicon of standard Romanian and a quantitative analysis of morphological predictability
Authors
- Borja Herce
- Bogdan Pricop
- Content type: Project Notes
- Open Access
- Published: 23 March 2024
-
An aligned corpus of Spanish bibles
Authors (first, second and last of 5)
- Gerardo Sierra
- Gemma Bel-Enguix
- Núria Bel
- Content type: Original Paper
- Open Access
- Published: 15 March 2024
-
SOLD: Sinhala offensive language dataset
Authors (first, second and last of 7)
- Tharindu Ranasinghe
- Isuri Anuradha
- Marcos Zampieri
- Content type: Original Paper
- Open Access
- Published: 06 March 2024
-
Infectious risk events and their novelty in event-based surveillance: new definitions and annotated corpus
Authors (first, second and last of 8)
- François Delon
- Gabriel Bédubourg
- Marc Tanti
- Content type: Original Paper
- Published: 05 March 2024
-
Semantic search as extractive paraphrase span detection
Authors (first, second and last of 6)
- Jenna Kanerva
- Hanna Kitti
- Filip Ginter
- Content type: Original Paper
- Open Access
- Published: 01 February 2024
-
A new methodology for automatic creation of concept maps of Turkish texts
Authors
- Merve Bayrak
- Deniz Dal
- Content type: Original Paper
- Published: 28 January 2024
-
Large scale annotated dataset for code-mix abusive short noisy text
Authors
- Paras Tiwari
- Sawan Rai
- C. Ravindranath Chowdary
- Content type: OriginalPaper
- Published: 25 January 2024
-
A flexible tool for a qualia-enriched FrameNet: the FrameNet Brasil WebTool
Authors (first, second and last of 6)
- Tiago Timponi Torrent
- Ely Edison da Silva Matos
- Vanessa Maria Ramos Lopes Paiva
- Content type: Original Paper
- Published: 22 January 2024
-
NewsCom-TOX: a corpus of comments on news articles annotated for toxicity in Spanish
Authors (first, second and last of 4)
- Mariona Taulé
- Montserrat Nofre
- Xavier Bonet
- Content type: Original Paper
- Open Access
- Published: 17 January 2024
-
Toxic comment classification and rationale extraction in code-mixed text leveraging co-attentive multi-task learning
Authors
- Kiran Babu Nelatoori
- Hima Bindu Kommanti
- Content type: Original Paper
- Published: 13 January 2024
-
Multi-layered semantic annotation and the formalisation of annotation schemas for the investigation of modality in a Latin corpus
Authors
- Helena Bermúdez-Sabel
- Francesca Dell’Oro
- Paola Marongiu
- Content type: Project Notes
- Published: 06 January 2024
-
AC-IQuAD: Automatically Constructed Indonesian Question Answering Dataset by Leveraging Wikidata
Authors
- Kerenza Doxolodeo
- Adila Alfa Krisnadhi
- Content type: OriginalPaper
- Open Access
- Published: 03 January 2024
-
KurdiSent: a corpus for kurdish sentiment analysis
Authors
- Soran Badawi
- Arefeh Kazemi
- Vali Rezaie
- Content type: Original Paper
- Published: 02 January 2024
-
Linguistic annotation of Byzantine book epigrams
Authors
- Colin Swaelens
- Ilse De Vos
- Els Lefever
- Content type: Original Paper
- Published: 13 December 2023
-
Democratizing neural machine translation with OPUS-MT
Authors (first, second and last of 10)
- Jörg Tiedemann
- Mikko Aulamo
- Sami Virpioja
- Content type: Original Paper
- Open Access
- Published: 13 December 2023
-
When MIPVU goes to no man’s land: a new language resource for hybrid, morpheme-based metaphor identification in Hungarian
Authors (first, second and last of 6)
- Gábor Simon
- Tímea Bajzát
- Eszter Szlávich
- Content type: Original Paper
- Open Access
- Published: 09 December 2023
-
EmoTwiCS: a corpus for modelling emotion trajectories in Dutch customer service dialogues on Twitter
Authors
- Sofie Labat
- Thomas Demeester
- Véronique Hoste
- Content type: Original Paper
- Open Access
- Published: 08 December 2023
-
Resources building for sentiment analysis of content disseminated by Tunisian medias in social networks
Authors
- Emna Fsih
- Rahma Boujelbane
- Lamia Hadrich Belguith
- Content type: OriginalPaper
- Published: 02 December 2023
-
A corpus of Persian literary text
Authors (first, second and last of 4)
- Shahab Raji
- Malihe Alikhani
- Matthew Stone
- Content type: Original Paper
- Open Access
- Published: 23 November 2023
-
A corpus of English learners with Arabic and Hebrew backgrounds
Authors (first, second and last of 5)
- Omaima Abboud
- Batia Laufer
- Shuly Wintner
- Content type: Project Notes
- Published: 20 November 2023
-
The Reading Everyday Emotion Database (REED): a set of audio-visual recordings of emotions in music and language
Authors
- Jia Hoong Ong
- Florence Yik Nam Leung
- Fang Liu
- Content type: OriginalPaper
- Open Access
- Published: 20 November 2023
-
Automatic genre identification: a survey
Authors
- Taja Kuzman
- Nikola Ljubešić
- Content type: Survey
- Open Access
- Published: 16 November 2023
-
A multilingual, multimodal dataset of aggression and bias: the ComMA dataset
Authors (first, second and last of 9)
- Ritesh Kumar
- Shyam Ratan
- Akanksha Bansal
- Content type: Original Paper
- Published: 16 November 2023
-
Correction: The DELAD initiative for sharing language resources on speech disorders
Authors (first, second and last of 5)
- Alice Lee
- Nicola Bessell
- Satu Saalasti
- Content type: Correction
- Open Access
- Published: 06 November 2023
-
LoNLI: An Extensible Framework for Testing Diverse Logical Reasoning Capabilities for NLI
Authors
- Ishan Tarunesh
- Somak Aditya
- Monojit Choudhury
- Content type: Original Paper
- Published: 04 November 2023
-
Building the VisSE Corpus of Spanish SignWriting
Authors
- Antonio F. G. Sevilla
- Alberto Díaz Esteban
- José María Lahoz-Bengoechea
- Content type: Original Paper
- Published: 26 October 2023
-
Text augmentation for semantic frame induction and parsing
Authors (first, second and last of 5)
- Saba Anwar
- Artem Shelmanov
- Chris Biemann
- Content type: Original Paper
- Open Access
- Published: 21 October 2023
-
A new corpus of geolocated ASR transcripts from Germany
Authors
- Steven Coats
- Content type: Project Notes
- Open Access
- Published: 21 October 2023
-
Beyond plain toxic: building datasets for detection of flammable topics and inappropriate statements
Authors
- Nikolay Babakov
- Varvara Logacheva
- Alexander Panchenko
- Content type: Original Paper
- Published: 21 October 2023
-
A semi-supervised method to generate a persian dataset for suggestion classification
Authors
- Leila Safari
- Zanyar Mohammady
- Content type: Original Paper
- Published: 29 September 2023
-
NEREL: a Russian information extraction dataset with rich annotation for nested entities, relations, and wikidata entity links
Authors (first, second and last of 11)
- Natalia Loukachevitch
- Ekaterina Artemova
- Alexey Yandutov
- Content type: Original Paper
- Published: 21 September 2023
-
An eye-tracking-with-EEG coregistration corpus of narrative sentences
Authors
- Stefan L. Frank
- Anna Aumeistere
- Content type: Original Paper
- Open Access
- Published: 29 August 2023
-
Data augmentation strategies to improve text classification: a use case in smart cities
Authors
- Luciana Bencke
- Viviane Pereira Moreira
- Content type: Original Paper
- Published: 23 August 2023
-
The development of a labelled te reo Māori–English bilingual database for language technology
Authors (first, second and last of 7)
- Jesin James
- Isabella Shields
- Keoni Mahelona
- Content type: Original Paper
- Published: 20 August 2023
-
Comparative performance of ensemble machine learning for Arabic cyberbullying and offensive language detection
Authors (first, second and last of 4)
- Marwa Khairy
- Tarek M. Mahmoud
- Tarek Abd El-Hafeez
- Content type: Original Paper
- Open Access
- Published: 13 August 2023
-
RUN-AS: a novel approach to annotate news reliability for disinformation detection
Authors (first, second and last of 5)
- Alba Bonet-Jover
- Robiert Sepúlveda-Torres
- Mario Nieto-Pérez
- Content type: Original Paper
- Open Access
- Published: 06 August 2023
-
Assessment of pragmatic abilities and cognitive substrates (APACS) brief remote: a novel tool for the rapid and tele-evaluation of pragmatic skills in Italian
Authors (first, second and last of 7)
- Luca Bischetti
- Chiara Pompei
- Valentina Bambini
- Content type: Original Paper
- Published: 23 July 2023
-
The limitations of irony detection in Dutch social media
Authors (first, second and last of 4)
- Aaron Maladry
- Els Lefever
- Véronique Hoste
- Content type: Original Paper
- Open Access
- Published: 23 July 2023
-
MarIA and BETO are sexist: evaluating gender bias in large language models for Spanish
Authors
- Ismael Garrido-Muñoz
- Fernando Martínez-Santiago
- Arturo Montejo-Ráez
- Content type: Original Paper
- Open Access
- Published: 23 July 2023
-
FullStop: punctuation and segmentation prediction for Dutch with transformers
Authors
- Vincent Vandeghinste
- Oliver Guhr
- Content type: Original Paper
- Published: 14 July 2023
-
The C-ORAL-ESQ project: a corpus for the study of spontaneous speech of individuals with schizophrenia
Authors (first, second and last of 6)
- Tommaso Raso
- Bruno Neves Rati de Melo Rocha
- Heliana Mello
- Content type: Original Paper
- Published: 27 June 2023
For authors
Submit manuscriptWorking on a manuscript?
Avoid the most common mistakes and prepare your manuscript for journal editors.
Learn more