Few Shot NER on Augmented Unstructured Text from Cardiology Records

Ferraro, Antonino; Galli, Antonio; La Gatta, Valerio; Minocchi, Mario; Moscato, Vincenzo; Postiglione, Marco

doi:10.1007/978-3-031-53555-0_1

Antonino Ferraro³,
Antonio Galli³,
Valerio La Gatta³,
Mario Minocchi³,
Vincenzo Moscato³ &
…
Marco Postiglione³

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 193))

Included in the following conference series:

International Conference on Emerging Internet, Data & Web Technologies

347 Accesses

Abstract

The principal challenge encountered in the realm of Named-Entity Recognition lies in the acquisition of high-caliber annotated data. In certain languages and specialized domains, the availability of substantial datasets suitable for training models via traditional machine learning methodologies can prove to be a formidable obstacle [10]. In an effort to address this issue, we have explored a Policy-based Active Learning approach aimed at meticulously selecting the most advantageous instances generated through a Data Augmentation procedure [3, 6]. This endeavor was undertaken within the context of a few-shot scenario in the biomedical field. Our study has revealed the superiority of this strategy in comparison to active learning techniques relying on fixed metrics or random instance selection, guaranteeing the privacy of patients from whose medical records the source data were obtained and used. However, it is imperative to note that this approach entails heightened computational demands and necessitates a longer execution duration [7].

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 219.00; Price excludes VAT (USA)

Softcover Book: USD 279.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

AHIAP: An Agile Medical Named Entity Recognition and Relation Extraction Framework Based on Active Learning

An Efficient Text Labeling Framework Using Active Learning Model

Weak Supervision and Clustering-Based Sample Selection for Clinical Named Entity Recognition

References

Barolli, L., Ferraro, A.: A prediction approach in health domain combining encoding strategies and neural networks. In: Barolli, L. (ed.) 3PGCIC 2022. LNNS, vol. 571, pp. 129–136. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-19945-5_12
Chapter Google Scholar
Chen, S., Aguilar, G., Neves, L., Solorio, T.: Data augmentation for cross-domain named entity recognition (2021). https://doi.org/10.48550/ARXIV.2109.01758, https://arxiv.org/abs/2109.01758
Cubuk, E.D., Zoph, B., Mane, D., Vasudevan, V., Le, Q.V.: Autoaugment: learning augmentation policies from data. arXiv preprint arXiv:1805.09501 (2018)
Dai, X., Adel, H.: An analysis of simple data augmentation for named entity recognition (2020). https://doi.org/10.48550/ARXIV.2010.11683, https://arxiv.org/abs/2010.11683
Fang, M., Li, Y., Cohn, T.: Learning how to active learn: a deep reinforcement learning approach (2017). https://doi.org/10.48550/ARXIV.1708.02383, https://arxiv.org/abs/1708.02383
Ferraro, A., et al.: HEMR: hypergraph embeddings for music recommendation (2023)
Google Scholar
Ferraro, A., et al.: Unsupervised anomaly detection in predictive maintenance using sound data (2023)
Google Scholar
Ferraro, A., Galli, A., La Gatta, V., Postiglione, M.: A deep learning pipeline for network anomaly detection based on autoencoders. In: 2022 IEEE International Conference on Metrology for Extended Reality, Artificial Intelligence and Neural Engineering (MetroXRAINE), pp. 260–264. IEEE (2022)
Google Scholar
Ferraro, A., Galli, A., La Gatta, V., Postiglione, M.: Benchmarking open source and paid services for speech to text: an analysis of quality and input variety. Front. Big Data 6 (2023). https://doi.org/10.3389/fdata.2023.1210559. https://www.frontiersin.org/articles/10.3389/fdata.2023.1210559
Houssein, E.H., Mohamed, R.E., Ali, A.A.: Machine learning techniques for biomedical natural language processing: a comprehensive review. IEEE Access 9, 140628–140653 (2021)
Article Google Scholar
Krishna, K., Agal, A.: Diversity Sampling in Machine Learning
Google Scholar
Margatina, K., Vernikos, G., Barrault, L., Aletras, N.: Active learning by acquiring contrastive examples (2021). https://doi.org/10.48550/ARXIV.2109.03764, https://arxiv.org/abs/2109.03764
Nguyen, V.L., Shaker, M., Hllermeier, E.: How to measure uncertainty in uncertainty sampling for active learning. Mach. Learn. 111, 89–122 (2022)
Article MathSciNet Google Scholar
Settles, B.: Active learning literature survey. Computer Sciences Technical report 1648, University of Wisconsin–Madison (2009)
Google Scholar
Sutton, R., Barto, A.: Reinforcement learning: an introduction. IEEE Trans. Neural Networks 9(5), 1054 (1998). https://doi.org/10.1109/TNN.1998.712192
Article Google Scholar
Uzuner, Ö., South, B.R., Shen, S., DuVall, S.L.: 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. J. Am. Medical Informatics Assoc. 18(5), 552–556 (2011). https://doi.org/10.1136/amiajnl-2011-000203

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering and Information Technology (DIETI), University of Naples “Federico II”, Via Claudio 21, Naples, Italy
Antonino Ferraro, Antonio Galli, Valerio La Gatta, Mario Minocchi, Vincenzo Moscato & Marco Postiglione

Authors

Antonino Ferraro
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Galli
View author publications
You can also search for this author in PubMed Google Scholar
Valerio La Gatta
View author publications
You can also search for this author in PubMed Google Scholar
Mario Minocchi
View author publications
You can also search for this author in PubMed Google Scholar
Vincenzo Moscato
View author publications
You can also search for this author in PubMed Google Scholar
Marco Postiglione
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Antonino Ferraro .

Editor information

Editors and Affiliations

Faculty of Information Engineering, Fukuoka Institute of Technology, Fukuoka, Japan
Leonard Barolli

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ferraro, A., Galli, A., La Gatta, V., Minocchi, M., Moscato, V., Postiglione, M. (2024). Few Shot NER on Augmented Unstructured Text from Cardiology Records. In: Barolli, L. (eds) Advances in Internet, Data & Web Technologies. EIDWT 2024. Lecture Notes on Data Engineering and Communications Technologies, vol 193. Springer, Cham. https://doi.org/10.1007/978-3-031-53555-0_1

Download citation

DOI: https://doi.org/10.1007/978-3-031-53555-0_1
Published: 14 February 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53554-3
Online ISBN: 978-3-031-53555-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Few Shot NER on Augmented Unstructured Text from Cardiology Records

Abstract

Access this chapter

Similar content being viewed by others

AHIAP: An Agile Medical Named Entity Recognition and Relation Extraction Framework Based on Active Learning

An Efficient Text Labeling Framework Using Active Learning Model

Weak Supervision and Clustering-Based Sample Selection for Clinical Named Entity Recognition

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Few Shot NER on Augmented Unstructured Text from Cardiology Records

Abstract

Access this chapter

Similar content being viewed by others

AHIAP: An Agile Medical Named Entity Recognition and Relation Extraction Framework Based on Active Learning

An Efficient Text Labeling Framework Using Active Learning Model

Weak Supervision and Clustering-Based Sample Selection for Clinical Named Entity Recognition

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation