Person Name Segmentation with Deep Neural Networks

Santosh, Tokala Yaswanth Sri Sai; Sanyal, Debarshi Kumar; Das, Partha Pratim

doi:10.1007/978-3-030-66187-8_4

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11987)

Included in the following conference series: International Conference on Mining Intelligence and Knowledge Exploration

Abstract

Person names often need to be represented in a consistent format in an application, for example, in <Last Name, Given Name, Suffix> format in library catalogs. Obtaining a normalized representation automatically from an input name requires precise labeling of its components. The process is difficult owing to numerous cultural conventions in writing personal names. In this paper, we propose deep learning-based techniques to achieve this using sequence-to-sequence learning. We design several architectures using a bidirectional long short-term memory (BiLSTM)-based recurrent neural network (RNN). We compare these methods with one based on the hidden Markov model. We perform experiments on a large collection of author names drawn from the National Digital Library of India. The best accuracy of 94% is achieved by the character-level BiLSTM with a conditional random field at the output layer. We also show visualizations of the vectors (representing person names) learned by a BiLSTM and how these vectors are clustered according to name structures. Our study shows that deep learning is a promising approach to automatic name segmentation.
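
To make the normalization step concrete, the following is a minimal sketch of how a labeled name could be assembled into the <Last Name, Given Name, Suffix> format once each token has been tagged. It is not the authors' implementation: the label names (LAST, GIVEN, SUFFIX) and the function normalize are assumptions introduced here for illustration, and the tagging itself (done in the paper by a BiLSTM or HMM) is taken as given.

```python
from typing import Dict, List, Tuple

# Hypothetical label set; the abstract does not specify the actual tag inventory.
LAST, GIVEN, SUFFIX = "LAST", "GIVEN", "SUFFIX"


def normalize(labeled_tokens: List[Tuple[str, str]]) -> str:
    """Build a 'Last Name, Given Name, Suffix' string from (token, label) pairs.

    Tokens sharing a label are joined in their input order; empty fields
    are skipped so that a name without a suffix has no trailing comma.
    """
    parts: Dict[str, List[str]] = {LAST: [], GIVEN: [], SUFFIX: []}
    for token, label in labeled_tokens:
        if label not in parts:
            raise ValueError(f"unknown label: {label!r}")
        parts[label].append(token)
    fields = [" ".join(parts[key]) for key in (LAST, GIVEN, SUFFIX)]
    return ", ".join(field for field in fields if field)


# Example input as a sequence labeler might produce it (labels assumed).
example = [("Martin", GIVEN), ("Luther", GIVEN), ("King", LAST), ("Jr.", SUFFIX)]
print(normalize(example))
```

The sketch shows why precise labeling matters: the output format is fixed, so any token assigned the wrong label ends up in the wrong field of the normalized name.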

Acknowledgements

This work is supported by the National Digital Library of India Project sponsored by the Ministry of Human Resource Development, Government of India at IIT Kharagpur.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, 721302, India
Tokala Yaswanth Sri Sai Santosh & Partha Pratim Das
National Digital Library of India, Indian Institute of Technology Kharagpur, Kharagpur, 721302, India
Debarshi Kumar Sanyal

Corresponding author

Correspondence to Debarshi Kumar Sanyal.

Editor information

Editors and Affiliations

National Institute of Technology, Goa, India
Purushothama B. R.
National Institute of Technology, Goa, India
Veena Thenkanidiyoor
Indian Institute of Information Technology, Sri City, India
Rajendra Prasath
Indian Institute of Information Technology, Sri City, India
Odelu Vanga

Cite this paper

Santosh, T.Y.S.S., Sanyal, D.K., Das, P.P. (2020). Person Name Segmentation with Deep Neural Networks. In: B. R., P., Thenkanidiyoor, V., Prasath, R., Vanga, O. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2019. Lecture Notes in Computer Science, vol 11987. Springer, Cham. https://doi.org/10.1007/978-3-030-66187-8_4

DOI: https://doi.org/10.1007/978-3-030-66187-8_4
Published: 20 December 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-66186-1
Online ISBN: 978-3-030-66187-8
eBook Packages: Computer Science, Computer Science (R0)
