Skip to main content

Person Name Segmentation with Deep Neural Networks

  • Conference paper
  • First Online:
Mining Intelligence and Knowledge Exploration (MIKE 2019)

Abstract

Person names often need to be represented in a consistent format in an application, for example, in <Last Name, Given Name, Suffix> format in library catalogs. Obtaining a normalized representation automatically from an input name requires precise labeling of its components. The process is difficult owing to numerous cultural conventions in writing personal names. In this paper, we propose deep learning-based techniques to achieve this using sequence-to-sequence learning. We design several architectures using a bidirectional long short-term memory (BiLSTM)-based recurrent neural network (RNN). We compare these methods with one based on the hidden Markov model. We perform experiments on a large collection of author names drawn from the National Digital Library of India. The best accuracy of \(94\%\) is achieved by the character-level BiLSTM with a conditional random field at the output layer. We also show visualizations of the vectors (representing person names) learned by a BiLSTM and how these vectors are clustered according to name structures. Our study shows that deep learning is a promising approach to automatic name segmentation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Borkar, V., Deshmukh, K., Sarawagi, S.: Automatic segmentation of text into structured records. In: ACM SIGMOD Record, vol. 30, pp. 175–186. ACM (2001)

    Google Scholar 

  2. Churches, T., Christen, P., Lim, K., Zhu, J.X.: Preparation of name and address data for record linkage using hidden Markov models. BMC Med. Inform. Decis. Mak. 2(1), 9 (2002)

    Article  Google Scholar 

  3. Das, G.S., Li, X., Sun, A., Kardes, H., Wang, X.: Person-name parsing for linking user web profiles. In: Proceedings of the 18th International Workshop on Web and Databases, pp. 20–26. ACM (2015)

    Google Scholar 

  4. Deng, L.: A tutorial survey of architectures, algorithms, and applications for deep learning. APSIPA Trans. Signal Inf. Process. 3, e2 (2014)

    Article  Google Scholar 

  5. Ester, M., Kriegel, H.P., Sander, J., Xu, X., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the SIGKDD Conference on Knowledge Discovery and Data Mining 1996, pp. 226–231 (1996)

    Google Scholar 

  6. Forney, G.D.: The Viterbi algorithm. Proc. IEEE 61(3), 268–278 (1973)

    Article  MathSciNet  Google Scholar 

  7. Gonçalves, R.D.C.B., Freire, S.M.: Name segmentation using hidden Markov models and its application in record linkage. Cadernos de Saude Publica 30(10), 2039–2048 (2014)

    Article  Google Scholar 

  8. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  9. Johnson, S.B., Bales, M.E., Dine, D., Bakken, S., Albert, P.J., Weng, C.: Automatic generation of investigator bibliographies for institutional research networking systems. J. Biomed. Inform. 51, 8–14 (2014)

    Article  Google Scholar 

  10. Keras-Team: Keras documentation (2018). https://keras.io/. Accessed 09 Mar 2019

  11. Lipton, Z.C., Berkowitz, J., Elkan, C.: A critical review of recurrent neural networks for sequence learning. arXiv preprint arXiv:1506.00019 (2015)

  12. van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9(Nov), 2579–2605 (2008)

    MATH  Google Scholar 

  13. Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77(2), 257–286 (1989)

    Article  Google Scholar 

  14. Sarawagi, S.: Information extraction. Found. Trends Databases 1(3), 261–377 (2008)

    Article  Google Scholar 

  15. Sutton, C., McCallum, A.: An introduction to conditional random fields. Found. Trends® Mach. Learn. 4(4), 267–373 (2012)

    Google Scholar 

  16. Yadav, V., Bethard, S.: A survey on recent advances in named entity recognition from deep learning models. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 2145–2158 (2018)

    Google Scholar 

  17. Zeyer, A., Doetsch, P., Voigtlaender, P., Schlüter, R., Ney, H.: A comprehensive study of deep bidirectional LSTM RNNs for acoustic modeling in speech recognition. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2462–2466. IEEE (2017)

    Google Scholar 

Download references

Acknowledgements

This work is supported by the National Digital Library of India Project sponsored by the Ministry of Human Resource Development, Government of India at IIT Kharagpur.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Debarshi Kumar Sanyal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Santosh, T.Y.S.S., Sanyal, D.K., Das, P.P. (2020). Person Name Segmentation with Deep Neural Networks. In: B. R., P., Thenkanidiyoor, V., Prasath, R., Vanga, O. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2019. Lecture Notes in Computer Science(), vol 11987. Springer, Cham. https://doi.org/10.1007/978-3-030-66187-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-66187-8_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-66186-1

  • Online ISBN: 978-3-030-66187-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics