Skip to main content

Introduction to Text Analytics

  • Conference paper
  • First Online:
Big Data Management and Analytics (eBISS 2019)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 390))

Included in the following conference series:

  • 320 Accesses

Abstract

Data processing regards analysis of various types of data, including numerical data, signals, texts, pictures, videos, etc. This paper focuses on defining and studying various tasks of text analytics following the typical processing pipeline. Sources of textual data are introduced and related challenges are discussed. Along with the process of text analytics, examples are presented to demonstrate how text analytics should be carried out. Finally, potential applications of text analytics are given including sentiment analysis and automatic generation of content.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://en.wikipedia.org/wiki/Text_mining.

  2. 2.

    https://en.wikipedia.org/wiki/Parsing.

  3. 3.

    https://en.wikipedia.org/wiki/Regular_expression.

  4. 4.

    https://en.wikipedia.org/wiki/XPath.

  5. 5.

    http://openrefine.org/download.html.

  6. 6.

    https://medium.com/@sumn2u/cosine-similarity-between-two-sentences-8f6630b0ebb7.

References

  1. Chapter 11 - Information retrieval: Concepts, models, and systems. In: Gudivada, V.N., Rao, C. (eds.) Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications. Handbook of Statistics, vol. 38, pp. 331–401. Elsevier (2018)

    Google Scholar 

  2. Cooper, D., Schindler, P.: Business Research Methods. McGraw-Hill, New York (2016)

    Google Scholar 

  3. Falkum, I., Vicente, A.: Polysemy: current perspectives and approaches. Lingua 157, 02 (2015)

    Google Scholar 

  4. Fang, X., Zhan, J.: Sentiment analysis using product review data. J. Big Data 2(1), 1–14 (2015). https://doi.org/10.1186/s40537-015-0015-2

    Article  Google Scholar 

  5. Gudivada, V.N.: Chapter 12 - Natural language core tasks and applications. In: Gudivada, V.N., Rao, C. (eds.) Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications. Handbook of Statistics, vol. 38, pp. 403–428. Elsevier (2018)

    Google Scholar 

  6. Hussein, D.M.E.-D.M.: A survey on sentiment analysis challenges. J. King Saud Univ. Eng. Sci. 30(4), 330–338 (2018)

    Google Scholar 

  7. Jurafsky, D., Martin, J.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, vol. 2 (2008)

    Google Scholar 

  8. Kane, D.: Data science - PART XI - text analytics. https://www.datasciencecentral.com/profiles/blogs/an-introduction-to-text-analytics. Accessed 01 Oct 2019

  9. Kotu, V., Deshpande, B.: Chapter 9 - Text mining. In: Kotu, V., Deshpande, B. (eds.) Data Science, 2nd edn., pp. 281–305. Morgan Kaufmann, Burlington (2019)

    Chapter  Google Scholar 

  10. Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Linguisticae Investigationes 30(1), 3–26 (2007)

    Article  Google Scholar 

  11. Sonntag, D.: Assessing the quality of natural language text data. In: INFORMATIK 2004 - Informatik verbindet, Band 1, Beiträge der 34. Jahrestagung der Gesellschaft für Informatik e.V. (GI), Ulm, 20–24 September 2004, pp. 259–263 (2004)

    Google Scholar 

  12. Yadav, V., Bethard, S.: A survey on recent advances in named entity recognition from deep learning models. In: Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, New Mexico, USA, pp. 2145–2158. Association for Computational Linguistics, August 2018

    Google Scholar 

  13. Yordanov, V.: Introduction to natural language processing for text. https://towardsdatascience.com/introduction-to-natural-language-processing-for-text-df845750fb63. Accessed 01 Oct 2019

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Agata Filipowska .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Filipowska, A., Filipiak, D. (2020). Introduction to Text Analytics. In: Kutsche, RD., Zimányi, E. (eds) Big Data Management and Analytics. eBISS 2019. Lecture Notes in Business Information Processing, vol 390. Springer, Cham. https://doi.org/10.1007/978-3-030-61627-4_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-61627-4_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-61626-7

  • Online ISBN: 978-3-030-61627-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics