Extractive Single Document Summarization via Multi-feature Combination and Sentence Compression

  • Conference paper
Natural Language Processing and Chinese Computing (NLPCC 2017)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 10619)

Abstract

In this paper, we extract and generate a short summary of a news article under a length limit of 60 Chinese characters. First, we preprocess the news article by segmenting it into sentences and words, and then extract four kinds of central words from the parse tree to form a keyword dictionary. Next, four features, namely sentence weight, sentence similarity, sentence position, and sentence length, are employed to measure the significance of each sentence. Finally, we extract the two sentences with the highest significance scores and compress them to obtain the summary for each news article. The compression step analyzes the grammatical elements of the original sentences to generate compression rules, and trims syntactic elements according to their parse trees. The evaluation results show that our system is efficient in Chinese news summarization.
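The sentence-ranking step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract names the four features but not their formulas, so the definitions below (keyword ratio, cosine similarity to the whole document, reciprocal position, and closeness to a target length), the weights, and the names `score_sentences`, `top_two`, and `target_len` are all assumptions chosen for illustration. Input sentences are assumed to be already word-segmented, and the parse-tree-based compression step is not covered.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def score_sentences(sentences, keywords, weights=(0.4, 0.3, 0.2, 0.1), target_len=20):
    """Score each tokenized sentence with a weighted sum of four features.

    The feature definitions and default weights are illustrative assumptions,
    not values taken from the paper.
    """
    w_kw, w_sim, w_pos, w_len = weights
    doc_tf = Counter(tok for sent in sentences for tok in sent)
    scores = []
    for i, sent in enumerate(sentences):
        if not sent:
            scores.append(0.0)
            continue
        # Sentence weight: share of tokens found in the keyword dictionary.
        kw = sum(tok in keywords for tok in sent) / len(sent)
        # Sentence similarity: cosine similarity to the whole document.
        sim = cosine(Counter(sent), doc_tf)
        # Sentence position: earlier sentences score higher.
        pos = 1.0 / (i + 1)
        # Sentence length: 1.0 at the target length, lower as it deviates.
        length = min(len(sent), target_len) / max(len(sent), target_len)
        scores.append(w_kw * kw + w_sim * sim + w_pos * pos + w_len * length)
    return scores


def top_two(sentences, keywords):
    """Return the indices of the two highest-scoring sentences, best first."""
    scores = score_sentences(sentences, keywords)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return ranked[:2]


# Toy example with hand-segmented sentences (hypothetical data).
sentences = [
    ["政府", "发布", "新", "政策"],
    ["天气", "很", "好"],
    ["新", "政策", "影响", "企业"],
]
keywords = {"政策", "政府", "企业"}
print(top_two(sentences, keywords))  # [0, 2]
```

A weighted sum keeps each feature's contribution easy to inspect; in practice the weights would need tuning on held-out data, and the two selected sentences would still have to be compressed to fit the 60-character limit.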


Notes

  1. http://www.toutiao.com/

  2. http://www-nlp.stanford.edu/software/segmenter.shtml

  3. https://nlp.stanford.edu/software/lex-parser.html

Acknowledgments

The work presented in this paper is partially supported by the Major Projects of the National Social Science Foundation of China under No. 11&ZD189, the Natural Science Foundation of China under No. 61402341, the Planning Foundation of Wuhan Science and Technology Bureau under No. 2016060101010047, and the Open Foundation of Hubei Province Key Laboratory under No. 2016znss05A.

Author information

Corresponding author

Correspondence to Han Ren.

Copyright information

© 2018 Springer International Publishing AG

Cite this paper

Liu, M., Yu, Y., Qi, Q., Hu, H., Ren, H. (2018). Extractive Single Document Summarization via Multi-feature Combination and Sentence Compression. In: Huang, X., Jiang, J., Zhao, D., Feng, Y., Hong, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2017. Lecture Notes in Computer Science, vol 10619. Springer, Cham. https://doi.org/10.1007/978-3-319-73618-1_70

  • DOI: https://doi.org/10.1007/978-3-319-73618-1_70

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73617-4

  • Online ISBN: 978-3-319-73618-1

  • eBook Packages: Computer Science, Computer Science (R0)
