Skip to main content

Single-Sentence Compression Using SVM

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 758))

Abstract

Presenting a sentence in less number of words compared to its original one without changing the meaning is known as sentence compression. Most recent works on sentence compression models define the problem as an integer linear programming problem and solve it using an external ILP-solver which suffers from slow running time. In this paper, we have presented a machine learning approach to single-sentence compression. The sentence compression task is modeled as a two-class classification problem and used support vector machine to solve the problem. Different learning models are created using different types of kernel functions. Finally, it has been observed that RBF kernel gives good result compared to other kernel functions for this compression task of single sentence.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html.

  2. 2.

    http://storage.googleapis.com/sentencecomp/compression-data.json.

References

  1. Ganesan, K., Zhai, C., Han, J.: Opinosis: a graph-based approach to abstractive summarization of highly redundant opinions. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 340–348, Aug, 2010

    Google Scholar 

  2. Grefenstette, G.: Producing intelligent telegraphic text reduction to provide an audio scanning service for the blind. In: Working Notes of the Workshop on Intelligent Text Summarization, Palo Alto, Cal., pp. 111–117, 23 Mar 1998

    Google Scholar 

  3. Knight, K., Marcu, D.: Statistics-based summarization step one: Sentence compression. In: Proceedings of AAAI-00, pp. 703–710 (2000)

    Google Scholar 

  4. Knight, K., Marcu, D.: Summarization beyond sentence extraction: a probabilistic approach to sentence compression. Artificial Intelligence-139, pp. 703–710 (2002)

    Google Scholar 

  5. Jing, H., McKeown, K.: Cut and paste based text summarization. In: Proceedings of NAACL-00, pp. 178–185 (2000)

    Google Scholar 

  6. Clarke, J., Lapata, M.: Global inference for sentence compression: an integer linear programming approach. J. Artif. Intell. Res. 31, 399–429 (2008)

    Article  Google Scholar 

  7. McDonald, R.: Discriminative sentence compression with soft syntactic evidence. In: Proceedings of EACL-06, pp. 297–304 (2006)

    Google Scholar 

  8. Galley, M., McKeown, K.R.: Lexicalized Markov grammars for sentence compression. In: Proceedings of NAACL-HLT-07, pp. 180–187 (2007)

    Google Scholar 

  9. Filippova, K., Strube, M.: Dependency tree based sentence compression. In: Proceedings of INLG-08, pp. 25–32 (2008)

    Google Scholar 

  10. Cohn, T., Lapata, M.: Sentence compression as tree transduction. J. Artif. Intell. Res. 34, 637–674 (2009)

    Article  Google Scholar 

  11. Nomoto, T.: A comparison of model free versus model intensive approaches to sentence compression. In: Proceedings of EMNLP-09, pp. 391–399 (2009)

    Google Scholar 

  12. Wang, L., Raghavan, H., Castelli, V., Florian, R., Cardie, C.: A sentence compression based framework to query-focused multi-document summarization. In: Proceedings of ACL-13, pp. 1384–1394 (2013)

    Google Scholar 

  13. Dorr, B., Zajic, D., Schwartz, R.: Hedge trimmer: a parse-and-trim approach to head-line generation. In: Proceedings of the Text Summarization Workshop at HLT-NAACL-03, Ed-monton, Alberta, Canada, pp. 1–8 (2003)

    Google Scholar 

  14. Hori, C., Furui, S., Malkin, R., Yu, H., Waibel, A.: A statistical approach to automatic speech summarization. EURASIP J. Appl. Signal Process. 2, 128–139 (2003)

    MATH  Google Scholar 

  15. Martins, A.F.T., Smith, N. A.: Summarization with a joing model for sentence extraction and compression. In: ILP for NLP-09, pp. 1–9 (2009)

    Google Scholar 

  16. Berg-Kirkpatrick, T., Gillick, D., Klein, D.: Jointly learning to extract and compress. In: Proceedings of ACL-11 (2011)

    Google Scholar 

  17. Thadani, K., McKeown, K.: Sentence compression with joint structural inference. In: Proceedings of CoNLL-13, pp. 65–74 (2013)

    Google Scholar 

  18. Woodsend, K., Lapata, M.: Multiple aspect summarization using integer linear pro-gramming. In: Proceedings of EMNLP-12, pp. 233–243 (2012)

    Google Scholar 

  19. Almeida, M.B., Martins, A.F.T.: Fast and robust compressive summarization with dual de-composition and multi-task learning. In: Proceedings of ACL-13 (2013)

    Google Scholar 

  20. Filippova, K., Alfonseca, E.: Fast k-best sentence compression (2015). arXiv:1510.08418. Accessed 28 Oct 2015

  21. Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)

    Google Scholar 

  22. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Deepak Sahoo .

Editor information

Editors and Affiliations

Appendix 1

Appendix 1

Table 5 Sentences that are grammatically correct and informative (category A)
Table 6 Sentences that are grammatically correct and Informative but few extra words (category B) given in brackets
Table 7 Sentences that are grammatically incorrect (C)

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sahoo, D., Balabantaray, R.C. (2019). Single-Sentence Compression Using SVM. In: Nayak, J., Abraham, A., Krishna, B., Chandra Sekhar, G., Das, A. (eds) Soft Computing in Data Analytics . Advances in Intelligent Systems and Computing, vol 758. Springer, Singapore. https://doi.org/10.1007/978-981-13-0514-6_48

Download citation

Publish with us

Policies and ethics