Abstract
In this paper we analyze collections (clusters) of news texts by extracting their paraphrase headlines into a paraphrase graph and studying that graph. Our aim is to test whether a news headline is an appropriate form of news text compression. We analyze three types of news collections: dynamic, static, and combined (both dynamic and static) clusters, and show that their respective paraphrase graphs reflect the characteristics of the texts. We also automatically extract the most informationally important linked fragments of the news texts; these fragments characterize a text as either informative (conveying information) or publicistic (trying to affect readers emotionally). We show that news headlines of the informative type do represent compressed versions of their respective news reports.
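To make the notion of a paraphrase graph concrete, the following is a minimal sketch and not the authors' implementation. It assumes that headlines are nodes and that two headlines are linked when their token-level Jaccard overlap reaches a threshold; the similarity measure, the threshold value of 0.5, and all function names are illustrative assumptions, since the abstract does not specify how paraphrases are detected.

```python
from itertools import combinations


def tokens(headline):
    """Lowercase a headline and return its set of punctuation-stripped words."""
    return {w.strip(".,:;!?\"'()").lower() for w in headline.split()} - {""}


def jaccard(a, b):
    """Jaccard overlap of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def build_paraphrase_graph(headlines, threshold=0.5):
    """Return an adjacency dict: headline index -> set of linked indices.

    ASSUMPTION: token Jaccard overlap stands in for a real paraphrase
    detector; the threshold is an arbitrary illustrative value.
    """
    token_sets = [tokens(h) for h in headlines]
    graph = {i: set() for i in range(len(headlines))}
    for i, j in combinations(range(len(headlines)), 2):
        if jaccard(token_sets[i], token_sets[j]) >= threshold:
            graph[i].add(j)
            graph[j].add(i)
    return graph


def connected_components(graph):
    """Group nodes into connected components using an iterative DFS."""
    seen, components = set(), []
    for start in graph:
        if start in seen:
            continue
        seen.add(start)
        stack, component = [start], []
        while stack:
            node = stack.pop()
            component.append(node)
            for neighbour in graph[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        components.append(sorted(component))
    return components
```

Under these assumptions, headlines that land in the same connected component can be read as paraphrases of one another, and properties such as component size or edge density give a rough picture of how uniformly a cluster's headlines describe the same event.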
Notes
- 1. A program implementing the described algorithm is available at http://donelaitis.vdu.lt/~vidas/tools.htm.
- 2.
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Kochetkova, N., Pronoza, E., Yagunova, E. (2018). News Headline as a Form of News Text Compression. In: Staab, S., Koltsova, O., Ignatov, D. (eds) Social Informatics. SocInfo 2018. Lecture Notes in Computer Science, vol 11186. Springer, Cham. https://doi.org/10.1007/978-3-030-01159-8_13
DOI: https://doi.org/10.1007/978-3-030-01159-8_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01158-1
Online ISBN: 978-3-030-01159-8
eBook Packages: Computer Science, Computer Science (R0)