Skip to main content
Log in

The automatic retrieval of news entities based on the structure of a news cluster

  • Published:
Scientific and Technical Information Processing Aims and scope

Abstract

This paper deals with a method for the retrieval of variants for naming the same entity based on the structural arrangement of a news cluster. The method is based on both the comparison of different contexts of phrase usage and the comparison of phrase usage in the same or neighboring sentences.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Loukachevitch, N.V., Dobrov, B.V., and Shternov, S.V., Treatment of News Flows Based on Large Linguistic Resources. Internet Matematika-2005. http://download.yandex.ru/company/grant/2005/10-Loukachevitch-103030.pdf)

  2. Dobrov, B.V. and Pavlov, A.M., Study of Quality of Basic Methods of Clusterization of News Flow in Daily Temporal Window, Trudy Konferentsii RCDL-2010 (Proc. Russ. Conf.: Digital Libraries-2010), 2010.

  3. Li, J., Sun, L., Kit, C., and Webster, J., A Query-Focused Multi-Document Summarizer Based on Lexical Chains, Proc. Document Understanding Conf. DUC-2007, 2007.

  4. Loukachevitch, N.V. and Dobrov, B.V., Summarization of News Clusters Based on Thematic Representation, in Kompyuternaya Lingvistika I Intellektualnye Tekhnologii (Computer Linguistics and Intellectual Technologies), Moscow: Nauka, 2009, No. 8, pp. 299–305.

    Google Scholar 

  5. Ermakov, A.E., Automatic Extraction of Facts from Texts of Personal Files: Experience in Anaphora Resolution, in Kompyuternaya Lingvistika I Intellektualnye Tekhnologii (Computer Linguistics and Intellectual Technologies), Moscow: Nauka, 2007, pp. 172–177.

    Google Scholar 

  6. Doddington, G., Mitchell, A., Przybocki, M., Ramshaw, L., Strassel, S., and Weishedel, R., The Automatic Content Extraction (ACE): Task, Data, Evaluation, Proc. 4th Int. Conf. on Language Resources and Evaluation, LREC-2004, 2004.

  7. Arutyunova, N.D., Predlozhenie i ego smysl (Preposition and Its Sense), Moscow, 1976.

  8. Halliday, M. and Hasan, R., Cohesion in English London: Longman, 1976.

    Google Scholar 

  9. Barzilay, R. and Lee, L., Learning to Paraphrase: An Unsupervised Approach Using Multiple Sequence Alignment, Proc. HLT/NAACL, 2003.

  10. Dolan, B., Quirk, Ch., and Brockett, Ch., Unsupervised Construction of Large Paraphrase Corpora: Exploiting Massively Parallel News Sources, Proc. COLING-2004 2004.

  11. van Dijk, T., Semantic Discourse Analysis, Handbook of Discourse Analysis, Teun, A., Ed., London: Academic Press, 1985, vol. 2, pp. 103–136.

    Google Scholar 

  12. Hirst, G. and St-Onge, D., Lexical Chains as Representation of Context for the Detection and Correction Malapropisms, WordNet: An Electronic Lexical Database and Some of Its Applications Fellbaum, C., Ed., Cambrige, MA: MIT Press, 1998.

    Google Scholar 

  13. Hasan, R., Coherence and Cohesive Harmony, Understanding Reading Comprehension Flood, J., Ed., Newark, DE: Int. Read. Ass., 1984, pp. 181–219.

    Google Scholar 

  14. Loukachevitch, N., Multigraph Representation for Lexical Chaining, Proc. SENSE Workshop on Conceptual Structures for Extracting Natural language Semantics Moscow, 2009, pp. 67–76.

  15. Dobrov, B.V., Loukachevitch, N.V., and Syromyatnikov, S.V., Automatic Detection of Test Entries for Information Retrieval Thesaurus, Trudy pyatoi vserossiiskoi nauchnoi konferentsii “Elektronnye biblioteki: perspektivnye metody i tekhnologii, elektronnye kollektsii, (Proc. 5th All-Russ. Sci. Conf.’ Digital Libraries: Advanced Methods and Technologies), 2003, pp. 201–210.

  16. Yang, H. and Callan, J., A Metric-Based Framework for Automatic Taxonomy Induction, Proc. of ACL-2009, 2009.

  17. Su, Nam Kim., Medelyan, O., Min-Yen, Kan, and Baldwin, T., Automatic Keyphrase Extraction from Scientific Articles, Proc. 5th Int. Workshop on Semantic Evaluation ACL-2010. SemEval-2010 Task-5 2010, pp. 21–26.

  18. Dobrov B.V. and Loukachevitch, N.V., Thesaurus’ RuTez’ as Resource for Solution of Informational Search Problems, Trudy vserossiiskoi konferentsii ZNANIYa-ONTOLOGII-TEORII (Proc. All-Russ. Conf. ‘Knowledges, Ontologies, Theories’), vol. 1, pp. 250–259.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. A. Alekseev.

Additional information

Original Russian Text © A.A. Alekseev, N.V. Loukachevitch, 2012, published in Iskusstvennyi Intellekt i Prinyatie Reshenii, Seriya 1, 2012, No. 4, pp. 51–59.

About this article

Cite this article

Alekseev, A.A., Loukachevitch, N.V. The automatic retrieval of news entities based on the structure of a news cluster. Sci. Tech.Inf. Proc. 39, 303–309 (2012). https://doi.org/10.3103/S0147688212060019

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.3103/S0147688212060019

Keywords

Navigation