Soft Computing

, Volume 17, Issue 9, pp 1585–1593 | Cite as

A query-oriented XML text summarization for mobile devices

  • Dexi Liu
  • Shihan Wu
  • Yuehua Lan
  • Guoqiang Di
  • Jiezhao Peng
  • Naixue Xiong
  • Athanasios V. Vasilakos
Methodologies and Application

Abstract

Extensible Markup Language (XML) is a simple, flexible text format derived from SGML, which is originally designed to support large-scale electronic publishing. Nowadays XML plays a fundamental role in the exchange of a wide variety of data on the Web. As XML allows designers to create their own customized tags, enables the definition, transmission, validation, and interpretation of data between applications, devices and organizations, lots of works in soft computing employ XML to take control and responsibility for the information, such as fuzzy markup language, and accordingly there are lots of XML-based data or documents. However, most of mobile and interactive ubiquitous multimedia devices have restricted hardware such as CPU, memory, and display screen. So, it is essential to compress an XML document/element collection to a brief summary before it is delivered to the user according to his/her information need. Query-oriented XML text summarization aims to provide users a brief and readable substitution of the original retrieved documents/elements according to the user’s query, which can relieve users’ reading burden effectively. We propose a query-oriented XML summarization system QXMLSum, which extracts sentences and combines them as a summary based on three kinds of features: user’s queries, the content of XML documents/elements, and the structure of XML documents/elements. Experiments on the IEEE-CS datasets used in Initiative for the Evaluation of XML Retrieval show that the query-oriented XML summary generated by QXMLSum is competitive.

Keywords

Mobile devices Query-oriented XML text summarization Query expansion Content and structure 

References

  1. Acampora G, Loia V (2005) Fuzzy control interoperability and scalability for adaptive domotic framework. IEEE Trans Ind Inf 1(2):97–111CrossRefGoogle Scholar
  2. Acampora G, Loia V (2008) A proposal of ubiquitous fuzzy computing for ambient intelligence. Inf Sci 178(3):631–646CrossRefGoogle Scholar
  3. Acampora G, Gaeta M, Loia V, Vasilakos AV (2010) Interoperable and adaptive fuzzy services for ambient intelligence applications. ACM Trans Auton Adapt Syst 5(2), art. no. 8Google Scholar
  4. Acampora G, Lee C-S, Vitiello A, Wang M-H (2012) Evaluating cardiac health through semantic soft computing techniques. Soft Comput 16(7):1183–1196MATHCrossRefGoogle Scholar
  5. Ali MS, Consens M, Gu X et al (2007) Efficient, effective and flexible XML retrieval using summaries. In: Proceedings of the comparative evaluation of XML information retrieval systems, pp 89–103Google Scholar
  6. Barta A, Consens MP, Mendelzon AO (2005) Benefits of path summaries in an XML query optimizer supporting multiple access methods. In: Proceedings of the international conference on very large data bases (VLDB05), pp 133–144Google Scholar
  7. Bergsma S, Lin D, Goebel R (2009) Web-scale N-gram models for lexical disambiguation. In: Proceedings of the 21st international joint conference on artificial intelligence, pp 1507–1512Google Scholar
  8. Chen D, Tang J, Yao L, Li J, Zhou L (2009) Query-focused summarization by combining topic model and affinity propagation. In: Proceedings of APWeb/WAIM 2009, pp 174–185Google Scholar
  9. Comai S, Marrara S (2004) XML document summarization: using XQuery for synopsis creation. In: Proceedings of the 15th international workshop on database and expert systems applications (DEXA04), pp 928–932Google Scholar
  10. Dalamagas T, Cheng T et al (2004) Clustering XML documents using structural summaries. In: Proceedings of the EDBT workshop on clustering information over the web, pp 547–556Google Scholar
  11. Jinxi X, Bruce Croft W (2000) Improving the effectiveness of informational retrieval with local context analysis. ACM Trans Inf Syst 18(1):79–112CrossRefGoogle Scholar
  12. Lee C-S, Wang M-H, Acampora G, Hsu C-Y, Hagras H (2010) Diet assessment based on type-2 fuzzy ontology and fuzzy markup language. Int J Intell Syst 25(12):1187–1216CrossRefGoogle Scholar
  13. Lin CY, Hovy E (2000) The automated acquisition of topic signatures for text summarization. In: Proceedings of the 18th COLING Conference, pp 495–501Google Scholar
  14. Lin CY, Hovy E (2002) From single to multi-document summarization: a prototype system and its evaluation. In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL), pp 457–464Google Scholar
  15. Moreno-Velo FJ, Barros AB, Sánchez-Solano S, Baturone I (2012) XFSML: an XML-based modeling language for fuzzy systems. 2012 IEEE International Conference on Fuzzy Systems, Australia, pp. 1–8Google Scholar
  16. Polyzotis N, Garofalakis M (2002) Statistical synopses for graph structured XML databases. In: Proceedings of the 2002 ACM SIGMOD, pp 358–369Google Scholar
  17. Qin B, Liu T, Li S (2005) Review of multi-document summarization. J Chin Inf Process 19(6):13–20Google Scholar
  18. Szlávik Z, Tombros A, Lalmas M (2007) Feature- and query-based table of contents generation for XML documents. In: Proceedings of ECIR 2007, pp 456–467Google Scholar
  19. Thomas O, Dollmann T (2010) Fuzzy-EPC markup language: XML based interchange formats for fuzzy process models. Soft Comput XML Data Manag Stud Fuzziness Soft Comput 255:227–257CrossRefGoogle Scholar
  20. Wei F, He Y, Li W, Huang L (2009) Query-oriented summarization based on neighborhood graph model. In: Proceedings of ICCPOL 2009, pp 156–167Google Scholar
  21. Wenjie L, Furu W, Qin L, Yanxiang H (2008) Ranking sentences with positive and negative reinforcement for query-oriented update summarization. In: Proceedings of the 22nd international conference on computational linguistics (Coling 2008), pp 489–496Google Scholar
  22. World Wide Web Consortium. Extensible Markup Language (XML) 1.0 (Third Edition). W3C Recommendation. 2004, http://www.w3.org/TR/REC-xml/

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Dexi Liu
    • 1
    • 2
  • Shihan Wu
    • 3
  • Yuehua Lan
    • 4
  • Guoqiang Di
    • 1
  • Jiezhao Peng
    • 1
  • Naixue Xiong
    • 1
  • Athanasios V. Vasilakos
    • 5
  1. 1.Jiangxi University of Finance and EconomicsNanchangChina
  2. 2.Jiangxi Key Laboratory of Data and Knowledge EngineeringNanchangChina
  3. 3.Songjiang Branch of Shanghai Rural Commercial BankShanghaiChina
  4. 4.Gannan Medical UniversityGanzhouChina
  5. 5.Department of Computer EngineeringUniversity of Western MacedoniaKozaniGreece

Personalised recommendations