Combination of Key Information Extracting with Spoken Document Classification Based on Lattice

Zhang, Lei; Zhang, Zhuo; Xiang, Xue-zhi

doi:10.1007/978-3-642-22691-5_41

Lei Zhang⁴,
Zhuo Zhang⁴ &
Xue-zhi Xiang⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 159))

Included in the following conference series:

International Workshop on Computer Science for Environmental Engineering and EcoInformatics

1547 Accesses

Abstract

Traditionally, the query words in spoken document classification are generated by manual. Here, based on CHI, TFIDF and maximum poster probability (MPP) features, key information extraction is combined with spoken document classification system, where different class has different topic. From the extraction, the weights of the same key word in each topic may be distinct. These weights which reveal the relationship between the word and topic can be taken part in spoken document classification system. Additionally, in the classification system, document length information is adopted when no query is found. The whole classification system is based on lattice, which has more information than 1-best result in speech recognition system. Among CHI, TFIDF and MPP, the system performance of MPP is a little worse than the others. CHI is a little better than TFIDF when the key words number is increasing. Experiments show that when the system is combined weight and document length information, the best performance can achieve 0.769 MAP.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chang, Y.-L., Chen, J.-T.: Latent Dirichlet Learning for Document Summarization. In: ICASSP 2009, pp. 1689–1692. IEEE Press, TaiWan (2009)
Google Scholar
Malian, J.T., Throop, D.R.: Basic Concepts and Distinctions for an Aerospace Ontology of Functions, Entities and Problems. In: Aerospace Conf., pp. 1–18. IEEE Press, Big Sky (2007)
Google Scholar
Chen, B., Wang, H.-M., Lee, L.-S.: Retrieval of Broadcast News Speech in Mandarin Chinese Collected in Taiwan Using Syllable-level Statistical Characteristics. In: ICASSP 2000, pp. 1771–1774. IEEE Press, Istanbul (2000)
Google Scholar
Meng, C.-H., Lee, H.-Y., Lee, L.-S.: Improved Lattice-based Spoken Document Retrieval by Directly Learning from the Evaluation Measures. In: ICASSP 2009, pp. 4893–4896. IEEE Press, Taiwan (2009)
Google Scholar
Mertens, T., Schneider, D.: Efficient Sub-word Lattice Retrieval for German Spoken Term Detection. In: ICASSP2009, pp. 4885–4888. IEEE Press, Taipei (2009)
Google Scholar
Zhang, L., Gao, Y., Xang, X., Lu, D.: A New Syllable-lattice Based Approach for Mandarin Spoken Document Retrieval. In: Wireless Communications & Signal Processing, pp. 1–4. IEEE Press, ShangHai (2009)
Google Scholar
Yang, Y.-M., Pedersen, J.O.: A Comparative Study on Feature Selection in Text Categorization. In: Proc. ICML-14, pp. 12–420 (1997)
Google Scholar
Hazen, T.J., Richardson, F., Margolis, A.: Anna Margolis.: Topic Identification from Audio Recordings using Word and Phone Recognition Lattices. In: ASRU 2007, pp.659–664 (2007)
Google Scholar
Chen, B., Wang, H.M., Lee, L.S.: A Discriminative HMM/N-gram-based Retrieval Approach for Mandarin Spoken Documents. ACM Trans. Asian Lang. Inform. Process. 3(2), 128–145 (2004)
Article Google Scholar
Blanco, R., Barreiro, Á.: Probabilistic Document Length Priors for Language Models. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 394–405. Springer, Heidelberg (2008)
Chapter Google Scholar
Zhai, C.-X., Lafferty, J.: A Study of Smoothing Methods for Language Models Applied to Information Retrieval. ACM Trans. Information Systems 22(2), 179–214 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Information and Communication Engineering College, Harbin Engineering University, Harbin, China
Lei Zhang, Zhuo Zhang & Xue-zhi Xiang

Authors

Lei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhuo Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xue-zhi Xiang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dalian University of Technology, Dalian, China
Yuanxu Yu
Kunming University of Science and Technology, Kunming, China
Zhengtao Yu
International Association for Scientific and High Technology, Kunming, China
Jingying Zhao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, L., Zhang, Z., Xiang, Xz. (2011). Combination of Key Information Extracting with Spoken Document Classification Based on Lattice. In: Yu, Y., Yu, Z., Zhao, J. (eds) Computer Science for Environmental Engineering and EcoInformatics. CSEEE 2011. Communications in Computer and Information Science, vol 159. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22691-5_41

Download citation

DOI: https://doi.org/10.1007/978-3-642-22691-5_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22690-8
Online ISBN: 978-3-642-22691-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics