Skip to main content
Log in

A text categorization system with soft real-time guarantee

  • Automatic Text Indexing and Classification
  • Published:
Wuhan University Journal of Natural Sciences

Abstract

In order to provide predictable runtime performance for text categorization (TC) systems, an innovative system design method is proposed for soft real-time TC systems. An analyzable mathematical model is established to approximately describe the nonlinear and time-varying TC systems. According to this mathematical model, the feedback control theory is adopted to prove the system's stableness and zero steady state error. The experiments result shows that the error of deadline satisfied ratio in the system is kept within 4% of the desired value. And the number of classifiers can be dynamically adjusted by the system itself to save the computation resources. The proposed methodology enables the theoretical analysis and evaluation to the TC systems, leading to a high-quality and low-cost implementation approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Lewis D D. Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval.Proceedings of the 10th European Conference on Machine Learning. Berlin: Springer-Verlag, 1998. 4–19.

    Google Scholar 

  2. Yang Yi-ming, Liu Xin. A Re-Examination of Text Categorization Methods.Proceedings of the 22 nd International Conference on Research and Development in Information Retrieval. Berkeley: ACM Press, 1999. 42–49.

    Google Scholar 

  3. Yang Yi-ming, Chute C G. An Example-Based Mapping Method for Text Categorization and Retrieval.ACM Transaction on Information Systems, 1994,12(3): 252–277.

    Article  Google Scholar 

  4. Wiener E, Federsen J O, Weigend A S. A Neural Network Approach to Topic Spotting.Proceedings of the 4th Annual Symposium on Document Analysis and Information Retrieval. Las Vegas: Press of University of Nevada, 1995. 317–332.

    Google Scholar 

  5. Schapire R E, Singer Y. Improved Boosting Algorithms Using Confidence-Rated Predictions.Machine Learning, 1999,37(3): 297–336.

    Article  MATH  Google Scholar 

  6. Joachims T. Text Categorization with Support Vector Machines: Learning with Many Relevant Features.Proceedings of the 10th European Conference on Machine Learning. Berlin: Springer-Verlag, 1998. 137–142.

    Google Scholar 

  7. Grossman D A, Frieder O.Information Retrieval—Algorithms and Heuristics. Massachusetts: Kluwer Academic Publishers, 1998.

    MATH  Google Scholar 

  8. Li Rong-lu, Hu Yun-fa. Noise Reduction to Text Categorization Based on Density for KNN.Proceedings of the International Conference on Machine Learning and Cybernetics]. Xi'an: Institute of Electrical and Electronics Engineers Inc, 2003. 3119–3124.

    Google Scholar 

  9. Zhou Shui-geng, Ling T W, Guan Ji-hong,et al. Fast Text Classification: A Training-corpus Pruning Based Approach.Proceedings of 8th International Conference on Database Systems for Advanced Applications. Los Alamitos: IEEE Computer Society, 2003. 127–136.

    Google Scholar 

  10. Deng Zhi-hong, Tang Shi-wei, Yang Dong-qing,et al. SRFW: A Simple, Fast and Effective Text Classification Algorithm.Proceedings of International Conference on Machine Learning and Cybernetics. Piscataway, NJ: IEEE Computer Society, 2002. 1267–1271.

    Google Scholar 

  11. Liu C L, Layland J W. Scheduling Algorithms for Multiprogramming in a Hard Real Time Environment.Journal of the ACM, 1973,20(1): 46–61.

    Article  MATH  MathSciNet  Google Scholar 

  12. Buttazzo G C.Hard Real-Time Computing System: Predictable Scheduling Algorithms and Applications. Massachusetts: Kluwer Academic Publishers, 2000.

    Google Scholar 

  13. Diaz L, Garcia D F, Kim K,et al. Stochastic Analysis of Periodic Real-Time Systems.Proceedings of the 23rd IEEE Real-Time Systems Symposium. Los Alamitos CA: IEEE Computer Society, 2002. 289–300.

    Google Scholar 

  14. Franklin G F, Powell J D, Workman M L.Digital Control of Dynamic Systems. 3rd Edition. Massachusetts: Addison-Wesley, 1998.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Additional information

Foundation item: Supported by the National Natural Science Foundation of China (90104032), the National High-Tech Research and Development Plan of China (2003AA1Z2090)

Biography: WANG Hua-yong(1978-), male, Ph. D candidate, research direction: information retrieval, real-time system.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hua-yong, W., Yu, C. & Yi-qi, D. A text categorization system with soft real-time guarantee. Wuhan Univ. J. Nat. Sci. 11, 226–229 (2006). https://doi.org/10.1007/BF02831736

Download citation

  • Received:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02831736

Key words

CLC number

Navigation