An Approach to Mining Picture Objects Based on Textual Cues
The task of extracting knowledge from text is an important research problem for information processing and document understanding. Approaches to capture the semantics of picture objects in documents constitute subjects of great interest in the domain of document mining recently. In this paper, we present an approach to extracting information about picture objects in a document using cues from the text written about them. The goal of this work is to mine a document and understand the content of picture objects in the document based on meaning inferred from the texts written about such objects. We apply some Natural Language Processing techniques to extract semantic information about picture objects in a document and process texts written about them. The mining algorithms were developed and implemented as a working system and gone through testing and experimentations. Results and future extensions of the work are discussed in this paper.
KeywordsInput File Mining Algorithm Multimedia Data Parse Tree Image Mining
Unable to display preview. Download preview PDF.
- 1.Wynne, H., Mong, L.L., Zhang, J.: Image Mining: Trends and Developments. Journal of Intelligent Information System (JISS): Special Issue on Multimedia Data Mining, 97–106 (2002)Google Scholar
- 2.Popescul, A., Ungar, L.H., Lawrence, S., Pennock, D.M.: Statistical relational learning for document mining. In: Third IEEE International Conference on Data Mining, ICDM 2003, November 19-22, pp. 275–282 (2003)Google Scholar
- 3.Qin-Bao, S., Nai-Qian, L., Jun-Yi, S., Li-Ming, C.: Web documents mining. In: Proceedings of 2002 International Conference on Machine Learning and Cybernetics, November 4-5, vol. 2, pp. 791–795 (2002)Google Scholar
- 5.Ahonen, H., Heinonen, O., Klemettinen, M., Verkamo, A.I.: Applying Data mining techniques in text analysis. Report C-1997-23, University of Helsinki, Department of Computer Science (March 1997)Google Scholar
- 6.Dixon, M.: An Overview of Document Mining Technology. A research report Computer Based Learning Unit, University of Leeds (October 1997)Google Scholar
- 9.Zaiane, O.R., Han, J., Li, Z., Hou, J.: Mining Multimedia Data. In: CASCON 1998: Meeting of Minds, Toronto, Canada, November 1998, pp. 83–96 (1998)Google Scholar
- 10.Zaiane, O.R., Fall, A., Dahl, R.V., Tarau, P.: On-line resource discovery using natural language. In: Proceedings, RIAO 1997, Montreal, Canada, June 25-27, pp. 65–73 (1997)Google Scholar
- 11.Futrelle, R.P.: Summarization of Diagrams in Documents. In: Mani, I., Maybury, M. (eds.) Advances in Automated Text Summarization, Cambridge, MA, March 1999, pp. 61–65 (1999)Google Scholar
- 12.Zhang, J., Wynne, H., Mong, L.L.: Image Mining: Issues, Frameworks and Techniques. In: Second International Workshop on Multimedia Data Mining-MDM/KDD San Francisco, U.S.A., August 2001, pp. 34–42 (2001)Google Scholar
- 13.Landauer, T.K., Foltz, P.W., Laham, D.: An Introduction to Latent Semantic Analysis. Discourse Processes (25), 259–284 (1998)Google Scholar
- 14.Meyers, A., Hilster, D.: Description of the TexUS System as used for MUC-4. In: Proceedings of MUC-4. DARPA, March 1992, pp. 207–214 (1992)Google Scholar