Abstract
This paper discusses several data mining algorithms and techniques thatwe have developed at the University of Arizona Artificial Intelligence Lab.We have implemented these algorithms and techniques into severalprototypes, one of which focuses on medical information developed incooperation with the National Cancer Institute (NCI) and the University ofIllinois at Urbana-Champaign. We propose an architecture for medicalknowledge information systems that will permit data mining across severalmedical information sources and discuss a suite of data mining tools that weare developing to assist NCI in improving public access to and use of theirexisting vast cancer information collections.
Similar content being viewed by others
References
Agrawal, R., Imielinski, T. & Swami, A. (1993). Database Mining: A Performance Perspective. IEEE Transactions on Knowledge and Data Engineering 5(6): 914-925.
Brill, E. (1993). A Corpus-based Approach to Language Learning. PhD thesis, The University of Pennsylvania, Philadelphia, PA.
Chen, H., Chung, Y., Houston, A. L., Li, P. C. & Schatz, B. R. (1999). Using Neural Networks for Vocabulary Switching. Submitted to IEEE Expert.
Chen, H., Houston, A., Yen, J. & Nunamaker, J. F. (1996a). Toward Intelligent Meeting Agents. IEEE COMPUTER (August) 29(8): 62-70.
Chen, H., Houston, A. L., Swell, R. R. & Schatz B. R. (1998a). Internet Browsing and Searching: User Evaluations of Category Map and Concept Space Techniques. Journal of the American Society for Information Science (May) 49(7): 582-603.
Chen, H., Hsu, P., Orwig, R., Hoopes, L. & Nunamaker, J. F. (1994). Automatic Concept Classification of Text from Electronic Meetings. Communications of the ACM (October) 37(10): 56-73.
Chen, H. & Lynch, K. J. (1992). Automatic Construction of Networks of Concepts Characterizing Document Databases. IEEE Transactions on Systems, Man and Cybernetics (September/October) 22(5): 885-902.
Chen, H., Lynch, K. J. Basu, K. & Ng, D. T. (1993). Generating, Integrating, and Activating Thesauri for Concept-Based Document Retrieval. IEEE EXPERT, Special Series on Artificial Intelligence in Text-Based Information Systems (April) 8(2): 25-34.
Chen, H. & Ng, D. T. (1995). An Algorithmic Approach to Concept Exploration in a Large Knowledge Network (Automatic Thesaurus Consultation): Symbolic Branch-and-Bound vs. Connectionist Hopfield Net Activation. Journal of the American Society for Information Science (June) 46(5): 348-369.
Chen, H., Schuffels, C. & Orwig, R. (1996b). Internet Categorization and Search: A Machine Learning approach. Journal of Visual Communications and Image Representations (March) 7(1): 88-102.
Chen, H., Zhang, Y. & Houston, A. L. (1998b). Semantic Indexing and Searching Using a Hopfield Net. Journal of Information Science (JIS) (January) 24(1): 3-18.
Dalton, J. & Deshmane A. (1991). Artificial Neural Networks. IEEE Potentials (April) 10(2): 33-36.
Decker, K. M. & Focardi, S. (1995). Technology Overview: A Report on Data Mining. Technical Report CSCS TR-95-92, Swiss Scientific Computing Center.
Fayyad, U. M., Piatetsky-Shapiro, G. & Smyth, P. (1996a). From Data Mining to Knowledge Discovery: An Overview. In Fayyad, U. M., Piatetsky-Shapiro, G., Smyth, P. & Uthurusamy, R. (eds.) Advances in Knowledge Discovery and Data Mining, 1-36. AAAI Press/MIT Press.
Fayyad, U. M., Piatetsky-Shapiro, G. & Smyth, P. (1996b). From Data Mining to Knowledge Discovery in Databases. AI Magazine 17(3): 37-54.
Holsheimer, M. & Siebes, A. P. J. M. (1994). Data Mining: The Search for Knowledge in Databases. Technical Report CS-R9406, CWI: Dutch National Research Center.
Honkela, T., Kaski, S., Kohonen, T. & Lagus, K. (1998). Self-Organizing Maps of Very Large Document Collections: Justification for the WEBSOM Method. In Balderjahn, I., Mathar, R. & Schader, M. (eds.) Classification, Data Analysis, and Data Highways, 245-252. Springer: Berlin.
Honkela, T., Kaski, S., Lagus, K. & Kohonen, T. (1996a). Newsgroup Exploration with WEBSOM Method and Browsing Interface. Techical Report A32, Helsinki University of Technology, Laboratory of Computer and Information Science, Espoo, Finland.
Honkela, T., Kaski, S., Lagus, K. & Kohonen, T. (1996b). Self-Organizing Maps of Document Collections. ALMA 1(2). Electronic Journal, address: (http://www.diemme.it/luigi/alma.html).
Hopfield, J. J. (1982). Neural Network and Physical Systems with Collective Computational Abilities. Proceedings of the National Academy of Science, USA 79(4): 2554-2558.
Houston, A. L., Chen, H., Schatz, B. R., Hubbard, S. M., Doszkocs, T. E., Swell, R. R., Tolle, K. M. & Ng, T. D. (1996). Exploring the Use of Concept Space, Category Map Techniques and Natural Language Parsers to Improve Medical Information Retrieval. Technical report, University of Arizona, AI Group Working Paper, January.
Houston, A. L., Chen, H., Schatz, B. R., Hubbard, S. M., Swell, R. R. & Ng, T. D. (1998). Exploring the Use of Concept Space to Improve Medical Information Retrieval. International Journal of Decision Support Systems, forthcoming.
Hubbard, S. M., Martin, N. B. & Thurn, A. L. (1995). NCI's Cancer Information Systems-Bringing Medical Knowledge to Clinicians. Oncology (April) 9(4): 302-314.
Kashi, S., Honkela, T., Lagus, K. & Kohonen, T. (1996). Creating an Order in Digital Libraries with Self-Organizing Maps. In Proceedings of WCNN '96, World Congress on Neural Networks, September 15-18, San Diego, California, 814-817. Mahwah, NJ: Lawrence Erlbaum and INNS Press.
Khosla, R. & Dillon, T. (1997). Knowledge Discovery, Data Mining and Hybrid Systems. In Engineering Intelligent Hybrid Multi-Agent Systems, 143-177. Kluwer Academic Publishers.
Knight, K. (1990). Connectionist Ideas and Algorithms. Communications of the ACM (November) 33(11): 59-74.
Kohonen, T. (1989). Self-Organization and Associate Memory, 3rd edn. Springer-Verlag: Berlin Heidelberg.
Kohonen, T. (1995). Self-Organization Maps. Springer-Verlag: Berlin Heidelberg.
Lippmann, R. P. (1987). An Introduction to Computing with Neural Networks. IEEE Acoustics Speech and Signal Processing Magazine (April) 4(2), 4-22.
Ritter, H. & Kohonen, T. (1989). Self-Organizing Semantic Maps. Biological Cybernetics 61: 241-254.
Salton, G. (1989). Automatic Text Processing. Addison-Wesley Publishing Company, Inc.: Reading, MA.
Tank, D.W. & Hopfield, J. J. (1987). Collective Computation in Neuronlike Circuits. Scientific American (December) 257(6): 104-114.
Tolle, K. M. (1997). Improving Concept Extracting from Text Using Natural Language Processing Noun Phrasing Tools: An Experiment in Medical Information Retrieval. Master's thesis, Unversity of Arizona, Department of MIS, Tucson, AZ, May.
Uthurusamy, R. (1996). From Data Mining to Knowledge Discovery: Current Challenges and Future Directions. In Fayyad, U. M., Piatetsky-Shapiro, G., Smyth, P. & Uthurusamy, R. (eds.) Advances in Knowledge Discovery and Data Mining, 561-572. AAAI Press/MIT Press.
Varnado, S. (1995). The Role of Information Technology in Reducing Health Care Cost. In Proceedings of SPIE — The International Society for Optical Engineering volume 2618: Health Care Information Infrastructure, 36-46. Philadelphia, PA, October.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Houston, A.L., Chen, H., Hubbard, S.M. et al. Medical Data Mining on the Internet: Research on a Cancer Information System. Artificial Intelligence Review 13, 437–466 (1999). https://doi.org/10.1023/A:1006548623067
Issue Date:
DOI: https://doi.org/10.1023/A:1006548623067