Data Mining and Knowledge Discovery

, Volume 12, Issue 2–3, pp 127–150 | Cite as

Organizing Multiple Data Sources for Developing Intelligent e-Business Portals

Article

Abstract

Enterprise applications usually involve huge, complex, and persistent data to work on, together with business rules and processes. In order to represent, integrate, and use the information coming from the huge, distributed, multiple sources, we present a conceptual model with dynamic multi-level workflows corresponding to a mining-grid centric multi-layer grid architecture, for multi-aspect analysis in building an e-business portal on the Wisdom Web. We show that this integrated model will help to dynamically organize status-based business processes that govern enterprise application integration.

We also present two case studies to demonstrate the effectiveness of the proposed model in the real world. The first case study is about how to organize and mine multiple data sources for behavior-based online customer segmentation, which is the first crucial step of personalization and one-to-one marketing. The second case study is about how to evaluate and monitor data quality, which in return can optimize the knowledge discovery process for intelligent decision making. The proposed methodology attempts to orchestrate various mining agents on the mining-grid for integrating data and knowledge in a unified portal developed by a service-oriented architecture.

Keywords

intelligent e-business portals the Wisdom Web multi-layer grid dynamic multi-level workflows multi-database mining 

References

  1. Alonso, G., Casati, F., Kuno, H., and Machiraju, V. 2004. Enterprise Application Integration, Web Services — Concepts, Architectures and Applications, Springer, pp. 67–92.Google Scholar
  2. Berman, F., Fox, G., and Hey, A.J.G. (Eds.) 2003. Grid Computing: Making the Global Infrastructure a Reality. John Wiley & Sons.Google Scholar
  3. Buschmann, F. et al. 1996. Pattern-Oriented Software Architecture: A System of Patterns. Wiley.Google Scholar
  4. Cannataro, M., Congiusta, A., Mastroianni, C., Pugliese, A., Talia, D., and Trunfio, P. 2004. Grid-based data mining and knowledge discovery. In N. Zhong and J. Liu (Eds.), Intelligent Technologies for Information Analysis. Springer-Verlag, pp. 19–45.Google Scholar
  5. Congiusta, A., Pugliese, A., Talia, D., and Trunfio, P. 2003. Designing grid services for distributed knowledge discovery. Web Intelligence and Agent Systems: An International Journal, 1:91–104.Google Scholar
  6. Curcin, V., Ghanem, M., Guo, Y., Köhler, M., Rowe, A., Syed, J., and Wendel, P. 2002. Discovery net: towards a grid of knowledge discovery. Proc. 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 658–663.Google Scholar
  7. Deelman, E., Blythe, J., Gil, Y., and Kesselman, C. 2003. Workflow management in GriPhyN. In J. Nabrzyski et al. (Eds.) Grid Resource Management. Kluwer Academic Publishers, pp. 99–116.Google Scholar
  8. Detlor, B. 2004. Towards Knowledge Portals: From Human Issues to Intelligent Agents, Information Science and Knowledge Management. Kluwer Academic Publishers.Google Scholar
  9. Foster, I. and Kesselman, C. 1997. Globus: A metacomputing infrastructure toolkit. The International Journal of Supercomputer Applications and High Performance Computing, 11(2):115–128.Google Scholar
  10. Foster, I. and Kesselman, C. 1999. The Grid: Blueprint for a Future Computing Infrastructure, 1st edition. Morgan Kaufmann.Google Scholar
  11. Foster, I. and Kesselman, C. 2003a. The Grid: Blueprint for a Future Computing Infrastructure, 2nd edition. Morgan Kaufmann.Google Scholar
  12. Foster, I. and Kesselman, C. 2003b. Concepts and architecture. In I. Foster and C. Kesselman (Eds.) The Grid 2: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, pp. 37–64.Google Scholar
  13. Foster, I., Kesselman, C., and Tuecke, S. 2001. The anatomy of the grid: Enabling scalable virtual organization. International Journal of High Performance Computing Application, 15(3):200–222.Google Scholar
  14. Fowler, M. 2003. Patterns of Enterprise Application Architecture. Addison Wesley Professional.Google Scholar
  15. Gil, Y., Deelman, E., Blythe, J., Kesselman, C., and Tangmunarunkit, H. 2004. Artificial intelligence and grids: Workflow planning and beyond. IEEE Intelligent Systems, Special Issue on e-Science, 19(1):26–33.Google Scholar
  16. Giudici, P. 2001. Association models for web mining. Data Mining and Knowledge Discovery, 5:183–196.Google Scholar
  17. Hackathorn, R.D. 1998. Web Farming for the Data Warehouse. Morgan Kaufmann.Google Scholar
  18. Hall, M. 2001. More Servlets and JavaServer Pages. Sun Microsystems Press.Google Scholar
  19. Hall, M. and Brown, L. 2001. Core Web Programming, 2nd edition Sun Microsystems Press.Google Scholar
  20. Han, J.W. and Chang, K. Ch. 2002. Data mining for web intelligence. IEEE Computer, 35(11):64–70, 2002.Google Scholar
  21. Hu, J. and Zhong, N. 2004. Organizing dynamic multi-level workflows on multi-layer grids for developing e-business portals. Proc. 2004 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 777–778.Google Scholar
  22. Hu, J. and Zhong, N. 2005. Developing e-business portals with dynamic multi-level workflows on the multi-layer grid. Proc. 2005 IEEE International Conference on e-Technology, e-Commerce and e-Service, pp. 196–201.Google Scholar
  23. Huang, J.J., Liu, Ch. N., Ou, Ch. X., Zhong, N., and Yao, Y. Y. 2003. Attribute reduction of rough sets in mining market value functions. Proc. 2003 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 470–473.Google Scholar
  24. Hunter, J. 2001. Java Servlet Programming, 2nd edition O'Reilly.Google Scholar
  25. Ling, Ch. X. and Li, Ch. H. 1998. Data mining for direct marketing: Problems and solutions. Proc. 4th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 73–79.Google Scholar
  26. Liu, J. 2003. Web intelligence (WI): What makes wisdom web?. Proc. 18th International Joint Conference on Artificial Intelligence, pp. 1596–1601.Google Scholar
  27. Liu, J., Zhang, Sh. W., and Yang, J. 2004. Characterizing web usage regularities with information foraging agents. IEEE Transactions on Knowledge and Data Engineering, 16(5):566–584.Google Scholar
  28. Maier, R. 2004. Knowledge Management Systems: Information and Communication Technologies for Knowledge Management, 2nd edition. Springer-Verlag.Google Scholar
  29. Nabrzyski, J., Schopf, J.M., and Weglarz, J. 2004. Grid Resource Management: State of the Art and Future Trends. Kluwer Publishing.Google Scholar
  30. Preuner, G. and Schrefl, M. 2000. A three-level schema architecture for the conceptual design of web-based information systems. World Wide Web, 3(2):125–138.Google Scholar
  31. Preuner, G. and Schrefl, M. 2003. Integration of web services into workflows through a multi-level schema architecture. Proc. 4th IEEE International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, pp. 51–60.Google Scholar
  32. Priebe, T. and Pernul, G. 2003. Towards integrative enterprise knowledge portals. Proc. 12th International Conference on Information and Knowledge Management, pp. 216–223.Google Scholar
  33. Pyle, D. 2003. Business Modeling and Data Mining. Morgan Kaufmann.Google Scholar
  34. Stork, H.G. 2002. Webs, grids and knowledge spaces: Programmes, projects and prospects. Journal of Universal Computer Science, 8(9):848–867.Google Scholar
  35. The OGSA-DAI project: http://www.ogsadai.org.uk/.
  36. Tomita, K., Zhong, N., and Yamauchi, H. 2004. Coupling global semantic web with local information sources for problem solving. Proc. 1st International Workshop on Semantic Web Mining and Reasoning, pp. 66–74.Google Scholar
  37. Wege, C. 2002. Portal server technology. IEEE Internet Computing, 6(3):73–77.Google Scholar
  38. Wu, X. and Zhang, S. 2003. Synthesizing high-frequency rules from different data sources. IEEE Transactions on Knowledge and Data Engineering, 15(2):353–367.Google Scholar
  39. Xu, M., Hu, Zh. H., Long, W.H., and Liu, W. 2003. Service virtualization: Infrastructure and applications. I. Foster and C. Kesselman (Eds.) The Grid: Blueprint for a Future Computing Infrastructure, 2nd. Morgan Kaufmann, pp. 179–189.Google Scholar
  40. Zhang S., Wu, X., and Zhang, C. 2003. Multi-database mining. IEEE Computational Intelligence Bulletin, 2(1):5–13.Google Scholar
  41. Zhong, N., Liu, Ch. N., and Ohsuga, S. 2001. Dynamically organizing KDD processes. International Journal of Pattern Recognition and Artificial Intelligence, World Scientific, 15(3):451–473.Google Scholar
  42. Zhong, N. 2004. Developing intelligent portals by using WI technologies. J.P. Li et al. (Eds.) Wavelet Analysis and Its Applications, and Active Media Technology. World Scientific, 2, pp. 555–567.Google Scholar
  43. Zhong, N., Hu, J., and Motomura, S. 2005. Building a data mining grid for multiple human brain data analysis. Computational Intelligence, An International Journal, 21(2):177–196.Google Scholar
  44. Zhong, N. and Liu, J. 2004. The alchemy of intelligent IT (iIT): Blueprint for future of information technology. N. Zhong and J. Liu (Eds.) Intelligent Technologies for Information Analysis, Springer Monograph, pp. 1–16.Google Scholar
  45. Zhong, N., Ohara, H., Iwasaki, T., and Yao, Y.Y. 2003a. Using WI technology to develop intelligent enterprise portals. Proc. International Workshop on Applications, Products and Services of Web-based Support Systems, pp. 83–90.Google Scholar
  46. Zhong, N., Yao, Y.Y., Liu, Ch. N., Ou, Ch. X., and Huang, J.J. 2004. Data mining for targeted marketing. N. Zhong and J. Liu (Eds.) Intelligent Technologies for Information Analysis, Springer-Verlag, pp. 109–131.Google Scholar
  47. Zhong, N., Yao, Y.Y., and Ohshima M. 2003b. Peculiarity oriented multidatabase mining. IEEE Transactions on Knowledge and Data Engineering, 15(4):952–960.Google Scholar

Copyright information

© Springer Science + Business Media, Inc 2005

Authors and Affiliations

  1. 1.Department of Information EngineeringMaebashi Institute of TechnologyMaebashi, GunmaJapan

Personalised recommendations