Abstract
Enterprise applications usually involve huge, complex, and persistent data to work on, together with business rules and processes. In order to represent, integrate, and use the information coming from the huge, distributed, multiple sources, we present a conceptual model with dynamic multi-level workflows corresponding to a mining-grid centric multi-layer grid architecture, for multi-aspect analysis in building an e-business portal on the Wisdom Web. We show that this integrated model will help to dynamically organize status-based business processes that govern enterprise application integration.
We also present two case studies to demonstrate the effectiveness of the proposed model in the real world. The first case study is about how to organize and mine multiple data sources for behavior-based online customer segmentation, which is the first crucial step of personalization and one-to-one marketing. The second case study is about how to evaluate and monitor data quality, which in return can optimize the knowledge discovery process for intelligent decision making. The proposed methodology attempts to orchestrate various mining agents on the mining-grid for integrating data and knowledge in a unified portal developed by a service-oriented architecture.
Similar content being viewed by others
Notes
Charles H. Spurgeon, English preacher of 19th century 1834–1892.
References
Alonso, G., Casati, F., Kuno, H., and Machiraju, V. 2004. Enterprise Application Integration, Web Services — Concepts, Architectures and Applications, Springer, pp. 67–92.
Berman, F., Fox, G., and Hey, A.J.G. (Eds.) 2003. Grid Computing: Making the Global Infrastructure a Reality. John Wiley & Sons.
Buschmann, F. et al. 1996. Pattern-Oriented Software Architecture: A System of Patterns. Wiley.
Cannataro, M., Congiusta, A., Mastroianni, C., Pugliese, A., Talia, D., and Trunfio, P. 2004. Grid-based data mining and knowledge discovery. In N. Zhong and J. Liu (Eds.), Intelligent Technologies for Information Analysis. Springer-Verlag, pp. 19–45.
Congiusta, A., Pugliese, A., Talia, D., and Trunfio, P. 2003. Designing grid services for distributed knowledge discovery. Web Intelligence and Agent Systems: An International Journal, 1:91–104.
Curcin, V., Ghanem, M., Guo, Y., Köhler, M., Rowe, A., Syed, J., and Wendel, P. 2002. Discovery net: towards a grid of knowledge discovery. Proc. 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 658–663.
Deelman, E., Blythe, J., Gil, Y., and Kesselman, C. 2003. Workflow management in GriPhyN. In J. Nabrzyski et al. (Eds.) Grid Resource Management. Kluwer Academic Publishers, pp. 99–116.
Detlor, B. 2004. Towards Knowledge Portals: From Human Issues to Intelligent Agents, Information Science and Knowledge Management. Kluwer Academic Publishers.
Foster, I. and Kesselman, C. 1997. Globus: A metacomputing infrastructure toolkit. The International Journal of Supercomputer Applications and High Performance Computing, 11(2):115–128.
Foster, I. and Kesselman, C. 1999. The Grid: Blueprint for a Future Computing Infrastructure, 1st edition. Morgan Kaufmann.
Foster, I. and Kesselman, C. 2003a. The Grid: Blueprint for a Future Computing Infrastructure, 2nd edition. Morgan Kaufmann.
Foster, I. and Kesselman, C. 2003b. Concepts and architecture. In I. Foster and C. Kesselman (Eds.) The Grid 2: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, pp. 37–64.
Foster, I., Kesselman, C., and Tuecke, S. 2001. The anatomy of the grid: Enabling scalable virtual organization. International Journal of High Performance Computing Application, 15(3):200–222.
Fowler, M. 2003. Patterns of Enterprise Application Architecture. Addison Wesley Professional.
Gil, Y., Deelman, E., Blythe, J., Kesselman, C., and Tangmunarunkit, H. 2004. Artificial intelligence and grids: Workflow planning and beyond. IEEE Intelligent Systems, Special Issue on e-Science, 19(1):26–33.
Giudici, P. 2001. Association models for web mining. Data Mining and Knowledge Discovery, 5:183–196.
Hackathorn, R.D. 1998. Web Farming for the Data Warehouse. Morgan Kaufmann.
Hall, M. 2001. More Servlets and JavaServer Pages. Sun Microsystems Press.
Hall, M. and Brown, L. 2001. Core Web Programming, 2nd edition Sun Microsystems Press.
Han, J.W. and Chang, K. Ch. 2002. Data mining for web intelligence. IEEE Computer, 35(11):64–70, 2002.
Hu, J. and Zhong, N. 2004. Organizing dynamic multi-level workflows on multi-layer grids for developing e-business portals. Proc. 2004 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 777–778.
Hu, J. and Zhong, N. 2005. Developing e-business portals with dynamic multi-level workflows on the multi-layer grid. Proc. 2005 IEEE International Conference on e-Technology, e-Commerce and e-Service, pp. 196–201.
Huang, J.J., Liu, Ch. N., Ou, Ch. X., Zhong, N., and Yao, Y. Y. 2003. Attribute reduction of rough sets in mining market value functions. Proc. 2003 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 470–473.
Hunter, J. 2001. Java Servlet Programming, 2nd edition O'Reilly.
Ling, Ch. X. and Li, Ch. H. 1998. Data mining for direct marketing: Problems and solutions. Proc. 4th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 73–79.
Liu, J. 2003. Web intelligence (WI): What makes wisdom web?. Proc. 18th International Joint Conference on Artificial Intelligence, pp. 1596–1601.
Liu, J., Zhang, Sh. W., and Yang, J. 2004. Characterizing web usage regularities with information foraging agents. IEEE Transactions on Knowledge and Data Engineering, 16(5):566–584.
Maier, R. 2004. Knowledge Management Systems: Information and Communication Technologies for Knowledge Management, 2nd edition. Springer-Verlag.
Nabrzyski, J., Schopf, J.M., and Weglarz, J. 2004. Grid Resource Management: State of the Art and Future Trends. Kluwer Publishing.
Preuner, G. and Schrefl, M. 2000. A three-level schema architecture for the conceptual design of web-based information systems. World Wide Web, 3(2):125–138.
Preuner, G. and Schrefl, M. 2003. Integration of web services into workflows through a multi-level schema architecture. Proc. 4th IEEE International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, pp. 51–60.
Priebe, T. and Pernul, G. 2003. Towards integrative enterprise knowledge portals. Proc. 12th International Conference on Information and Knowledge Management, pp. 216–223.
Pyle, D. 2003. Business Modeling and Data Mining. Morgan Kaufmann.
Stork, H.G. 2002. Webs, grids and knowledge spaces: Programmes, projects and prospects. Journal of Universal Computer Science, 8(9):848–867.
The OGSA-DAI project: http://www.ogsadai.org.uk/.
Tomita, K., Zhong, N., and Yamauchi, H. 2004. Coupling global semantic web with local information sources for problem solving. Proc. 1st International Workshop on Semantic Web Mining and Reasoning, pp. 66–74.
Wege, C. 2002. Portal server technology. IEEE Internet Computing, 6(3):73–77.
Wu, X. and Zhang, S. 2003. Synthesizing high-frequency rules from different data sources. IEEE Transactions on Knowledge and Data Engineering, 15(2):353–367.
Xu, M., Hu, Zh. H., Long, W.H., and Liu, W. 2003. Service virtualization: Infrastructure and applications. I. Foster and C. Kesselman (Eds.) The Grid: Blueprint for a Future Computing Infrastructure, 2nd. Morgan Kaufmann, pp. 179–189.
Zhang S., Wu, X., and Zhang, C. 2003. Multi-database mining. IEEE Computational Intelligence Bulletin, 2(1):5–13.
Zhong, N., Liu, Ch. N., and Ohsuga, S. 2001. Dynamically organizing KDD processes. International Journal of Pattern Recognition and Artificial Intelligence, World Scientific, 15(3):451–473.
Zhong, N. 2004. Developing intelligent portals by using WI technologies. J.P. Li et al. (Eds.) Wavelet Analysis and Its Applications, and Active Media Technology. World Scientific, 2, pp. 555–567.
Zhong, N., Hu, J., and Motomura, S. 2005. Building a data mining grid for multiple human brain data analysis. Computational Intelligence, An International Journal, 21(2):177–196.
Zhong, N. and Liu, J. 2004. The alchemy of intelligent IT (iIT): Blueprint for future of information technology. N. Zhong and J. Liu (Eds.) Intelligent Technologies for Information Analysis, Springer Monograph, pp. 1–16.
Zhong, N., Ohara, H., Iwasaki, T., and Yao, Y.Y. 2003a. Using WI technology to develop intelligent enterprise portals. Proc. International Workshop on Applications, Products and Services of Web-based Support Systems, pp. 83–90.
Zhong, N., Yao, Y.Y., Liu, Ch. N., Ou, Ch. X., and Huang, J.J. 2004. Data mining for targeted marketing. N. Zhong and J. Liu (Eds.) Intelligent Technologies for Information Analysis, Springer-Verlag, pp. 109–131.
Zhong, N., Yao, Y.Y., and Ohshima M. 2003b. Peculiarity oriented multidatabase mining. IEEE Transactions on Knowledge and Data Engineering, 15(4):952–960.
Author information
Authors and Affiliations
Corresponding author
Additional information
Note
Rights and permissions
About this article
Cite this article
Hu, J., Zhong, N. Organizing Multiple Data Sources for Developing Intelligent e-Business Portals. Data Min Knowl Disc 12, 127–150 (2006). https://doi.org/10.1007/s10618-005-0018-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10618-005-0018-2