Skip to main content
Log in

Organizing Multiple Data Sources for Developing Intelligent e-Business Portals

  • Published:
Data Mining and Knowledge Discovery Aims and scope Submit manuscript

Abstract

Enterprise applications usually involve huge, complex, and persistent data to work on, together with business rules and processes. In order to represent, integrate, and use the information coming from the huge, distributed, multiple sources, we present a conceptual model with dynamic multi-level workflows corresponding to a mining-grid centric multi-layer grid architecture, for multi-aspect analysis in building an e-business portal on the Wisdom Web. We show that this integrated model will help to dynamically organize status-based business processes that govern enterprise application integration.

We also present two case studies to demonstrate the effectiveness of the proposed model in the real world. The first case study is about how to organize and mine multiple data sources for behavior-based online customer segmentation, which is the first crucial step of personalization and one-to-one marketing. The second case study is about how to evaluate and monitor data quality, which in return can optimize the knowledge discovery process for intelligent decision making. The proposed methodology attempts to orchestrate various mining agents on the mining-grid for integrating data and knowledge in a unified portal developed by a service-oriented architecture.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1.
Figure 2.
Figure 3.
Figure 4.
Figure 5.
Figure 6.
Figure 7.
Figure 8.
Figure 9.

Similar content being viewed by others

Notes

  1. Charles H. Spurgeon, English preacher of 19th century 1834–1892.

References

  • Alonso, G., Casati, F., Kuno, H., and Machiraju, V. 2004. Enterprise Application Integration, Web Services — Concepts, Architectures and Applications, Springer, pp. 67–92.

  • Berman, F., Fox, G., and Hey, A.J.G. (Eds.) 2003. Grid Computing: Making the Global Infrastructure a Reality. John Wiley & Sons.

  • Buschmann, F. et al. 1996. Pattern-Oriented Software Architecture: A System of Patterns. Wiley.

  • Cannataro, M., Congiusta, A., Mastroianni, C., Pugliese, A., Talia, D., and Trunfio, P. 2004. Grid-based data mining and knowledge discovery. In N. Zhong and J. Liu (Eds.), Intelligent Technologies for Information Analysis. Springer-Verlag, pp. 19–45.

  • Congiusta, A., Pugliese, A., Talia, D., and Trunfio, P. 2003. Designing grid services for distributed knowledge discovery. Web Intelligence and Agent Systems: An International Journal, 1:91–104.

    Google Scholar 

  • Curcin, V., Ghanem, M., Guo, Y., Köhler, M., Rowe, A., Syed, J., and Wendel, P. 2002. Discovery net: towards a grid of knowledge discovery. Proc. 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 658–663.

  • Deelman, E., Blythe, J., Gil, Y., and Kesselman, C. 2003. Workflow management in GriPhyN. In J. Nabrzyski et al. (Eds.) Grid Resource Management. Kluwer Academic Publishers, pp. 99–116.

  • Detlor, B. 2004. Towards Knowledge Portals: From Human Issues to Intelligent Agents, Information Science and Knowledge Management. Kluwer Academic Publishers.

  • Foster, I. and Kesselman, C. 1997. Globus: A metacomputing infrastructure toolkit. The International Journal of Supercomputer Applications and High Performance Computing, 11(2):115–128.

    Google Scholar 

  • Foster, I. and Kesselman, C. 1999. The Grid: Blueprint for a Future Computing Infrastructure, 1st edition. Morgan Kaufmann.

  • Foster, I. and Kesselman, C. 2003a. The Grid: Blueprint for a Future Computing Infrastructure, 2nd edition. Morgan Kaufmann.

  • Foster, I. and Kesselman, C. 2003b. Concepts and architecture. In I. Foster and C. Kesselman (Eds.) The Grid 2: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, pp. 37–64.

  • Foster, I., Kesselman, C., and Tuecke, S. 2001. The anatomy of the grid: Enabling scalable virtual organization. International Journal of High Performance Computing Application, 15(3):200–222.

    Google Scholar 

  • Fowler, M. 2003. Patterns of Enterprise Application Architecture. Addison Wesley Professional.

  • Gil, Y., Deelman, E., Blythe, J., Kesselman, C., and Tangmunarunkit, H. 2004. Artificial intelligence and grids: Workflow planning and beyond. IEEE Intelligent Systems, Special Issue on e-Science, 19(1):26–33.

    Google Scholar 

  • Giudici, P. 2001. Association models for web mining. Data Mining and Knowledge Discovery, 5:183–196.

    Google Scholar 

  • Hackathorn, R.D. 1998. Web Farming for the Data Warehouse. Morgan Kaufmann.

  • Hall, M. 2001. More Servlets and JavaServer Pages. Sun Microsystems Press.

  • Hall, M. and Brown, L. 2001. Core Web Programming, 2nd edition Sun Microsystems Press.

  • Han, J.W. and Chang, K. Ch. 2002. Data mining for web intelligence. IEEE Computer, 35(11):64–70, 2002.

    Google Scholar 

  • Hu, J. and Zhong, N. 2004. Organizing dynamic multi-level workflows on multi-layer grids for developing e-business portals. Proc. 2004 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 777–778.

  • Hu, J. and Zhong, N. 2005. Developing e-business portals with dynamic multi-level workflows on the multi-layer grid. Proc. 2005 IEEE International Conference on e-Technology, e-Commerce and e-Service, pp. 196–201.

  • Huang, J.J., Liu, Ch. N., Ou, Ch. X., Zhong, N., and Yao, Y. Y. 2003. Attribute reduction of rough sets in mining market value functions. Proc. 2003 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 470–473.

  • Hunter, J. 2001. Java Servlet Programming, 2nd edition O'Reilly.

  • Ling, Ch. X. and Li, Ch. H. 1998. Data mining for direct marketing: Problems and solutions. Proc. 4th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 73–79.

  • Liu, J. 2003. Web intelligence (WI): What makes wisdom web?. Proc. 18th International Joint Conference on Artificial Intelligence, pp. 1596–1601.

  • Liu, J., Zhang, Sh. W., and Yang, J. 2004. Characterizing web usage regularities with information foraging agents. IEEE Transactions on Knowledge and Data Engineering, 16(5):566–584.

    Google Scholar 

  • Maier, R. 2004. Knowledge Management Systems: Information and Communication Technologies for Knowledge Management, 2nd edition. Springer-Verlag.

  • Nabrzyski, J., Schopf, J.M., and Weglarz, J. 2004. Grid Resource Management: State of the Art and Future Trends. Kluwer Publishing.

  • Preuner, G. and Schrefl, M. 2000. A three-level schema architecture for the conceptual design of web-based information systems. World Wide Web, 3(2):125–138.

    Google Scholar 

  • Preuner, G. and Schrefl, M. 2003. Integration of web services into workflows through a multi-level schema architecture. Proc. 4th IEEE International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, pp. 51–60.

  • Priebe, T. and Pernul, G. 2003. Towards integrative enterprise knowledge portals. Proc. 12th International Conference on Information and Knowledge Management, pp. 216–223.

  • Pyle, D. 2003. Business Modeling and Data Mining. Morgan Kaufmann.

  • Stork, H.G. 2002. Webs, grids and knowledge spaces: Programmes, projects and prospects. Journal of Universal Computer Science, 8(9):848–867.

    Google Scholar 

  • The OGSA-DAI project: http://www.ogsadai.org.uk/.

  • Tomita, K., Zhong, N., and Yamauchi, H. 2004. Coupling global semantic web with local information sources for problem solving. Proc. 1st International Workshop on Semantic Web Mining and Reasoning, pp. 66–74.

  • Wege, C. 2002. Portal server technology. IEEE Internet Computing, 6(3):73–77.

    Google Scholar 

  • Wu, X. and Zhang, S. 2003. Synthesizing high-frequency rules from different data sources. IEEE Transactions on Knowledge and Data Engineering, 15(2):353–367.

    Google Scholar 

  • Xu, M., Hu, Zh. H., Long, W.H., and Liu, W. 2003. Service virtualization: Infrastructure and applications. I. Foster and C. Kesselman (Eds.) The Grid: Blueprint for a Future Computing Infrastructure, 2nd. Morgan Kaufmann, pp. 179–189.

  • Zhang S., Wu, X., and Zhang, C. 2003. Multi-database mining. IEEE Computational Intelligence Bulletin, 2(1):5–13.

    Google Scholar 

  • Zhong, N., Liu, Ch. N., and Ohsuga, S. 2001. Dynamically organizing KDD processes. International Journal of Pattern Recognition and Artificial Intelligence, World Scientific, 15(3):451–473.

    Google Scholar 

  • Zhong, N. 2004. Developing intelligent portals by using WI technologies. J.P. Li et al. (Eds.) Wavelet Analysis and Its Applications, and Active Media Technology. World Scientific, 2, pp. 555–567.

  • Zhong, N., Hu, J., and Motomura, S. 2005. Building a data mining grid for multiple human brain data analysis. Computational Intelligence, An International Journal, 21(2):177–196.

    Google Scholar 

  • Zhong, N. and Liu, J. 2004. The alchemy of intelligent IT (iIT): Blueprint for future of information technology. N. Zhong and J. Liu (Eds.) Intelligent Technologies for Information Analysis, Springer Monograph, pp. 1–16.

  • Zhong, N., Ohara, H., Iwasaki, T., and Yao, Y.Y. 2003a. Using WI technology to develop intelligent enterprise portals. Proc. International Workshop on Applications, Products and Services of Web-based Support Systems, pp. 83–90.

  • Zhong, N., Yao, Y.Y., Liu, Ch. N., Ou, Ch. X., and Huang, J.J. 2004. Data mining for targeted marketing. N. Zhong and J. Liu (Eds.) Intelligent Technologies for Information Analysis, Springer-Verlag, pp. 109–131.

  • Zhong, N., Yao, Y.Y., and Ohshima M. 2003b. Peculiarity oriented multidatabase mining. IEEE Transactions on Knowledge and Data Engineering, 15(4):952–960.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jia Hu.

Additional information

Note

Footnote 1

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hu, J., Zhong, N. Organizing Multiple Data Sources for Developing Intelligent e-Business Portals. Data Min Knowl Disc 12, 127–150 (2006). https://doi.org/10.1007/s10618-005-0018-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10618-005-0018-2

Keywords

Navigation