Abstract
Business Intelligence (BI) requires the acquisition and aggregation of key pieces of knowledge from multiple sources in order to provide valuable information to customers or feed statistical BI models and tools. The massive amount of information available to business analysts makes information extraction and other natural language processing tools key enablers for the acquisition and use of that semantic information. We describe the application of ontology-based extraction and merging in the context of a practical e-business application for the EU MUSING Project where the goal is to gather international company intelligence and country/region information. The results of our experiments so far are very promising and we are now in the process of building a complete end-to-end solution.
Chapter PDF
Similar content being viewed by others
References
Alani, H., Dasmahapatra, S., Gibbins, N., Glaser, H., Harris, S., Kalfoglou, Y., O’Hara, K., Shadbolt, N.: Managing Reference: Ensuring Referential Integrity of Ontologies for the Semantic Web. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, pp. 317–334. Springer, Heidelberg (2002)
Artiles, J., Gonzalo, J., Sekine, S.: The SemEval-2007 WePS Evaluation: Establishing a benchmark for Web People Search Task. In: Proceedings of Semeval 2007, Association for Computational Linguistics (2007)
Aswani, N., Bontcheva, K., Cunningham, H.: Mining information for instance unification. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L. (eds.) ISWC 2006. LNCS, vol. 4273, Springer, Heidelberg (2006)
Bagga, A., Baldwin, B.: Entity-Based Cross-Document Coreferencing Using the Vector Space Model. In: COLING-ACL 1998. Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and the 17th International Conference on Computational Linguistics, pp. 79–85 (1998)
Baumgartner, R., Frlich, O., Gottlob, G., Harz, P., Herzog, M., Lehmann, P.: Web data extraction for business intelligence: the lixto approach. In: Proc. of BTW 2005 (2005)
Bontcheva, K., Cunningham, H.: The semantic web: A new opportunity and challenge for human language technology. In: Cunningham, H., Ding, Y., Kiryakov, A. (eds.) Proceedings of Workshop on Human Language Technology for the Semantic Web and Web Services, 2nd International Semantic Web Conference, Sanibel Island, Florida, October 2003 (2003), http://www.gate.ac.uk/sale/iswc03/iswc03.pdf
Chen, Y., Martin, J.H.: Cu-comsem: Exploring rich features for unsupervised web personal named disambiguation. In: Proceedings of SemEval 2007, Assocciation for Computational Linguistics, pp. 125–128 (2007)
Chinchor, N.: Muc-4 evaluation metrics. In: Proceedings of the Fourth Message Understanding Conference, pp. 22–29 (1992)
Chung, W., Chen, H., Nunamaker Jr., J.F.: Business Intelligence Explorer: A Knowledge Map Framework for Discovering Business Intelligence on the Web. In: Hawaii International Conference on System Sciences, IEEE Computer Society Press, Los Alamitos (2003)
Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In: ACL 2002. Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (2002)
Dean, M., Schreiber, G., Bechhofer, S., van Harmelen, F., Hendler, J., Horrocks, I., McGuinness, D.L., Patel-Schneider, P.F., Stein, L.A.: OWL web ontology language reference. In: W3C recommendation, W3C (February 2004), http://www.w3.org/TR/owl-ref/
Nie, J.-Y., Paradis, F., Tajarobi, A.: Discovery of business opportunities on the internet with information extraction. In: IJCAI 2005. Workshop on Multi-Agent Information Retrieval and Recommender Systems, Edinburgh, Scotland, pp. 47–54 (2005)
Hotho, A., Staab, S., Stumme, G.: WordNet improves text document clustering. In: Proc. of the SIGIR 2003 Semantic Web Workshop (2003)
Li, Y., Bontcheva, K., Cunningham, H.: An SVM Based Learning Algorithm for Information Extraction. Machine Learning Workshop, Sheffield (2004), http://gate.ac.uk/sale/ml-ws04/mlw2004.pdf
Majocchi, A., Strange, R.: The FDI Location Decision: does Liberalisation Matter? Transactional Corporation Review (to appear, 2007)
Marshall, A., McDonald, D., Chen, H., Chung, W.: EBizPort: Collecting and Analysing Business Intelligence Iformation. Journal of the American Society for Information Science and Technology 55(10), 873–891 (2004)
Maynard, D., Saggion, H., Yankova, M., Bontcheva, K., Peters, W.: natural language technology for information integration in business intelligence. In: Abramowicz, W. (ed.) 10th International Conference on Business Information Systems, Poland, pp. 25–27 (April 2007), http://gate.ac.uk/sale/bis07/musing-bis07-final.pdf
Maynard, D., Tablan, V., Ursu, C., Cunningham, H., Wilks, Y.: Named Entity Recognition from Diverse Text Types. In: Recent Advances in Natural Language Processing 2001 Conference, Tzigov Chark, Bulgaria, pp. 257–274 (2001)
Maynard, D., Yankova, M., Kourakis, A., Kokossis, A.: Ontology-based information extraction for market monitoring and technology watch. In: ESWC Workshop End User Apects of the Semantic Web, Heraklion, Crete (2005)
Phan, X.-H., Nguyen, L.-M., Horiguchi, S.: Personal name resolution crossover documents by a semantics-based approach. In: IEICE Trans. Inf. & Syst. (February 2006)
Saggion, H.: Shef: Semantic tagging and summarization techniques applied to cross-document coreference. In: Proceedings of SemEval 2007, Assocciation for Computational Linguistics, pp. 292–295 (2007)
van Rijsbergen, C.J.: Information Retrieval, Butterworths, London (1979)
Yangarber, R., Grishman, R., Tapanainen, P., Huttunen, S.: Unsupervised Discovery of Scenario-level Patterns for Information Extraction. In: Proceedings of ANLP-NAACL 2000, Seattle, WA (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Saggion, H., Funk, A., Maynard, D., Bontcheva, K. (2007). Ontology-Based Information Extraction for Business Intelligence. In: Aberer, K., et al. The Semantic Web. ISWC ASWC 2007 2007. Lecture Notes in Computer Science, vol 4825. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76298-0_61
Download citation
DOI: https://doi.org/10.1007/978-3-540-76298-0_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76297-3
Online ISBN: 978-3-540-76298-0
eBook Packages: Computer ScienceComputer Science (R0)