Abstract
Updating and retrieving location-based data is an important problem in Location-Based Service (LBS) applications. The Web is a valuable pool of location-based information, which can be retrieved and extracted on the basis of the corresponding postal addresses. This paper proposes an information extraction method that helps collect location-based information from the Web automatically. The method combines an ontology-based conceptual information retrieval approach with graph matching techniques. Experimental evaluation shows that the method achieves high recall and precision.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cai, W., Wang, S., Jiang, Q. (2005). Address Extraction: Extraction of Location-Based Information from the Web. In: Zhang, Y., Tanaka, K., Yu, J.X., Wang, S., Li, M. (eds) Web Technologies Research and Development - APWeb 2005. APWeb 2005. Lecture Notes in Computer Science, vol 3399. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31849-1_88
DOI: https://doi.org/10.1007/978-3-540-31849-1_88
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25207-8
Online ISBN: 978-3-540-31849-1
eBook Packages: Computer Science, Computer Science (R0)