Abstract
As a successful micro-blogging service, Twitter has demonstrated unprecedented popularity and international reach. Location extraction from micro-blogs (tweets) on this domain is an important challenge and can harness noisy but rich contents. Extracting location information can enable a variety of applications such as query-by-location, local advertising, crises awareness and also systems designed to provide information about events, points of interests (POIs) and landmarks. Considering the high throughput rate in Twitter space, we propose an approach to detect location-oriented phrases solely relying on tweet contents. The system finds associated phrases dedicated to each specific scalable geographical area. We have evaluated our approach based on real-world Twitter dataset from Australia. We conducted a comprehensive comparison between strong local terms (uni-word) and phrases (multi-words). Our experiments verify the system’s capabilities using multiple trending baselines and demonstrate that our phrase based approach can better specify locality instead of words.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Amitay, E., Har’El, N., Sivan, R., Soffer, A.: Web-a-where: geotagging web content. In: SIGIR, pp. 273–280 (2004)
Backstrom, L., Kleinberg, J.M., Kumar, R., Novak, J.: Spatial variation in search engine queries. In: WWW, pp. 357–366 (2008)
Chang, H.-W., Lee, D., Eltaher, M., Lee, J.: @phillies tweeting from philly? predicting twitter user locations with spatial word usage. In: ASONAM, pp. 111–118 (2012)
Cheng, Z., Caverlee, J., Lee, K.: You are where you tweet: a content-based approach to geo-locating twitter users. In: CIKM, pp. 759–768 (2010)
Cheng, Z., Caverlee, J., Lee, K.: A content-driven framework for geolocating microblog users. ACM TIST 2, 1–2 (2013)
Fink, C., Piatko, C.D., Mayfield, J., Finin, T., Martineau, J.: Geolocating blogs from their textual content. In: AAAI Spring Symposium: Social Semantic Web: Where Web 2.0 Meets Web 3.0, pp. 25–26 (2009)
Hecht, B., Hong, L., Suh, B., Chi, E.H.: Tweets from justin bieber’s heart: the dynamics of the location field in user profiles. In: CHI, pp. 237–246 (2011)
Kelm, P., Schmiedeke, S., Sikora, T.: Multi-modal, multi-resource methods for placing flickr videos on the map. In: ICMR, pp. 52:1–52:8 (2011)
Larson, M., Soleymani, M., Serdyukov, P., Rudinac, S., Wartena, C., Murdock, V., Friedland, G., Ordelman, R., Jones, G.J.F.: Automatic tagging and geotagging in video collections and communities. In: ICMR, pp. 51:1–51:8 (2011)
Li, C., Weng, J., He, Q., Yao, Y., Datta, A., Sun, A., Lee, B.-S.: Twiner: named entity recognition in targeted twitter stream. In: SIGIR, pp. 721–730 (2012)
Liu, X., Zhang, S., Wei, F., Zhou, M.: Recognizing named entities in tweets. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT 2011, vol. 1, pp. 359–367. Association for Computational Linguistics, Stroudsburg (2011)
McCallum, A., Nigam, K.: A comparison of event models for naive bayes text classification. In: AAAI 1998 Workshop on Learning for Text Categorization, pp. 41–48. AAAI Press (1998)
Ritter, A., Clark, S., Mausam, Etzioni, O.: Named entity recognition in tweets: An experimental study. In: EMNLP, pp. 1524–1534. ACL (2011)
Serdyukov, P., Murdock, V., van Zwol, R.: Placing flickr photos on a map. In: SIGIR, pp. 484–491 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Hosseini, S., Unankard, S., Zhou, X., Sadiq, S. (2014). Location Oriented Phrase Detection in Microblogs. In: Bhowmick, S.S., Dyreson, C.E., Jensen, C.S., Lee, M.L., Muliantara, A., Thalheim, B. (eds) Database Systems for Advanced Applications. DASFAA 2014. Lecture Notes in Computer Science, vol 8421. Springer, Cham. https://doi.org/10.1007/978-3-319-05810-8_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-05810-8_33
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05809-2
Online ISBN: 978-3-319-05810-8
eBook Packages: Computer ScienceComputer Science (R0)