Abstract
Twitter users post observations about their immediate environment as a part of the 500 million tweets posted everyday. As such, Twitter can become the source for invaluable information about objects, locations, and events, which can be analyzed and monitored in real time, not only to understand what is happening in the world, but also an event’s exact location. However, Twitter data is noisy as sensory values, and information such as the location of a tweet may not be available, e.g., only 0.9 % of tweets have GPS data. Due to the lack of accurate and fine-grained location information, existing Twitter event monitoring systems focus on city-level or coarser location identification, which cannot provide details for local events. In this paper, we propose SNAF (Sense and Focus), an event monitoring system for Twitter data that emphasizes local events. We increase the availability of the location information significantly by finding locations in tweet messages and users’ past tweets. We apply data cleaning techniques in our system, and with extensive experiments, we show that our method can improve the accuracy of location inference by 5 % to 20 % across different error ranges. We also show that our prototype implementation of SNAF can identify critical local events in real time, in many cases earlier than news reports.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., et al. (eds.) The Semantic Web. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007)
Branch, J., Szymanski, B., Giannella, C., Wolff, R., Kargupta, H.: In-network outlier detection in wireless sensor networks. In: Proceedings of the 26th IEEE International Conference on Distributed Computing Systems (2006)
Daiber, J., Jakob, M., Hokamp, C., Mendes, P.N.: Improving efficiency and accuracy in multilingual entity extraction. In: Proceedings of the 9th International Conference on Semantic Systems (2013)
Graham, M., Hale, S.A., Gaffney, D.: Where in the world are you? Geolocation and language identification in twitter. Prof. Geogr. 66(4), 568–578 (2014)
Hong, L., Ahmed, A., Gurumurthy, S., Smola, A.J., Tsioutsiouliklis, K.: Discovering geographical topics in the twitter stream. In: Proceedings of the 21st International World Wide Web Conference (2012)
Ikawa, Y., Enoki, M., Tatsubori, M.: Location inference using microblog messages. In: Proceedings of the 21st International World Wide Web Conference Companion (2012)
Jeffery, S.R., Alonso, G., Franklin, M.J., Hong, W., Widom, J.: Declarative support for sensor data cleaning. In: Fishkin, K.P., Schiele, B., Nixon, P., Quigley, A. (eds.) PERVASIVE 2006. LNCS, vol. 3968, pp. 83–100. Springer, Heidelberg (2006)
Jurka, T.P., Collingwood, L., Boydstun, A., Grossman, E., van Atteveldt, W.: Rtexttools: Automatic text classification via supervised learning. R package version 1.3, 9 (2012)
Knox, E.M., Ng, R.T.: Algorithms for mining distance based outliers in large datasets. In: Proceedings of 24th International Conference on Very Large Data Bases (1998)
Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (1995)
Lee, p., Lakshmanan, L.V., Milios, E.E.: Incremental cluster evolution tracking from highly dynamic network data. In: Proceedings of the 30th International Conference on Data Engineering (2014)
Li, C., Sun, A.: Fine-grained location extraction from tweets with temporal awareness. In: Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval (2014)
Li, R., Lei, K.H., Khadiwala, R., Chang, K.-C.: Tedas: a twitter-based event detection and analysis system. In: Proceedings of 28th International Conference on Data Engineering (2012)
Lingad, J., Karimi, S., Yin, J.: Location extraction from disaster-related microblogs. In: Proceedings of the 22nd International World Wide Web Conference Companion (2013)
McMinn, A.J., Moshfeghi, Y., Jose, J.M.: Building a large-scale corpus for evaluating event detection on twitter. In: Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, pages 409–418. ACM (2013)
Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes Twitter users: Real-time event detection by social sensors. In: Proceedings of the 19th International World Wide Web Conference (2010)
Sakaki, T., Okazaki, M., Matsuo, Y.: Tweet analysis for real-time event detection and earthquake reporting system development. IEEE Trans. Knowl. Data Eng. 25(4), 919–931 (2013)
Schulz, A., Hadjakos, A., Paulheim, H., Nachtwey, J., Mühlhäuser, M.: A multi-indicator approach for geolocalization of tweets. In: Proceedings of the Seventh International Conference on Weblogs and Social Media (2013)
Sheng, B., Li, Q., Mao, W., Jin, W.: Outlier detection in sensor networks. In: Proceedings of the 8th ACM International Symposium on Mobile Ad Hoc Networking and Computing (2007)
Subramaniam, S., Palpanas, T., Papadopoulos, D., Kalogeraki, V., Gunopulos, D.: Online outlier detection in sensor data using non-parametric models. In: Proceedings of the 32nd International Conference on Very Large Data Bases (2006)
Unankard, S., Li, X., Sharaf, M., Zhong, J., Li, X.: Predicting elections from social networks based on sub-event detection and sentiment analysis. In: Benatallah, B., Bestavros, A., Manolopoulos, Y., Vakali, A., Zhang, Y. (eds.) WISE 2014, Part II. LNCS, vol. 8787, pp. 1–16. Springer, Heidelberg (2014)
Unankard, S., Li, X., Sharaf, M.A.: Emerging event detection in social networks with location sensitivity. World Wide Web J. (2015, in press)
Wen, Y.-J., Agogino, A.M., Goebel, K.: Fuzzy validation and fusion for wireless sensor networks. In: Proceedings of the ASME International Mechanical Engineering Congress (2004)
Zhang, Y., Meratnia, N., Havinga, P.: Outlier detection techniques for wireless sensor networks: A survey. IEEE Commun. Surv. Tutor. 12(2), 159–170 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Zhang, Y., Szabo, C., Sheng, Q.Z. (2015). Sense and Focus: Towards Effective Location Inference and Event Detection on Twitter. In: Wang, J., et al. Web Information Systems Engineering – WISE 2015. WISE 2015. Lecture Notes in Computer Science(), vol 9418. Springer, Cham. https://doi.org/10.1007/978-3-319-26190-4_31
Download citation
DOI: https://doi.org/10.1007/978-3-319-26190-4_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26189-8
Online ISBN: 978-3-319-26190-4
eBook Packages: Computer ScienceComputer Science (R0)