Abstract
Hashtags, which started to be widely used since 2007, are always utilized to mark keywords in tweets to categorize messages and form conversation for topics in Twitter. However, it is hard for users to use hashtags for sharing their opinions/interests/comments for their interesting topics. In this paper, we present a new approach for recommending news-topic oriented hashtags to help Twitter users easily join the conversation about news topics in Twitter. We first detect topic-specific informative words co-occurring with a given target word, which we call characteristic co-occurrence words, from news articles to form a vector for representing the news topic. Then by creating a hashtag vector based on tweets with the same hashtag, we calculate the similarity between these two vectors and recommend hashtags of high similarity scores with the news topic. Experimental results show that our approach could recommend hashtags which are highly relevant to the news topics, helping users share their tweets with others in Twitter.
Keywords
- Social Media
- hashtags
- tweet
- characteristic co-occurrence word
- clustering
- news topic
Download conference paper PDF
References
Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: Proc. of the 19th Int. Conf. on World Wide Web, Raleigh, North Carolina, USA (2010)
Brooks, C.H., Montanez, N.: Improved Annotation of the Blogosphere via Autotagging and Hierarchical Clustering. In: Proc. of the 15th Int. Conf. on World Wide Web, Edinburgh, UK (2006)
Mishne, G.: AutoTag: A Collaborative Approach to Automated Tag Assignment for Weblog Posts. In: Proc. of the 15th Int. Conf. on World Wide Web, Edinburgh, UK (2006)
Sood, S.C., Owsley, S.H., Hammond, K.J., Birnbaum, L.: TagAssist: Automatic Tag Suggestion for Blog Posts. In: Int. Conf. on Weblogs and Social Media (2007)
Singhal, A., Buckley, C., Mitra, M.: Pivoted Document Length Normalization. In: 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 1996), pp. 21–29. ACM (1996)
Sigurbjornsson, B., van Zwol, R.: Flickr Tag Recommendation based on Collective Knowledge. In: Proc. of the 17th Int. Conf. on World Wide Web, Beijing, China (2008)
Wartena, C., Brussee, R., Wibbels, M.: Using Tag Co-Occurrence for Recommendation. In: Proc. of Int. Conf. on Intelligent System Design and Application (ISDA 2009), Pisa, Italy (November 2009)
Belem, F., Martins, E., Pontes, T., Almeida, J., Goncalces, M.: Associative Tag Recommendation Exploiting Multiple Textual Features. In: Proc. of the 34th Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, Beijing, China (July 2011)
Huang, J., Thornton, K.M., Efthimiadis, E.N.: Conversational Tagging in Twitter. In: Proc. of the 21st ACM Conf. on Hypertext and Hypermedia, Toronto, Ontario, Canada (2010)
Weng, J., Lim, E.-P., He, Q., Leung, C.W.-K.: What Do People Want in Microblogs? Measuring Interestingness of Hashtags in Twitter. In: Proc. of the 2010 IEEE Int. Conf. on Data Mining, ICDM 2010, pp. 1121–1126 (2010)
Correa, D., Sureka, A.: Mining Tweets for Tag Recommendation on Social Media. In: Proc. of the 3rd Int. Workshop on Search and Mining User-Generated Contents, SMUC 2011, Glasgow, Scotland, UK (2011)
Efron, M.: Hashtag Retrieval in a Microblogging Environment. In: Proc. of the 33rd Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, ACM SIGIR 2010, Geneva, Switzerland (2010)
Wagner, C., Strohmaier, M.: The Wisdom in Tweetonomies: Acquiring Latent Conceptual Structures from Social Awareness Streams. In: Proc. of the 3rd International Semantic Search Workshop, p. 6. ACM (2010)
Zangerle, E., Gassler, W., Specht, G.: Recommending #-Tags in Twitter. In: Proc. of the Workshop on Semantic Adaptive Social Web, UMAP 2011, Gerona, Spain (2011)
Phuvipadawat, S., Murata, T.: Detecting a Multi-Level Content Similarity from Microblogs Based on Community Structures and Named Entities. Journal of Emerging Technologies in Web Intelligence 3(1) (February 2011)
Sankaranarayanan, J., Samet, H., Heitler, B.E., Lieberman, M.D., Sperling, J.: TwitterStand: News in Tweets. In: Proc. of the 17th ACM SIGSPATIAL Int. Conf. on Advances in Geographic Information System, ACM GIS, Seattle, WA, USA (November 2009)
Phelan, O., McCarthy, K., Smyth, B.: Using Twitter to Recommend Real-Time Topical News. In: Proc. of the 3rd ACM Conf. on Recommender Systems, ACM RecSys, New York, NY, USA (October 2009)
Abel, F., Gao, Q., Houben, G.-J., Tao, K.: Semantic Enrichment of Twitter Posts for User Profile Construction on the Social Web. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., De Leenheer, P., Pan, J. (eds.) ESWC 2011, Part II. LNCS, vol. 6644, pp. 375–389. Springer, Heidelberg (2011)
Salton, G., Wong, A., Yang, C.S.: A Vector Space Model for Automatic Indexing. Communications of the ACM 18(11), 613–620 (1975)
Xiao, F., Noro, T., Tokuda, T.: Detection of Characteristic Co-Occurrence Words from News Articles on the Web. In: 21st European-Japanese Conference on Information Modelling and Knowledge Base, vol. 1, pp. 242–258 (June 2011)
Twitter Streaming API, https://dev.twitter.com/docs/streaming-api
Schmid, H.: Probabilistic Part-of-Speech Tagging Using Decision Trees. In: First International Conference on New Methods in Natural Language Processing, pp. 44–49 (1994)
Stanford Named Entity Recognizer, http://nlp.stanford.edu/software/CRF-NER.shtml
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xiao, F., Noro, T., Tokuda, T. (2012). News-Topic Oriented Hashtag Recommendation in Twitter Based on Characteristic Co-occurrence Word Detection. In: Brambilla, M., Tokuda, T., Tolksdorf, R. (eds) Web Engineering. ICWE 2012. Lecture Notes in Computer Science, vol 7387. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31753-8_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-31753-8_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31752-1
Online ISBN: 978-3-642-31753-8
eBook Packages: Computer ScienceComputer Science (R0)
