Abstract
The r/subsimulatorGPT2 is an online forum in which all posts and comments are generated through automation using a fine-tuned version of the GPT-2 language model developed using OpenAI. This subreddit has been created and is moderated by the Reddit user u/disumbrationist (Disumbrationist in What is R/subsimulatorgpt2? 2020). The intricate language model employed here results in very coherent, highly realistic, simulated content. Approximately, 1,00,000 users subscribe to the forum; however, the posting of content and comments is done only by bots trained by the moderator. The process of training a bot includes the machine learning of 5 lac comments from a particular subreddit, which then sorts those comments through their popularity on those respective subreddits. The casual employment of language model machine learning on a popular website like Reddit is unique and important. This study examines the comments and a title posted by the bots and tests them on a sample of 126 participants, where the survey places posts generated by human users versus posts created by bots. The study tries to gauge whether popular and public, social media websites like Reddit have the potential to host content that can be entirely generated through artificial intelligence and whether this content can be distinguished by the layperson. The study opens up vast avenues which indicate that automation during industry 4.0 would include huge sections of online information in casual settings that will purely be generated through machine learning. It also questions the authenticity of online found text and ponders upon its implications in the rapidly approaching technological change.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kaplan, A.M., Haenlein, M.: Users of the world, unite! The challenges and opportunities of social media. Bus. Horiz. 53(1), 59–68 (2010)
Luxton, D.D., June, J.D., Fairall, J.M.: Social media and suicide: a public health perspective. Am. J. Public Health 102(S2), S195–S200 (2012)
Stieglitz, S., Brachten, F., Ross, B., Jung, A.K.: Do social bots dream of electric sheep? A categorisation of social media bot accounts. Preprint at arXiv:1710.04044 (2017)
Xu, A., Liu, Z., Guo, Y., Sinha, V., Akkiraju, R.: A new chatbot for customer service on social media. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 3506–3510 (2017)
Oentaryo, R.J., Murdopo, A., Prasetyo, P.K., Lim, E.P.: On profiling bots in social media. In: International Conference on Social Informatics, pp. 92–109. Springer, Cham (2016)
Veale, T., Valitutti, A., Li, G.: Twitter: The best of bot worlds for automated wit. In: International Conference on Distributed, Ambient, and Pervasive Interactions, pp. 689–699. Springer, Cham (2015)
Wilkie, A., Michael, M., Plummer-Fernandez, M.: Speculative method and Twitter: bots, energy and three conceptual characters. The Sociological Review 63(1), 79–101 (2015)
Adewole, K.S., Anuar, N.B., Kamsin, A., Varathan, K.D., Razak, S.A.: Malicious accounts: dark of the social networks. J. Netw. Comput. Appl. 79, 41–67 (2017)
Wang, A.H.: Detecting spam bots in online social networking sites: a machine learning approach. In: IFIP Annual Conference on Data and Applications Security and Privacy, pp. 335–342. Springer, Berlin (2010)
Subrahmanian, V., Azaria, A., Durst, S., Kagan, V., Galstyan, A., Lerman, K., Zhu, L., Ferrara, E., Flammini, A., Menczer, F.: The DARPA Twitter bot challenge. Computer 49(6), 38–46 (2016)
Alarifi, A., Alsaleh, M., Al-Salman, A.: Twitter during test: identifying social machines. Inf. Sci. 372, 332–346 (2016)
Alsaleh, M., Alarifi, A., Al-Salman, A.M., Alfayez, M., Almuhaysin, A.: TSD: detecting Sybil accounts in Twitter. In: 2014 13th International Conference on Machine Learning and Applications. IEEE, pp. 463–469 (2014).
Wang, G., Mohanlal, M., Wilson, C., Wang, X., Metzger, M., Zheng, H., Zhao, B.Y.: Social during tests: Crowdsourcing Sybil detection. Preprint at arXiv:1205.3856 (2012)
Disumbrationist: What is R/subsimulatorgpt2? Reddit (2020). Retrieved 2 March 2022, from https://old.reddit.com/r/SubSimulatorGPT2/comments/btfhks/what_is_rsubsimulatorgpt2/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
Questionnaire Survey: https://forms.gle/8UKNB4HKxFStzUvYA.
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Solanki, C. (2023). “This Has Been Written by a Bot”: A Bot Detection Study of the SubsimulatorGPT2 Subreddit. In: Chakrabarti, A., Singh, V. (eds) Design in the Era of Industry 4.0, Volume 1. ICORD 2023. Smart Innovation, Systems and Technologies, vol 343. Springer, Singapore. https://doi.org/10.1007/978-981-99-0293-4_95
Download citation
DOI: https://doi.org/10.1007/978-981-99-0293-4_95
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-0292-7
Online ISBN: 978-981-99-0293-4
eBook Packages: EngineeringEngineering (R0)