“This Has Been Written by a Bot”: A Bot Detection Study of the SubsimulatorGPT2 Subreddit

  • Conference paper
Design in the Era of Industry 4.0, Volume 1 (ICORD 2023)

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 343)

Abstract

r/subsimulatorGPT2 is an online forum in which all posts and comments are generated automatically by a fine-tuned version of the GPT-2 language model developed by OpenAI. The subreddit was created and is moderated by the Reddit user u/disumbrationist (Disumbrationist in What is R/subsimulatorgpt2? 2020). The language model employed produces coherent, highly realistic simulated content. Approximately 100,000 users subscribe to the forum; however, only bots trained by the moderator post content and comments. Each bot is trained on 500,000 comments from a particular subreddit, sorted by their popularity on that subreddit. Such casual use of a machine-learned language model on a popular website like Reddit is unique and important. This study takes comments and a title posted by the bots and tests them on a sample of 126 participants, in a survey that sets posts written by human users against posts created by bots. The study tries to gauge whether popular, public social media websites like Reddit have the potential to host content generated entirely by artificial intelligence, and whether a layperson can distinguish this content from human-written text. It opens avenues for further inquiry and indicates that automation in Industry 4.0 will include large sections of online information in casual settings being generated purely through machine learning. It also questions the authenticity of text found online and considers its implications amid rapidly approaching technological change.

References

  1. Kaplan, A.M., Haenlein, M.: Users of the world, unite! The challenges and opportunities of social media. Bus. Horiz. 53(1), 59–68 (2010)

  2. Luxton, D.D., June, J.D., Fairall, J.M.: Social media and suicide: a public health perspective. Am. J. Public Health 102(S2), S195–S200 (2012)

  3. Stieglitz, S., Brachten, F., Ross, B., Jung, A.K.: Do social bots dream of electric sheep? A categorisation of social media bot accounts. Preprint at arXiv:1710.04044 (2017)

  4. Xu, A., Liu, Z., Guo, Y., Sinha, V., Akkiraju, R.: A new chatbot for customer service on social media. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 3506–3510 (2017)

  5. Oentaryo, R.J., Murdopo, A., Prasetyo, P.K., Lim, E.P.: On profiling bots in social media. In: International Conference on Social Informatics, pp. 92–109. Springer, Cham (2016)

  6. Veale, T., Valitutti, A., Li, G.: Twitter: The best of bot worlds for automated wit. In: International Conference on Distributed, Ambient, and Pervasive Interactions, pp. 689–699. Springer, Cham (2015)

  7. Wilkie, A., Michael, M., Plummer-Fernandez, M.: Speculative method and Twitter: bots, energy and three conceptual characters. The Sociological Review 63(1), 79–101 (2015)

  8. Adewole, K.S., Anuar, N.B., Kamsin, A., Varathan, K.D., Razak, S.A.: Malicious accounts: dark of the social networks. J. Netw. Comput. Appl. 79, 41–67 (2017)

  9. Wang, A.H.: Detecting spam bots in online social networking sites: a machine learning approach. In: IFIP Annual Conference on Data and Applications Security and Privacy, pp. 335–342. Springer, Berlin (2010)

  10. Subrahmanian, V., Azaria, A., Durst, S., Kagan, V., Galstyan, A., Lerman, K., Zhu, L., Ferrara, E., Flammini, A., Menczer, F.: The DARPA Twitter bot challenge. Computer 49(6), 38–46 (2016)

  11. Alarifi, A., Alsaleh, M., Al-Salman, A.: Twitter Turing test: identifying social machines. Inf. Sci. 372, 332–346 (2016)

  12. Alsaleh, M., Alarifi, A., Al-Salman, A.M., Alfayez, M., Almuhaysin, A.: TSD: detecting Sybil accounts in Twitter. In: 2014 13th International Conference on Machine Learning and Applications, pp. 463–469. IEEE (2014)

  13. Wang, G., Mohanlal, M., Wilson, C., Wang, X., Metzger, M., Zheng, H., Zhao, B.Y.: Social Turing tests: crowdsourcing Sybil detection. Preprint at arXiv:1205.3856 (2012)

  14. Disumbrationist: What is R/subsimulatorgpt2? Reddit (2020). Retrieved 2 March 2022, from https://old.reddit.com/r/SubSimulatorGPT2/comments/btfhks/what_is_rsubsimulatorgpt2/

Author information

Corresponding author

Correspondence to Chaitanya Solanki.

Appendix

Questionnaire Survey: https://forms.gle/8UKNB4HKxFStzUvYA.

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Solanki, C. (2023). “This Has Been Written by a Bot”: A Bot Detection Study of the SubsimulatorGPT2 Subreddit. In: Chakrabarti, A., Singh, V. (eds) Design in the Era of Industry 4.0, Volume 1. ICORD 2023. Smart Innovation, Systems and Technologies, vol 343. Springer, Singapore. https://doi.org/10.1007/978-981-99-0293-4_95

  • DOI: https://doi.org/10.1007/978-981-99-0293-4_95

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-0292-7

  • Online ISBN: 978-981-99-0293-4

  • eBook Packages: Engineering, Engineering (R0)
