Skip to main content

Estimating Support Scores of Autism Communities in Large-Scale Web Information Systems

  • Conference paper
  • First Online:
Web Information Systems Engineering – WISE 2017 (WISE 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10569))

Included in the following conference series:

Abstract

Individuals with Autism Spectrum Disorder (ASD) have been shown to prefer communication at a socio-spatial distance. So while rarely found in the real world, autism communities are popular in Web-based forums, convenient for people with ASD to seek and share health related information. Reddit is one such avenue for people of common interest to connect, forming communities of specific interest, namely subreddits. This work aims to estimate support scores provided by a popular subreddit interested in ASD – www.reddit.com/r/aspergers. The scores were measured in both the quantities and qualities of the conversations in the forum, including conversational involvement, emotional, and informational support. The support scores of the subreddit Aspergers was compared with that of an average subreddit derived from entire Reddit, represented by two big corpora of approximately 200 million Reddit posts and 1.66 billion Reddit comments. The ASD subreddit was found to be a supportive community, having far higher support scores than did the average subreddit. Apache Spark, an advanced cluster computing framework, is employed to speed up processing of the large corpora. Scalable machine learning techniques implemented in Spark help discriminate the content made in Aspergers versus other subreddits and automatically discover linguistic predictors of ASD within minutes, providing timely reports.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Notes

  1. 1.

    https://www.reddit.com/r/autism/.

  2. 2.

    https://www.reddit.com/r/aspergers/.

  3. 3.

    https://praw.readthedocs.org/en/stable/, retrieved March 2017.

  4. 4.

    http://bit.ly/1MvQobz, downloaded 1 October 2015.

  5. 5.

    http://bit.ly/1RmhQdJ, downloaded 1 October 2015.

References

  1. American Psychiatric Association.: Diagnostic and Statistical Manual of Mental Disorders, 5th edn. American Psychiatric Publishing, Arlington (2013)

    Google Scholar 

  2. Bradley, M.M., Lang, P.J.: Affective norms for English words (ANEW): instruction manual and affective ratings (1999)

    Google Scholar 

  3. De Choudhury, M., De, S.: Mental health discourse on Reddit: self-disclosure, social support, and anonymity. In: Proceedings of the International AAAI Conference on Weblogs and Social Media, pp. 71–80 (2014)

    Google Scholar 

  4. Meng, X., Bradley, J., Yavuz, B., Sparks, E., Venkataraman, S., Liu, D., Freeman, J., Tsai, D.B., Amde, M., Owen, S.: MLlib: machine learning in Apache Spark. J. Mach. Learn. Res. 17(34), 1–7 (2016)

    MathSciNet  MATH  Google Scholar 

  5. Nguyen, T.: Mood patterns and affective lexicon access in weblogs. In: Proceedings of the ACL Student Research Workshop, pp. 43–48 (2010)

    Google Scholar 

  6. Nguyen, V., Nguyen, D.T., Le, T., Venkatesh, S., Phung, D.: One-pass logistic regression for label-drift and large-scale classification on distributed systems. In: Proceedings of International Conference on Data Mining (ICDM), pp. 1113–1118 (2016)

    Google Scholar 

  7. Pennebaker, J.W., Francis, M.E., Booth, R.J.: Linguistic inquiry and word count (LIWC) [Computer software]. LIWC Inc (2007)

    Google Scholar 

  8. Tausczik, Y.R., Pennebaker, J.W.: The psychological meaning of words: LIWC and computerized text analysis methods. J. Lang. Soc. Psychol. 29(1), 24–54 (2010)

    Article  Google Scholar 

  9. Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the USENIX Conference on Hot Topics in Cloud Computing, p. 10 (2010)

    Google Scholar 

Download references

Acknowledgment

This work is partially supported by the Telstra-Deakin Centre of Excellence in Big Data and Machine Learning.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nguyen Thin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Thin, N., Hung, N., Venkatesh, S., Phung, D. (2017). Estimating Support Scores of Autism Communities in Large-Scale Web Information Systems. In: Bouguettaya, A., et al. Web Information Systems Engineering – WISE 2017. WISE 2017. Lecture Notes in Computer Science(), vol 10569. Springer, Cham. https://doi.org/10.1007/978-3-319-68783-4_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-68783-4_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-68782-7

  • Online ISBN: 978-3-319-68783-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics