Abstract
Autism Spectrum Disorder (ASD) is distinguished by a variety of behavioral and social deficits. One of the most prominent problems in an autistic child is their aggressive nature that can cause physical injuries. In order to address the following behavioral deficits, this paper is an attempt to present a novel monitoring framework to predict irregularities from the physical activities of an autistic child. The primary motive behind this study is to improve the safety measures for the child in an indoor environment by generating time sensitive-alerts for caretakers and doctors. In this study, a deep 3D CNN and LSTM based activity prediction methodology are proposed to recognize physical irregularities. The 3D CNN model extracts the spatio-temporal features from the video templates for stance prediction with subject position. The LSTM model calculates the temporal relationship in the feature maps to analyze the scale of irregularity. Moreover, in order to deal with such irregularities, a time-sensitive alert-based decision process is proposed in the present work to generate early warnings to the doctor and caretaker. The proficiency of the system is increased by storing the performed activity scores in the local database of the system which can be further utilized to provide medicinal or therapeutic assistance. In addition, the measured experimental results are validated with state-of-the-art methodologies. Hence, the calculated statistical measures justify the utility of the proposed study in the healthcare and assistive-care domain.
Similar content being viewed by others
References
Akula A, Shah AK, Ghosh R (2018) Deep learning approach for human action recognition in infrared images. Cogn Syst Res 50:146–154. https://doi.org/10.1016/j.cogsys.2018.04.002
American Psychiatric Association (2013) Diagnostic and statistical manual of mental disorders: DSM-5. American Psychiatric Association, Philadelphia
Chen G, Zou Y, Zhang C (2019) STMP: spatial temporal multi-level proposal network for activity detection. Springer, Berlin, pp 29–41
Cheron G, Laptev I, Schmid C (2015) P-CNN: pose-based CNN features for action recognition. In: Proceedings of the IEEE international conference on computer vision
Chu WS, Song Y, Jaimes A (2015) Video co-summarization: video summarization by visual co-occurrence. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition
Dunlap G, Dyer K, Koegel RL (1983) Autistic self-stimulation and intertrial interval duration. Am J Ment Defic 88:194–202
Ezzahout A, Haj Thami RO (2013) Conception and development of a video surveillance system for detecting, tracking and profile analysis of a person. In: 2013 3rd international symposium ISKO-Maghreb
Feichtenhofer C, Pinz A, Zisserman A (2016) Convolutional two-stream network fusion for video action recognition. https://doi.org/10.1109/CVPR.2016.213
Feichtenhofer C, Pinz A, Wildes RP (2017) Spatiotemporal multiplier networks for video action recognition. In: Proceedings—30th IEEE conference on computer vision and pattern recognition, CVPR 2017
Ganz ML (2007) The lifetime distribution of the incremental societal costs of autism. Arch Pediatr Adolesc Med. https://doi.org/10.1001/archpedi.161.4.343
Ghasemzadeh H, Loseu V, Jafari R (2010a) Structural action recognition in body sensor networks: distributed classification based on string matching. IEEE Trans Inf Technol Biomed. https://doi.org/10.1109/TITB.2009.2036722
Ghasemzadeh H, Jafari R, Prabhakaran B (2010b) A body sensor network with electromyogram and inertial sensors: multimodal interpretation of muscular activities. IEEE Trans Inf Technol Biomed. https://doi.org/10.1109/TITB.2009.2035050
Graves A (2013) Generating sequences with recurrent neural networks. Dict Mark Commun. https://doi.org/10.4135/9781452229669.n2760
Griffith GM, Hastings RP, Nash S, Hill C (2010) Using matched groups to explore child behavior problems and maternal well-being in children with down syndrome and autism. J Autism Dev Disord. https://doi.org/10.1007/s10803-009-0906-1
Hutt C, Ounsted C (1966) The biological significance of gaze aversion with particular reference to the syndrome of infantile autism. Behav Sci. https://doi.org/10.1002/bs.3830110504
Hutt C, Forrest SJ, Richer J (1975) Cardiac arrhythmia and behaviour in autistic children. Acta Psychiatr Scand. https://doi.org/10.1111/j.1600-0447.1975.tb00014.x
Ijjina EP, Chalavadi KM (2017) Human action recognition in RGB-D videos using motion sequence information and deep learning. Pattern Recognit. https://doi.org/10.1016/j.patcog.2017.07.013
Karafi M, Cernock JH (2010) Recurrent neural network language modeling, pp 1045–1048. https://doi.org/10.1021/jp056727x
Kolda TG, Bader BW, Laboratories SN (2008) Tensor decompositions and applications. Soc Ind Appl Math 51:455–500. https://doi.org/10.1137/07070111x
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. Adv Neural Inf Process Syst. https://doi.org/10.1016/j.protcy.2014.09.007
Lindemann U, Hock A, Stuber M et al (2005) Evaluation of a fall detector based on accelerometers: a pilot study. Med Biol Eng Comput. https://doi.org/10.1007/BF02351026
Liu J, Shahroudy A, Xu D, Wang G (2016) Spatio-temporal LSTM with trust gates for 3D human action recognition. In: Leibe B, Matas J, Sebe N, Welling M (eds) Computer vision – ECCV 2016, vol 9907. Springer, Cham, pp 816–833
Lord C, Volkmar F, Lombroso P (2002) Genetics of childhood disorders: XLII. Autism, part 1: diagnosis and assessment in autistic spectrum disorders. J Am Acad Child Adolesc Psychiatry. https://doi.org/10.1097/00004583-200209000-00015
Mubashir M, Shao L, Seed L (2013) A survey on fall detection: principles and approaches. Neurocomputing. https://doi.org/10.1016/j.neucom.2011.09.037
Myers BJ, Mackintosh VH, Goin-Kochel RP (2009) My greatest joy and my greatest heart ache: parents own words on how having a child in the autism spectrum has affected their lives and their families lives. Res Autism Spectr Disord. https://doi.org/10.1016/j.rasd.2009.01.004
Ng S, Fakih A, Fourney A et al (2009) Towards a mobility diagnostic tool: tracking rollator users leg pose with a monocular vision system. In: Proceedings of the 31st annual international conference of the IEEE engineering in medicine and biology society: engineering the future of biomedicine, EMBC 2009
Noury N (2002) A smart sensor for the remote follow up of activity and fall detection of the elderly. In: 2nd Annual international IEEE-EMBS special topic conference on microtechnologies in medicine and biology—proceedings
Noury N, Rumeau P, Bourke AK et al (2008) A proposal for the classification and evaluation of fall detectors. IRBM 29:340–349
Nyan MN, Tay FEH, Tan AWY, Seah KHW (2006) Distinguishing fall activities from normal activities by angular rate characteristics and high-speed camera characterization. Med Eng Phys. https://doi.org/10.1016/j.medengphy.2005.11.008
Pascanu R, Mikolov T, Bengio Y (2012) On the difficulty of training recurrent neural networks. https://doi.org/10.1109/72.279181
Perry JT, Kellog S, Vaidya SM et al (2009) Survey and evaluation of real-time fall detection approaches. In: 6th International symposium on high capacity optical networks and enabling technologies, HONET 09
Rimminen H, Lindstrm J, Linnavuo M, Sepponen R (2010) Detection of falls among the elderly by a floor sensor using the electric near field. IEEE Trans Inf Technol Biomed. https://doi.org/10.1109/TITB.2010.2051956
Shahroudy A, Liu J, Ng T-T, Wang G (2016) NTU RGB+D: a large scale dataset for 3D human activity analysis. https://doi.org/10.1109/CVPR.2016.115
Shi G, Chan CS, Li WJ et al (2009) Mobile human airbag system for fall protection using MEMS sensors and embedded SVM classifier. IEEE Sens J. https://doi.org/10.1109/JSEN.2008.2012212
Singhal S, Tripathi V (2019) Action recognition framework based on normalized local binary pattern. Springer, Singapore, pp 247–255
Song S, Lan C, Xing J et al (2016) An end-to-end spatio-temporal attention model for human action recognition from Skeleton data. In: Thirty-first AAAI conference on artificial intelligence, pp 4263–4270
Sun L, Jia K, Yeung DY, Shi BE (2015) Human action recognition using factorized spatio-temporal convolutional networks. In: Proceedings of the IEEE international conference on computer vision
Sutskever I, Hinton G, Krizhevsky A, Salakhutdinov RR (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res. https://doi.org/10.1214/12-AOS1000
Tran D, Bourdev L, Fergus R et al (2015) Learning spatiotemporal features with 3D convolutional networks. In: Proceedings of the IEEE international conference on computer vision
Wang Y, Long M, Wang J, Yu PS (2017) Spatiotemporal pyramid network for video action recognition. In: Proceedings—30th IEEE conference on computer vision and pattern recognition, CVPR 2017
Xu Z, Yang Y, Hauptmann AG (2015) A discriminative CNN video representation for event detection. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition
Yang X, Molchanov P, Kautz J (2016) Multilayer and multimodal fusion of deep neural networks for video classification. In: Proceedings of the 24th ACM international conference on multimedia, pp 978–987
Yang X, Luo X, Huang T et al (2018) Towards efficient and objective work sampling: recognizing workers activities in site surveillance videos with two-stream convolutional networks. Autom Constr 94:360–370. https://doi.org/10.1016/j.autcon.2018.07.011
Ye Y, Ci S, Katsaggelos AK et al (2013) Wireless video surveillance: a survey. IEEE Access. https://doi.org/10.1109/ACCESS.2013.2282613
Ye J, Stevenson G, Dobson S (2015) KCAR: a knowledge-driven approach for concurrent activity recognition. Pervasive Mob Comput 19:4770. https://doi.org/10.1016/j.pmcj.2014.02.003
Zeiler MD (2012) ADADELTA: an adaptive learning rate method
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Manocha, A., Singh, R. An intelligent monitoring system for indoor safety of individuals suffering from Autism Spectrum Disorder (ASD). J Ambient Intell Human Comput 14, 15793–15808 (2023). https://doi.org/10.1007/s12652-019-01277-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-019-01277-3