1 Introduction

In the last few decades, robotics research has witnessed significant changes in the traditional paradigm as it has shifted to cover new areas, such as entertainment, transportation, space, healthcare, and others [102]. Robots are now being considered to be used in many applications (e.g. rehabilitation and elderly care) that require direct physical human–robot interaction [37, 102].

Autism spectrum disorder (ASD) is a condition that is diagnosed during childhood and affects neurodevelopment [120]. Previous studies have shown that children on the spectrum have strong interest in technology, such as computer applications [48], virtual environments [87], and robots [16, 98]. There has been a growing interest in the clinical use of robots to assist in the therapy of children with ASD [10, 28, 52]. The usage of technology, especially robotics, for ASD therapy opens many possibilities in the early intervention for children with ASD, and toward more personalized therapy [90, 91]. The application of robots for intervention provides many options and flexibility as robots can be used as an intervention tool to facilitate the therapeutic session, as a co-therapist with turn-taking with the main therapist or as a sole therapist [10, 29].

Fig. 1
figure 1

Some of the robots that have been considered or have the potential for ASD intervention: a Keepon, a yellow snowman-like robot [68]. b Chippies, a pack of playful puppies (With kind permission from WowWee Group Ltd). c SPRK+, a ball robot (With kind permission from Sphero). d Lynx, a humanoid robot companion (With kind permission from UBTECH). e Cozmo, an interactive forklift-inspired robot (With kind permission from Anki). f Leka, an autonomous ball-like robot (With kind permission from Leka Inc). g Tipster, an interactive toy robot (With kind permission from WowWee Group Ltd)

There has been a growing concern pertaining to the rise at an unprecedented rate of ASD among children. For example, the pervasiveness rate of ASD was found to be 1 out of 45 in the United States [118] while in the United Kingdom, the estimate was 1 out of 100 [39]. Due to the diverse nature of ASD, the manifestation of behaviors among children on the spectrum varies greatly in their degree. Children with ASD face lifelong impairments in social interaction, communication, and the exhibition of restricted interests or behaviors [5]. Furthermore, children face a multitude of daily behavioral challenges as compared to neurotypical children [23, 26]. Intervention, especially during the early years, seems to play a significant role in the treatment, or at least in the mitigation of challenging behaviors so that the children can be more independent [92].

The traditional intervention sessions for ASD have been usually conducted by therapists. However, the progress in technology has enabled the introduction of new methods and techniques to facilitate the screening of ASD and intervention [15]. Children were found to be more intrigued to interact with robots as compared to humans due to the limited complexity of robots [16]. There have been many reported evidence of the efficacy of using robots for ASD intervention [16, 98, 103]. Many studies have demonstrated that robots can be used to elicit many behaviors, such as imitation, joint attention, eye contact, and gestural responses [19,20,21, 47, 113].

The diversity that exists within the spectrum of ASD appear to reflect upon the design of social robots that have been considered in this line of research. The social robots that have been developed were designed to have different appearances, sizes, and shapes that can be broadly categorized as being anthropomorphic, non-anthropomorphic or non-biomimetic [16]. Kaspar [54], a minimally-expressive robot is an example of an anthropomorphic robotic design. Keepon [68], a yellow snowman-like robot, is an example of a non-anthropomorphic design while Roball [79] is an example of a non-biomimetic design due to its spherical shape. Furthermore, there are many commercially-available social robots and robotic toys that have been developed to assist in ASD therapy or have the potential to be considered (Fig. 1). The introduction of a new eliciting stimulus (e.g. robot) to such children could become a source of harm during the manifestation of challenging behaviors. Children with ASD often exhibit a variety of stereotypical behaviors that are often impulsive and aggressive. Such behaviors could be a response to an unwanted stimulus or as a result from a failed emotional regulation [77].

The rapid evolution of technology has sparked a global interest in robotics and their prospective applications. The International Federation of Robotics (IFR) [57] has predicted that the number of entertainment robots, such as toy robots, personal edutainment robots, and multi-media robots, will rise to 11 million units by 2019. This dramatic increase in the number of robots, especially the ones in close proximity with humans, has implications on safety that emphasize the need for standardization. Some of the standards that were established were ISO 10218 [58, 59], which is concerned with safety in industrial robots; ISO/TS15066 [62], which is related to collaborative industrial robots; and ISO 13482 [60], which is related to personal care robots. Currently, there is no safety standard for social robots or toy robots.

Fig. 2
figure 2

Different forms of challenging behaviors that are exhibited by children with ASD [35, 66, 72, 75]

One of the earliest works concerning safety and harm quantification with robotics introduced the human pain tolerance as an indicator for potential risks [114]. In another work, the simulation of impact tests of an industrial robot on a crash test dummy using Finite Element Method (FEM) was proposed and demonstrated to assess safety [84]. In later works, actual crash test dummies have been used in impact tests using industrial robotic arms [44, 46]. Various safety indices, such as for the head, chest, and neck, have been used in the evaluation of safety and potential injury levels due to impacts. To date, limited studies have been conducted on safety in social robotics, especially pertaining to the safety of children with ASD [38, 50, 109].

The objective of this study is to quantify the harm levels based on severity indices for one of the identified potentially harmful scenarios that might arise between a child and a social robot due to the manifestation of challenging behaviors. This paper is organized as follows. Section 2 describes the epidemiology of challenging behaviors and the potential risks with robots. Section 3 describes the harm quantification and relevant severity indices. Section 4 provides the materials and methods. Section 5 describes results, which are then discussed in Sect. 6. Section 7 concludes the study.

2 Challenging Behaviors in Children with ASD

2.1 Overview

Individuals on the spectrum are unique and complex in their dispositions and in their manifestation of ASD. Autism affects such individuals and causes a lot of deficits and challenges in their communication skills, interactions with others, behaviors, sensory inputs perception, and social life [23]. Furthermore, they exhibit self-stimulatory behaviors, perfectionist tendencies, meltdowns, and delayed echolalia [30]. Children diagnosed with ASD tend to exhibit more challenging and aggressive behaviors than their neurotypical peers (Fig. 2). For example, those with perfectionist tendencies and emotional regulation deficiencies have shown higher level of aggressive behavior, anxiety, and depression [4, 6].

Frustration is another contributing factor toward the exhibition of more challenging behaviors. Children with ASD might face frustration when being exposed to new unpredictable, overwhelming, and noisy environments as that found in hospitals [66, 97]. In addition, such environments are rich in stimuli that might overload their body’s senses, and that would make meeting their needs even harder due to the increased struggle with the new environmental changes [82, 97]. Challenging behaviors take different forms, such as withdrawal, repetitive and stereotypical habits, aggression against others, self-injury, tantrums, meltdowns, and property destruction [66, 72, 75]. Not only such behaviors pose risks on the children themselves, but also pose a lot of risks on others around them, such as other children, nurses, patients, caregivers, parents, and family members [66, 88].

2.2 Epidemiology of Challenging Behaviors

Among all the children with developmental disabilities, challenging behaviors and anxiety problems seem to have higher prevalence rate among children with autism [32, 42]. Those with more severe ASD have showed higher rate of challenging behaviors as compared to those with less ASD severity [76, 81]. Furthermore, studies have shown that even infants or toddlers that are diagnosed with ASD do exhibit challenging behaviors at a higher rate compared to their neurotypical counterparts [35, 69]. Due to difficulties in using proper communication to satisfy their needs, children with autism might turn to challenging behaviors as a form of communication to express themselves [55].

The pervasiveness of challenging behaviors among individuals with autism is relatively high. One study surveying 222 children reported a 50% occurrence rate while another study surveying 32 adults reported a rate of 69% [7, 13]. A more recent study with a larger sample size of 1,380 children has reported a high aggression prevalence rate of 68% against caregivers and 49% against others [67]. The majority of the previous studies have reported the occurrence of at least one or more challenging behaviors among at least half of the individuals with ASD [35]. The existence of such behaviors has many implications on those providing treatments and services to individuals with ASD [55].

2.3 Robots and Potential Risks

Technology offers a lot of potential to therapeutic sessions, such as, but not limited to, independent learning, individualizing, motivation, reinforcement, social and communicative skills practice, hands-on learning, and others [31]. The advancement in several Artificial Intelligence (AI) fields has enabled robots to function independently and more naturally for effective social interactions. Social robots differ from typical toys in many ways, such as the way they engage people at an interpersonal level to achieve positive outcomes in different domains [14]. Furthermore, social robots should be able to convey emotions, form social relationships, demonstrate personality, use natural communication cues, and to understand their social partners [49]. Child-robot interactions are characterized beyond traditional toys by the form of a robot’s embodiment, the interface of communication, two-way interaction, and the robot’s adaptability to the child [24, 36].

Children interacting with social robots are prone to touching the robot. In some cases, they might show aggression toward the robot [1, 11, 107]. This requires that the existing design guidelines must ensure the safety of the children and the physical integrity of the robot, especially during meltdowns [1, 16, 17]. While some of the existing robots could meet many of the therapy objectives, they are still not adequate enough to be used with some of the children on the spectrum who may exhibit hyperactivity and aggression [53]. The majority of the social robots used in the literature are just prototypes, not commercially available, and have yet to find their ways into therapy sessions or special needs schools [52]. Hence, the exposure to such technology is still very limited worldwide and the need to identify potential safety issues arises. The wide adoption of social companions and smart toys would introduce some concerns and ethical considerations that must be addressed early on [22, 24, 74, 100].

Fig. 3
figure 3

The identified possible risky scenarios that might occur between a child and a social robot due to some of the challenging behaviors

The occurrence of challenging behaviors, such as kicking objects, throwing objects at others, banging on objects, and harming oneself by hitting [75], when a robot is present increases the chances of potentially risky scenarios (Fig. 3). Depending on the size of the social robot being used, the magnitude of potential risks might change accordingly. For instance, kicking a large robot will inflict an initial harm to the kicker and secondary damage on others in case of the robot falling down on them. On the other hand, kicking a small robot might affect others and cause harm. Another challenging behavior that could inflict harm on others is throwing, especially in the case of small and light robots. The child could use the small robot involuntarily as an object to be thrown to others. Self-inflicted harmful behaviors, such as banging and hitting, could be increased with the presence of a robot as it can be used by the child. All the aforementioned scenarios must be accounted for when designing robots and solutions to mitigate them must be investigated.

3 Quantifying Harm

3.1 Overview

The human head is susceptible to traumatic brain injury (TBI) when it is receiving blows or bumps, or when subjected to impacts with projectiles, such as a baseball [12]. TBI is categorized as either being mild or severe, and could cause permanent disability or in severe cases, death. The most occurring type of mild TBI is concussion. It is not considered life threatening. However, the results on the affected individual can be serious [83]. In the United States and in 2013 alone, a total of 2.8 million cases (i.e. 50,000 deaths, 282,000 hospitalizations, and 2.5 million emergency department visits) of TBI have been reported with falls being the lead cause followed by getting struck by or against an object [108]. Furthermore, it has been reported that 1 in 5 cases of TBI occurred among children (age < 15 years old). TBIs among young children may impair the neurological development and cause a multitude of challenges, such as depression, attention deficit hyperactivity disorder, and in attaining academic achievements [2, 3, 89, 116].

Throwing objects, especially in the case of small robots, pose a great risk to the head and needs to be quantified. The objective of this study is to simulate objects being thrown at the head to quantify harm levels. Hence, this section describes the related studies and relevant severity indices.

3.2 Related Impact Tests

Laboratory settings using anthropomorphic test dummies (ATD) are typically used to simulate potentially dangerous scenarios to evaluate the possible harm to a human, such as that used in car crash tests. Furthermore, similar setups have been used to quantify harm due to impacts in some sports and to evaluate protective gears, such as helmets [78, 85, 115]. One study used similar settings to assess the influence of taekwondo kicks and the peak velocity of the foot on the dynamics of the head [34]. In that study, a crash test dummy head was secured to an aluminum frame and it was used as a target for the kicks performed. The dummy head was equipped with an accelerometer to measure the dynamics of the head (i.e. changes in acceleration) as it was being hit by the participants. The data generated was used to assess the potential of concussion based on the head injury criterion (HIC).

Similar studies were conducted in industrial robots to quantify the potential harm due to different possible impact scenarios between a human and a robotic arm. For example, studies have conducted impact tests of a manipulator to a dummy at a standard automobile crash test facility [44,45,46]. In such studies, impacts by robotic arms using flat surface impactors were performed against a standard crash test dummy in three regions (i.e. head, chest, and neck). The potential of injury or harm levels were evaluated based on the respective severity index or indices for each region. The evaluations were based on varying some robotic-dependent variables, such as mass and velocity, and their relation on the resultant severity indices. In some studies, low-cost body part models, such as using a head model or an arm model, and low-cost sensors were considered and used to carry out experimental tests to evaluate human–robot impacts [25, 45].

3.3 Severity Indices

Severity indices are associated with injury scaling, such as the Abbreviated Injury Scale (AIS) [40]. AIS is a tool that provides a simple way to grade the observed injury based on a scoring criteria (Table 1). AIS, together with various severity indices, give an estimation of the potential for an injury and its respective severity.

Table 1 Abbreviated Injury Scale (AIS) and the corresponding injury classification

The investigation in this study was limited to the head, hence, only relevant head indices, namely, head injury criterion (HIC), 3 ms criterion, peak linear acceleration, impact forces, and tissue injuries, are summarized.

3.3.1 Head Injury Criterion

Head Injury Criterion (HIC) is one of the most commonly used severity indices to measure the possible injury to the head in many applications, such as in vehicles and in sports [110]. HIC is defined as:

$$\begin{aligned} HIC= \left( t_{2}-t_{1} \right) \left[ \frac{1}{\left( t_{2}-t_{1} \right) }\int _{t_{1}}^{t_{2}} a\left( t\right) dt \right] ^{2.5} \end{aligned}$$
(1)

where a(t) is based on the resultant acceleration of the head and measured in terms of gravitational acceleration (\(\hbox {g} = 9.81~\hbox {m}/\hbox {s}^{2} \)) and \(\varDelta t = t_{2}-t_{1}\) is the duration of the impact considered in calculating the resultant HIC. The two most commonly used durations to evaluate the severity of injury to the head are 36 ms and 15 ms. Throughout this study, the 36 ms duration only will be used.

The HIC standard is converted to a corresponding AIS based on the following relation [70]:

$$\begin{aligned} p\left( head\, injury \right) = \phi \left( \frac{\ln \left( HIC_{36} \right) -\mu }{\sigma } \right) \end{aligned}$$
(2)

where \(\phi \) is the cumulative normal distribution, and \(\mu \) is the population mean, and \(\sigma \) is the standard deviation (Table 2). These values were originally specified for a test dummy head, hence, they will only be used for comparison purposes. At a particular HIC value, the probability of injury occurrence differs between each of the AIS scores (Fig. 4).

Fig. 4
figure 4

The relationship between the head injury criterion (HIC) and the probability of injury according to different abbreviated injury scales (AIS) [70]

Table 2 The values of population mean \(\mu \) and standard deviation \(\sigma \) corresponding to different scores of the Abbreviated Injury Scale (AIS)

3.3.2 The 3 ms Criterion

This criterion requires that the maximum mean value over a 3 ms duration of the resultant head acceleration is less than a certain threshold when there is no hard contact (Fig. 5). This criterion is used as part of the regulations pertaining to the safety of occupants in vehicles and in helmet testing [99].

The European National Car Assessment Protocol (Euro NCAP) states that the thresholds for 3 ms criterion for a child occupant should not exceed 60 g in case of frontal impact and 60 g in case of side impact [33].

Fig. 5
figure 5

Sample of the generated linear acceleration data demonstrating the durations of both the head injury criterion (HIC) and the 3 ms criterion. Also shown is the instance of peak linear acceleration that was considered in the data analysis

3.3.3 Peak Linear Acceleration

Peak linear acceleration has been used as one of the biomechanical measures for head impact to investigate its association with concussion events [41, 94]. A study investigating the head impact exposure in youth football has reported linear accelerations due to impacts anywhere in the range of 10 g to 111 g [117]. While that study did not report the occurrence of any concussions, it is believed that concussions could occur within that range based on the reported football-related concussions [8]. One study based on a finite element head model validated from field collisions has estimated a probability of mild TBI to be 25%, 50%, and 80% corresponding to maximum accelerations of 66 g, 82 g and 106 g, respectively [119]. One study has reported the occurrence of a concussive event at a relatively low linear acceleration value of 31.8 g [80]. Some of these studies did not report the duration of impacts while others reported impact durations of 30 ms or less.

3.3.4 Impact Forces

The HIC severity index is not enough for the assessment of head safety, especially for what concerns any potential damage to the skull and brain injury [111]. Contact force is another indicator to predict the fracture tolerance of the human bone structure. There have been many studies conducted on the heads of cadavers to measure the fracture forces of the skull. The experiments conducted were either by dropping the heads from different heights or impacting the head with an impactor at various velocities. A summary of the studies conducted on facial fracture (i.e. maxilla, zygoma, frontal bone, nasal bone, and mandible) revealed peak force tolerances anywhere in the range of 610–9,880 N [9]. An injury risk function with comparable consistency to facial fracture data has been proposed based on Weibull distribution [9, 27]. The aim is to identify forces at which facial fracture starts for an impactor that has an area of \(13.8\hbox { cm}^{2} \). The function is defined as:

$$\begin{aligned} P_{fracture}\left( F \right) = 1-exp\left[ -\left( \frac{F}{B} \right) ^{\alpha } \right] \end{aligned}$$
(3)

where \(\alpha = 2.27\) is the Weibull shape parameter, and B = 887.7 N is the Weibull scale parameter, and F is the impact force. According to this function, a force of 755 N would have 50% chance of fracture (Fig. 6).

Fig. 6
figure 6

Injury risk function for facial fractures (i.e. maxilla and zygoma) [9, 27]. Dashed red line represents a facial fracture probability of 50% corresponding to a force value of 755 N

Table 3 Classification of tissue injury based on depth [106]

3.3.5 Tissue Injuries

Tissue injuries resulting from impacts with objects could take on different forms, such as skin tears, lacerations and abrasions. The magnitude and the depth of the resultant injury depends on the geometry of impactors or penetrators and their dimensions [101]. Classification by depth is often used as an indicator for wounds and tissue injuries [106]. According to this classification, tissue injuries could either be superficial wounds, partial-thickness, full-thickness or subcutaneous skin loss (Table 3). This classification will be used as a reference and indicator for the possible tissue injury in our study.

Fig. 7
figure 7

The developed low-cost 3D printed head with the embedded sensors

4 Materials and Methods

4.1 Dummy Head Development

A 3D printed head made of polylactide (PLA) was augmented with clay to reach a weight of 3.1 kg that is comparable to the weight of a scaled 50th percentile of 3–6 years old children’s dummy heads [105]. A 2 mm layer of deformable soft material made of silicone (Ecoflex OO-30, Smooth-On, USA) was added to the dummy head to add more lifelike skin [18]. This silicone layer will be used in estimating potential tissue injuries due to its close Shore hardness value to that of the human skin [86]. The developed setup focuses on the dynamics of the head only because for short impact durations, the effects of the neck and body mass on the head are believed to be minimum [112].

A low-cost triple-axis accelerometer (ADXL 377, SparkFun Electronics, Colorado, USA) was placed at the center of the head to measure the linear acceleration of the head. A force-sensing resistor (FlexiForce Force Sensor, Tekscan Inc, USA) was placed at the center of the forehead to measure the impact forces (Fig. 7). The force sensor was calibrated according to the manufacturer’s guide. A small puck (i.e. disk-like force concentrator) was placed at the center of the sensing area to ensure that most of the forces applied can be detected. In case the embedded sensor failed to properly register some of the impacts, a digital force gauge (FGE-100X, Shimpo Instruments, USA) was used in separate experiments to measure the impact forces by attaching it to the top of the head, thus increasing the total weight of the head to 3.5 kg.

Fig. 8
figure 8

The experimental setup used in this study

4.2 Experimental Setup

The experimental setup was based on a low-cost head model situated in a dedicated frame (Fig. 8). The dimensions of the frame were (94.0 \(\times \) 94.0 \(\times \) 94.0 \(\hbox {cm}^{3}\)). Nylon coated wire ropes were used to position the head at the center of the frame. Both sensors (i.e. accelerometer and force) were interfaced to a computer through a data acquisition card (PCI-6031E, National Instrument, USA). The sampling rate was 20 kHz and the signals were filtered according to the Channel Frequency Class 60 [104].

4.3 Impactor

To represent a small social robot, a 3D model with minimum features was designed and then fabricated using a 3D printer (Replicator 5th Generation, MakerBot Industries, USA; Fig. 9). The dimensions of the impactor are around 18.0 \(\times \) 8.0 \(\times \) 17.0 \(\hbox {cm}^{3}\) and weighs around 0.55 kg. The surface roughness of the printed robot model was limited to the resolution of the 3D printer.

While there are many large social robots, the smaller ones are more affordable and are more suited for typical home users. The advancement in technology is allowing smaller robots to be more compact and intelligent. Hence, the size and mass of the proposed dummy robot falls according to such projections. Furthermore, the proposed parameters are within the potential range for throwing objects by the target users.

Fig. 9
figure 9

The impactor representing a small social robot that has been used in the experiments. a Perspective view. b Side view

4.4 Procedures

4.4.1 Setup Validation

To verify the reliability of the developed setup in reporting the comparable HIC values, six impact tests at different velocities were conducted. The impactor used was a 2 kg mass attached to a beam (Fig. 10a). The beam was attached to the main frame of the setup and allowed for free motion that enabled it to hit the frontal side of the head at various velocities (see supplementary material). All impact tests were recorded and the corresponding HIC, 3 ms criterion, peak linear acceleration, and impact velocities were calculated.

Fig. 10
figure 10

Experiments that were conducted in this study. a Validation experiment with the 2 kg impactor. b Harm quantification experiment with the dummy robot

4.4.2 Harm Quantification

Two different experiments of 15 trials each were conducted. Experiment 1 was in a more controlled condition as the dummy robot was tied with a rope to the frame to freely allow it to swing (Fig. 10b). Experiment 2 was in a more comparable condition to the realistic scenario. That involved the throwing of the dummy robot at various velocities from a distance of 1 m away from the head model. The velocities used were in the range of 2.5–8 m/s, which was within the range of a previously reported throwing speeds of tennis balls performed by children of different ages (i.e. 3–9 years) [96]. We believe that this range is reasonable and comparable to the throwing velocities that might be exerted by children on the spectrum.

All experiments were recorded using a video camera (FDR-X1000V, Sony, Japan) in slow-motion mode (240 fps, 720 pixels). All videos were analyzed using the open-source video analysis software Tracker (v4.10.0, Douglas Brown, Open Source Physics). A LabView (v2014, National Instrument, USA) script was used to obtain the raw data from the data acquisition card, processes it and then stores it in a worksheet file. The data were post-processed by a Matlab (v2015, MathWorks, Massachusetts, USA) script that generates the HIC, 3 ms criterion, and peak linear acceleration values.

5 Results

5.1 Setup Validation

Fig. 11
figure 11

Head injury criterion (HIC) values generated by the developed experimental setup due to different impact velocities with a 2 kg impactor. The results were compared with similar impacts conducted by different industrial robots

Fig. 12
figure 12

The 3 ms criterion values generated due to different impact velocities with a 2 kg impactor. The results were compared with similar impacts conducted by an industrial robot

To validate the head model setup, results were compared to previous studies of a similar nature [44, 45], where a low-cost dummy head was developed and an impactor of a mass of 1 kg was used for validation. The impact tests were conducted using robotic arms of different masses moving at different velocities and their results were then compared to that obtained with anthropomorphic test dummies (ATD). Their setup was able to reproduce a comparable numerical HIC values.

The generated HIC values from the validation impact tests in our study were comparable to Haddadin et al’s previous studies (Fig. 11). For example, impact at a velocity of around 1 m/s generated an HIC value in the range of 3–10. The trend is also similar as the values of HIC obtained have increased proportionally with the applied impact velocities. As for the 3 ms criterion, the values at around 1 m/s were in the range of 8–16 (Fig. 12). The differences in the values obtained can be attributed to the differences in the mass of the impactors used (i.e. 2 kg vs 1 kg) and the mass of the developed dummy heads (i.e. 3.1 kg vs 4.5 kg) [56, 73].

5.2 Harm Quantification Measures

5.2.1 Head Injury Criterion (HIC)

In Experiment 1, there is a more consistent trend that as the velocity of impact increases, the corresponding numerical HIC value increases (Fig. 13). The lowest recorded HIC value was 0.013 and it occurred at a velocity of 0.6 m/s while the highest recorded HIC value was 8.568 corresponding to a velocity of 5 m/s.

In Experiment 2, the overall trend is less consistent at certain velocities as compared to Experiment 1, especially around 4 m/s. However, there is an increase in the recorded HIC values as the overall speed of throwing increases. The lowest HIC value obtained was 0.114 corresponding to a velocity of 3.9 m/s while the highest recorded HIC value was 7.066 at a velocity of 7.48 m/s.

Fig. 13
figure 13

The corresponding head injury criterion (HIC) values for impact experiments conducted in Experiment 1 and 2. Experiment 1 had the dummy robot attached to the experimental setup while Experiment 2 had the experimenter conduct the throwing of the dummy robot. The results were compared with the HIC values generated by an industrial robot

Fig. 14
figure 14

The corresponding 3 ms criterion values for impact experiments conducted in Experiment 1 and 2. Experiment 1 had the dummy robot attached to the experimental setup while Experiment 2 had the experimenter conduct the throwing of the dummy robot. The results were compared to the 3 ms criterion thresholds for a child occupant [33]

5.2.2 The 3 ms Criterion

In Experiment 1, the lowest recorded 3 ms value was 1.425 g and it occurred at a velocity of 0.6 m/s while the highest recorded 3 ms value was 21.476 g corresponding to a velocity of 5 m/s (Fig. 14). The trend of the 3 ms values are linearly increasing with the applied velocities.

In Experiment 2, the lowest 3 ms value obtained was 2.96 g corresponding to a velocity of 2.97 m/s while the highest recorded 3 ms value was 18 g at a velocity of 7.48 m/s (Fig. 14). The trend is less consistent as compared to Experiment 1 as evident around 4 m/s.

5.2.3 Peak Linear Acceleration

In Experiment 1, the lowest recorded peak linear acceleration value was 1.5 g and it occurred at a velocity of 0.6 m/s while the highest recorded peak value was 23 g corresponding to a velocity of 5 m/s (Fig. 15). The peak acceleration values are increasing linearly with the throwing velocity.

In Experiment 2, the lowest peak linear acceleration value obtained was 3 g corresponding to a velocity of 2.97 m/s while the highest value was 19 g at a velocity of 7.48 m/s (Fig. 15). the trend is less consistent as compared to Experiment 1, especially around 4 m/s.

Fig. 15
figure 15

The corresponding peak linear acceleration values for impact experiments conducted in Experiment 1 and 2. Experiment 1 had the dummy robot attached to the experimental setup while Experiment 2 had the experimenter conduct the throwing of the dummy robot. The highlighted area represents the range of peak linear accelerations that is associated with the occurrence of subconcussive events [93]

Fig. 16
figure 16

The corresponding impact force values for impact experiments that were conducted using a stand-alone digital force gauge. The right axis represents their corresponding probabilities of causing facial fracture

5.2.4 Impact Forces

The embedded force sensor approach has failed in registering some of the impacts or maximum values due to the lack of sufficient contact between the dummy robot and the effective area of the sensor. However, the maximum force recorded in all of the experiments was 28 N at a velocity of 5.75 m/s.

In order to get a better understanding of the potential impact forces involved, four separate experiments at different velocities were conducted using a stand-alone force gauge. These experiments were conducted similar to Experiment 1 (Fig. 10b). The lowest value was around 30.1 N corresponding to a velocity of 0.75 m/s while the maximum value was 91.3 N at a velocity of 2.15 m/s. There is a trend and linear relationship between the applied velocities and the measured resultant peak force values (Fig. 16).

Fig. 17
figure 17

The observed tissue damage to the artificial skin. a Abrasions-like skin damage of depth that is less than 1 mm. b Laceration-like skin damage of depth that is equal or greater than 2 mm

5.2.5 Tissue Injuries

The evaluation of tissue injuries was based on the visual inspection of the artificial skin. All observations were made after conducting Experiment 1 and Experiment 2. In terms of depth, the majority of the observed damage to the artificial skin were less than 1 mm (Fig. 17a). There were some instances where the depth of the observed damage was more than the thickness of the artificial skin (i.e. greater than 2 mm), which pierced the dummy head (Fig. 17b).

6 Discussion

For the HIC values to be meaningful, they need to be translated to a corresponding metric for potential injury based on the AIS. From the reported equations in Sect. 3, the probabilities of injury due to all impact tests conducted in both Experiments 1 and 2 were negligible (i.e. close to 0%). Even though all the estimated potential for injuries in our experiments were low, there is still a potential for serious harm based on the reported catastrophic injuries and fatalities that occurred due to impacts with lighter objects (e.g. baseball) among children [12, 71]. While the HIC severity index is significant in giving an estimation of the potential for head injury, it is insufficient to estimate pain and assess tissue injuries.

As for the 3 ms criterion, the maximum values obtained for both sets indicate a low potential for harm. The highest value for Experiment 1 is around 36% of the 3 ms criterion impact limit (i.e. 60 g for frontal and side impacts). As for Experiment 2, the corresponding percentage for the highest 3 ms value is around 30% for the 3 ms criterion impact limit of 60 g.

The values for the peak linear accelerations for both experiments were around 20 g, which were far from most of the reported peak accelerations (e.g. 66–106 g) that are associated with concussive events [119]. However, most of the obtained peak acceleration values fall within the range (i.e. 6–46 g) that is associated with subconcussive events (Fig. 15). Furthermore, two peak linear acceleration values were at or above the reported median value of 19 g that has been associated with the occurrence of subconcussive impacts, where the occurrence of which has been linked to neurocognitive deficits [93]. More research needs to be done to understand the biomechanical variables and its relation to causing concussion or mild TBI among children.

Fig. 18
figure 18

Surface areas of the dummy robot that hit the head. a Frontal edge. b Chimney. c Back. d Side

For the HIC, 3 ms criterion, and peak linear acceleration results, there was a noticeable disparity in Experiment 2 at a velocity of around 4 m/s. For example, the range of HIC values was from 0.114 to 2.834. This discrepancy can be attributed to the surface or area of the dummy robot that hit the head (e.g. chimney vs side) as observed from the analysis of the recorded videos (Fig. 18). This implies that at higher velocities, the harm level could even be larger depending on the contact area.

The peak force values for all the experiments were translated to percentages corresponding to the potential of causing fracture to the skull based on the previously stated relation (Fig. 6). All the recorded peak forces have a low potential to cause real harm to any of the facial bones. For example, a peak value of 91.3 N (i.e. the maximum value obtained) corresponds to around 0.6% chance of causing fracture to the bones of the face. However, assuming the linear relation holds true for higher velocities (Fig. 16), the chance for facial fracture increases to 20% at a hypothetical throwing velocity of 10 m/s for the same robot.

The depth of the observed artificial skin damage correspond to the tissue injuries based on the classification listed in Table 3. Most of the damage caused by the impacts conducted in this study falls into the superficial category that affects the epidermis layer, and they were in the form of abrasions. There were few lacerations that are classified as partial-thickness skin loss that would require medical care. No instances of full-thickness (i.e. depth greater than 4 mm) skin loss were observed.

Fig. 19
figure 19

The sustained damage on the small robot after the completion of the experiments

The dummy robot sustained a considerable damage (Fig. 19). The damage was more apparent after performing Experiment 2 (i.e. mimicking real throwing scenario). Some of the lacerations on the artificial skin of the head could have resulted from the newly formed sharp edges on the robot due to the sustained damage from some of the initial impacts. This implies that any robotic design should remain robust and safe, especially after being subjected to similar impacts and conditions. Superficial injury as an indicator for potential harm could be better than some of the typical severity indices. A similar conclusion has been reached in studies investigating the potential harm in industrial robots [43, 44].

The investigations conducted in this study were limited to a potential scenario that may happen between a child and a social robot wherein a child throws a robot to the face of another person. Quantifying harm was limited to the head. Measurements of injury to the head was limited to the existing severity indices and to the available data from the literatures. Thus, some aspects could not be measured, such as pain levels. The tissue injury investigations were limited to the artificial skin. Thus, the actual depth of lacerations on in vivo tissues might differ. Finally, the study was limited to small form factor impactor representing a small social robot. Hence, the obtained results are only applicable to closely comparable robots in terms of mass, design, material, and size.

7 Conclusion

The motivation of this study was to investigate the potential for harm due to the interaction between social robots and children with ASD, especially during the manifestation of challenging behaviors. Throwing, kicking, hitting, and self-harming are some of the challenging behaviors that could inflict some harm to the children themselves and to those around them. Our investigation of harm due to the throwing of a small social robot revealed that it could potentially cause tissue injuries, subconcussive or concussive event in extreme cases.

Social robots have gained a lot of attention in healthcare generally, and specifically in therapy, due to the increased number of studies reporting the efficacy of using such technology. Among children with ASD, social robots have been reported to effectively help in the elicitation of some positive behaviors. This has been attributed to the fact that social robots are simpler and are less complicated than humans. The advancement in technology has enabled robots to be more autonomous, intelligent and has opened new possibilities for smaller social robotic forms exhibiting high interactivity. While larger social robots can be used in therapeutic settings, especially for training, the smaller form of social robots are more affordable and suitable in home usage for continuous monitoring. The introduction of a new tool, such as a social robot, that is meant to evoke behaviors could pose as a new source of harm to the children themselves or others.

Children on the spectrum lack the ability to properly communicate their needs [64, 103]. Generalization of skills is usually done with other children. Hence, being subjected to pain due to impacts might affect any positive outcomes, and potentially cause more challenging behaviors. Any source of potential harm, pain or even annoyance must be kept to a minimum or eliminated altogether.

More research needs to be done on investigating the potential for harm due to the different robotic forms and to search for means to mitigate them through hardware and software approaches [63, 65]. For small social robots, regulations from standards that are concerned with the safety of toys can readily be adopted. For example, ISO 8124-1:2014 is related to the mechanical and physical properties of the safety aspects of toys [61]. Another direction for safer social robots is the adoption of some of the techniques and advances in soft robotics [51], which could potentially allow the embedding of different biopolymer-based sensors [95]. New social robotic safety standards for special needs children, such as children with ASD, must be established to ensure their safety and to take their needs into consideration.