Abstract
Hand sketch psychological data are mysterious and can be used to detect mental disorders early and prevent them from getting worse and with irreversible consequences. The Original Bender Gestalt Test is a psychology test based on hand-sketched patterns. Mental disorders require an automated scoring system. Unfortunately, there is no automatic scoring system for the Original Bender Gestalt test for adults and children with high accuracy. Automating the Original Bender Gestalt test requires 3 phases: Phase 1, collecting a comprehensive Original Bender Gestalt dataset called OBGET. Phase 2, classifying patterns by a proposed method called MYOLO V5; and Phase 3, scoring classified patterns according to associated rules of psychological standard criteria. This research reviews a comprehensive OBGET dataset that includes 817 samples, labeling samples for mental disorders by a psychologist, statistical analysis, the proposed semi-automatic labeling of patterns, patterns classification applied the proposed modified YOLO V5 called MYOLO V5, and automatic scoring of drawing patterns. MYOLO V5 accuracy is 95% and the accuracy of the proposed method called OBGESS as a mental disorder detection is 90%. In this research, a new automatic computer-aided psychological hand sketch drawing test has been proposed.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Early detection of mental disorders prevents their progression. Automating psychological tests can help prevent mental disorders. We deployed three phases to automate psychological drawing tests. These three phases are defined as follows:
Phase1, collecting a comprehensive dataset called OBGET that includes 817 samples of children and adults, male and female participants with healthy or mental disorders, labeling samples for mental disorders by a psychologist and labeling sketched patterns of each test semi-automatically using the proposed semi-automatic labeling of patterns, presenting descriptive and inferential statistical analysis.
Phase 2 classified nine patterns of each test using the proposed modified YOLO V5 called MYOLO V5.
Phase 3 scores drawing patterns according to psychological criteria using associated rules automatically.
The proposed modified YOLO V5 classified patterns of each OBGET dataset sample into nine classes. The proposed modified YOLO V5 includes the proposed architecture, activation function, optimization function, loss function, and final model, including weights, biases, parameters, and gradients.
The Original Bender Gestalt drawing test is a low-cost and non-invasive technique widely used in neural systems and rehabilitation engineering [1]. The Original Bender Gestalt drawing test detects adult brain lesions [2]. To detect a wide range of mental disorders, a low-cost and quick solution is to use the automatic scoring of the Original Bender Gestalt drawing test [1]. Most of the research [3,4,5] uses EEG. People may have a phobia of EEG. Therefore, they would like to use other solutions such as psychological drawing tests, or they want to use online systems with the least spending time and money with the help of psychological tests. In this work, with the help of the original Bander Gestalt drawing test at the lowest cost using paper and pencil, data collection, labeling, classification, and automatic scoring of the original Bander Gestalt drawing test are done to check participants’ mental health.
The number of people suffering from mental disorders has increased these days. Due to the increasing need for psychologists, the desire for online services, and the lack of psychologists, the need for interactive visual systems is much greater than before [6]. Psychological computer-aided programs help people detect mental disorders faster and better. For mental disorders like physical diseases, prevention is more important than treatment. Therefore, developing applications to automate psychological tests can be a significant step in mental disorders early detection.
Using computer-aided programs to automate psychological tests makes it possible to detect mental disorders [7,8,9,10,11,12]. Computer-aided programs [13] and mental health detection systems are presented [14,15,16,17]. OBGESS is a psychologist-assistant system. For presenting OBGESS, a comprehensive dataset, pattern classification, and automatic scoring to detect patterns is needed. The Original Bender Gestalt test is one of the most powerful tests for detecting brain lesions [18], psychotic [19] such as anxiety, depression, hysteria, obsessive behaviors, psychopathy, alcoholism, psychosomatic disorders, and neurotic [20] such as schizophrenia in adults [1]. The original Bender Gestalt test includes nine patterns presented one at a time to participants and asked to draw patterns using a pencil and paper [21].
Mental disorders interfere with human behavior and daily activities. The original Bender Gestalt test detects psychosomatic disorders. Psychosomatic disorders limit human behavior and physical performance. Early detection of mental disorders prevents their progression. The Original Bender Gestalt drawing test is a low-cost, non-invasive, and nonverbal technique widely used to detect abnormal human behavior.
The Original Bender Gestalt drawing test detects brain lesions in adults. Brain lesions affect human behavior and physical movements. Using the Original Bender Gestalt drawing test to detect various mental disorders is a low-cost and quick solution. Due to the lack of the Original Bender Gestalt drawing test dataset, OBGET is fed as a comprehensive dataset to OBGESS.
Due to the lack of a computer-aided program for automatic scoring with high accuracy of the Original Bander Gestalt test in previous works [22,23,24,25,26,27,28,29,30], OBGESS is presented with high accuracy for children and adults in this work. For this purpose, to provide an OBGESS system for automatic scoring and mental disorder detection, some steps are required: dataset collection of the Original Bender Gestalt test for children and adults along with metadata, statistical analysis, mental disorder labeling, pattern labeling, and pattern classification with high accuracy.
Despite the high importance of data, some previous works [29, 30] do not mention data collection, statistical analysis, data labeling, mental disorder detection, and just present pattern classification of the Original Bender Gestalt test. Previous research in this field [29, 30] did not use a standard dataset, so their evaluation result is unreliable. Unfortunately, a comprehensive dataset has not been presented in this field. The OBGET dataset includes 817 tests. A psychologist detects mental disorders from tests, and statistical analysis is conducted. Necessary pre-processing is done, and dataset samples have been labeled by deploying the proposed semi-automatically labeling.
Contributions to the OBGESS proposed method include:
-
A comprehensive Original Bender Gestalt test dataset called OBGET is presented.
-
OBGET dataset statistical analysis is presented.
-
OBGET samples are tagged as different types of mental disorders by a psychologist.
-
The proposed semi-automatically labeling is deployed for labeling nine patterns in each OBGET sample.
-
Patterns are classified based on the proposed modified YOLO V5 called MYOLO V5.
-
Automatic scoring to classify patterns is based on associated rules according to the standard scoring of the Original Bender Gestalt Test.
This work is organized as follows: Sect. 2 describes related scoring systems. The proposed solution is described in Sect. 3. Section 4 explains the evaluation. Section 5 discusses the results. Finally, Sect. 6 presents an overview of the results and future works.
2 Related Works
Handwriting and hand sketches can detect abnormal human behavior and mental disorders [31, 32]. Detection of human emotional states [33,34,35,36,37] is presented using the DEAP datasetFootnote 1 [38]. EEG signals and faces of 32 participants were recorded. Participants may be reluctant to undergo EEG testing. This work detects mental disorders without special and stressful tools but only with a simple hand sketch. For social interaction scoring and analysis, authors [11] used the MatchNMingle datasetFootnote 2 [2] using wearable devices and cameras to collect 92 participants. Behavioral analysis can score and detect autism at an early age of 12 months, which requires interpretation by clinicians and is high cost and time-consuming. Using these devices may harm patients.
Computer-aided applications in the human behavior field [39,40,41,42,43,44,45,46,47] and psychology field [48,49,50,51,52,53,54], such as extracting emotions [55] from music and sound [56,57,58,59,60,61,62,63], detecting emotional data in colors [64, 65], and detecting human behavior [66]. Authors [67] provided a mobile-based application that extracts and scores a child's behavioral responses to a video played on mobile and records the reaction using a mobile camera.
Recent studies are divided into two general categories: (a) Scoring systems for freehand sketches and (b) Scoring systems for pattern drawings.
2.1 Scoring Systems for Free Hand-Sketches
This section includes scoring systems for freehand sketches [68] and related datasets [69].
Authors [70] present a scoring system for participants' drawings on a web-based dataset called KDF. A new object-based image retrieval method is presented to bring similar images together. Euclidean equivalence of shapes is meant to be the same. Authors [71] score and detect mental disorders by measuring participants' painting location on the page on the KDF dataset [70]. In another research [72], authors present a mental scoring system based on painting colors on the KDF dataset [70]. A dataset includes 70 children's paintings suffering from ADHD [73]. Children are asked to paint freely within 5 min. Authors use unsupervised learning to extract features. In [44], 111 participants' characteristics are analyzed. A text with 1,600 words and 13 paragraphs about water is provided for students. Participants pass three steps: read the paragraph, create their mental image, and draw their picture.
For detecting similar mental disorders, 11,550 sketches [74] were used. The lowest Hamming distance of each pair of rows of the sparse matrix means that participants' drawings interpretation is similar. A Clock Drawing Test is used to detect Alzheimer's disease [75,76,77,78,79] using a dataset containing 10,992 manuscripts from 44 authors aged 25 to 87 years. Smoothing Gaussian is also used to eliminate noise. A fuzzy k-nearest neighbor was used to extract dynamic features, and a convolutional neural network was used to extract static features. For scoring to visual memory [80], deep learning [81, 82] is used on a dataset containing 69,495 drawings. A support vector machine classifies 160 Gy-level facial sketches [69]. K-means is used to classify hand-drawing shapes [83], also mathematical calculations are used to identify the probability and estimation of rotations and changes. Support vector machine classifies 1000 handwritten documents [84], including texts, drawings, charts, tables, lists, and logos. Authors [85] designed six pencil and paper psychological drawing tests to score and detect mental disorders and used rule-based methods to predict mental disorders.
2.2 Scoring Systems for Pattern Drawings
This section investigates scoring systems for pattern drawings [69].
Authors [32] tried to detect depression, anxiety, and Parkinson's disease [86] on the EMOTHOW dataset. Each participant performs seven tests, including drawing geometric shapes and writing Italian sentences. The Random Forest algorithm analyzes these features. VGG-16 is used to classify dataset images [87] to score and detect mental disorders on a dataset that includes 120 participants ranging from 16 to 66 years. A convolutional neural network is used for scoring mental disorders on a dataset including 152 offline paintings [88]. For detecting mental disorders [89], participants fill shapes with colors. Images are classified using linear regression based on the number and type of used colors. Machine learning classifies and scores CDTFootnote 3 [69] on a dataset containing 3541 tests. Convolutional neural networks classify objects [91] on a CDT dataset [90] containing 65 samples. KNN and neural networks detect handwritten numbers for CDT [92] using the MNIST datasetFootnote 4 with 5000 samples. CDT scoring system records strokes, coordinates (x, y), and time online [93]. After completing the drawing and pressing the "Complete" button, test results will be displayed immediately using machine learning and geometric analysis. A support vector machine is used for classification and scoring a psychological test to detect dementia [94] on a dataset (Clock Drawing Test URL: www.clockdrawingtest.com).
An application [95] is presented to take a psychological test for detecting mental disorders. The CBFA algorithm is used for segmentation. Six machine learning methods classify and evaluate 456 CDT results [96]. Machine learning classifies [69] 65 CDT samples. CDT mobile application [97] detects mental disorders. K-nearest neighbor is used for classification. Biometric features are identified [98] from a dataset with 11,000 hand sketches. CNN extracts features, and SVM classifies gender and biometric identification. ANN (Approximate Nearest Neighbor) is used for sketch retrieval using two re-ranking schemes: relevance feedback and query retouching on the Manga dataset [99]. Manga contains 109 comic books with 21,142 pages. The Manga dataset scores and examines participant feelings after reading sentences. Sketch recognition has many methods [100]. Analyzing human visual data and emotion detection [101, 102] is challenging and a key issue. Identifying visual concepts about humans can affect decision-making models and is a challenging task that cannot be easily solved with automated approaches. OBGET dataset statistical analysis is reported. This section deals with descriptive statistics, and Sect. 5–1 presents inferential statistics. Descriptive statistics [103] include gathering, summarizing, organizing, classifying data, and reporting facts in diagrams. Diagrams describe the mean population and median range of variation and standard deviation [103].
This work presents a scoring system called OBGESS to detect mental disorders using pencil and paper Original Bender Gestalt Test without stressful, expensive devices. For training and testing phases, OBGESS is deployed on OBGET as a comprehensive dataset, which will be explained in the continue.
3 The Proposed Solution
3.1 Dataset
This section presents the OBGET dataset collection process, dataset statistics, and pattern classification. OBGET dataset doi is: XXX. OBGET dataset participants include students at schools and universities, university employees, and people with mental disorders. Data collection is offline, using a pencil and paper to draw patterns. OBGET dataset samples include metadata such as participants’ age, sex, education, income, being single or married, duration of drawing, and location of patterns on the page. The original Bender Gestalt test covers the age from 4 to 60 years [1] with five subgroups: children ranging from 4 to 11 years, adolescents ranging from 12 to 18 years, young participants from 19 to 35, middle-aged participants from 36 to 50 and old participants from 51 to 60 years.
Participants are asked to sketch a Fig. 1 graphical pattern on an A4 white paper. At the end of the test, information about participants, including age, gender, time duration, and educational degree, is stored in a text file. This is because these metadata may affect the test result and may be needed for different classifications for future works. Then, a laser scanner scans images, and metadata is stored within each test.
Original Bender Gestalt test cards [1]
Table 1 shows the number of adult participants who suffer from brain lesions is 9 (1.77%), psychotic participants are 3 (0.6%), neurotic participants are 177 (35.13%), and healthy participants are 315 (62.5%). Figure 2a shows the chart.
According to Table 1, number of children participants who suffer from anxiety and depression is 88 (28.1%), ADHD 7 (2.2%), learning disorder 11 (3.5%) and healthy participants is 207 (66.1%). The diagram is shown in Fig. 2b.
3.2 The Proposed Semi-automatic Labeling and Data Pre-processing
This section first deals with data pre-processing and then explains labeling [104, 105] data. One of the significant challenges in image auto-labeling is selecting the right features. Feature selection has a meaningful impact on system performance. Usually, a combination of different features is used [106].
Scanned test images are saved in JPG format. Text files are also stored for each test containing participants' test data. Test images are converted into grayscale and size converted to 800 to 600 pixels resolution. In the Original Bender Gestalt test, placing patterns on the page is important and meaningful for psychologists. For detecting the place of patterns on each test, the page is divided into nine sections:
Top left, top middle, top right, bottom left, bottom middle, bottom right, middle left, middle right, and middle as shown in Fig. 3.
In each section, the number of non-white pixels is checked. The drawing is located in a section of the screen where the number of non-white pixels in that section is more significant than any other section. Overlaps of up to three sections may be selected at a time.
Figure 4 shows a raw OBGET dataset sample. A psychologist then reviews participants’ tests to detect mental disorders. OBGET dataset samples are labeled by mental disorder classes by a psychologist: 1-Depression, 2-Anxiety, 3- Stress, 4- Sexual problems, 5-Brain lesion, 6-Neuroticism, 7-Alcoholism, 8-Hysteria, 9-Obsessive, 10-Mental health, 11-Children Depression, 12-Children Anxiety, 13-ADHD, 14-Learning Disorders and 15-Normal Children. Next, each test's nine graphical patterns should be labeled from 1 to 9, corresponding to that pattern. Data is labeled using the proposed semi-automatic labeling method. Draw a rectangle around each pattern to label raw data and assign a class from 1 to 9. The coordinates of the surrounding box pixels are determined and stored in a file. Information includes test name, JPG file, rectangle coordinates (in pixels), and pattern number. Classes consist of nine categories from 1 to 9, shown in Fig. 1.
Each category consists of raw data, participant information in text format, and pattern information for each test in CSV format. The metadata of Fig. 4 is presented in Table 2.
Table 2 specifies patterns, test names, and coordinates of the surrounding boxes for each pattern and category. For example, the first row of the second column in Table 2, “6. JPG, 81, 26, 154, 78, 1,” means that for a test named 6.jpg in the OBGET dataset, 26, 81, and 78 154 are pixels of two coordinates of a surrounding box that belongs to category number 1. Metadata with the same structure is stored for each 817 raw data in the OBGET dataset.
The proposed Semi-automatic labeling algorithm works as follows:
For each sample of the OBGET dataset, the following steps are done for all nine patterns.
-
a)
CSV file is opened
-
b)
User draws a surrounding box around each pattern manually
-
c)
and then the coordinates of surrounding are registered into a CSV file named as same as sample name automatically
-
d)
User inserts class number (1 to 9) manually and it is registered in front of related coordinates in the CSV file automatically
-
e)
to announce end of labeling for each sample user can press zero key
CSV file is saved and closed.
According to description above, a sample of using the proposed semi-automatic labeling program for patterns is shown in Fig. 5.
Each category consists of raw data, participant’s information in text format and patterns information for each sample in CSV format. Metadata of Fig. 5 is presented in Table 2.
As shown in Fig. 5a, user draws a surrounding box around a pattern manually, and the coordinates of surrounding are registered into a CSV file automatically, then in Fig. 5b, user inserts the class number from 1 to 9 manually and it is registered in the CSV file automatically. After all patterns in a sample of the OBGET dataset are labeled semi-automatically, user should announce the program to save and close current CSV file and open a new CSV file for a new dataset sample. To do this task user can select and draw a random surrounding box and insert zero key to finish the current task as shown in Fig. 5c. Inserting zero key announce the proposed semi-automatic labeling program to close and save the current opened CSV file and named the file as same as related pattern. For continuing semi-automatic labeling, user can open another sample of the OBGET dataset and follow mentioned steps.
A part of OBGET dataset is published open access on the reserved https://doi.org/10.17632/62kwttmkcv.1. The comprehensive OBGET dataset samples include raw data, labeled data and metadata of are available if needed and requested.
3.3 Deep Learning
Patterns are classified by applying the proposed modified YOLO V5 as a deep learning method [107] with high accuracy. YOLO V5 [108,109,110] is an object detector [111, 112] which consists of three important parts: a) Backbone, b) Neck, and c) Head.
(a) Backbone extracts important features of an input image. The proposed method applied a CSPFootnote 5 network as a backbone model to extract features from an input image. (b) Neck helps to identify an object at different scales. The proposed modified YOLO V5 deployed PANet as a Neck model, and (c) Head is run for final detection. Generates final output vectors with class probability. Selecting an activation function for any deep neural network is crucial. This research applies Leaky ReLU as an activation function in the middle layer, and the Sigmoid activation function is deployed in the final detection layer. SGD is deployed as an Optimization Function, and Ultralytics calculates Binary Cross-Entropy with the Logit Loss function deploying PyTorch. The final model includes weights, biases, parameters, and gradients.
The proposed modified YOLO V5 classified OBGET dataset samples into nine classes. The proposed modified YOLO V5 includes architecture, activation, optimization, and loss functions. The final model includes weights, biases, parameters, and gradients. The proposed modified YOLO V5 network increased accuracy and solved the problem of detecting pattern number 2. YOLO is the most complete network for object detection and deep learning problems [112, 113].
In the proposed method, the batch size is 10 to analyze 10 images at each step of the training phase. Number of epochs is 300. The medium model is used in the proposed modified YOLO V5 with appropriate results, so there is no need to use large or very large models. Initial weights are random. The learning rate value is initially set to 0.01 and finally to 0.2. SGD momentum value is 0.937. Table 3 shows the initial parameters. The proposed MYOLO V5 algorithm is shown in Algorithm 1 according to [114].
The proposed system architecture is shown in Fig. 6.
The Original Bender Gestalt test visualizes mental states with graphic shapes, and scoring is based on graphics rules, including:
-
Hand-sketching graphic shapes
-
Place of hand-sketching graphic shapes
-
Size of hand-sketching graphic shapes
-
Line thickness of hand-sketching graphic shapes
Associated rules for Tables 4 and 5 are as follows:
-
By calculating the area of the surrounding box for each pattern, the large or small size of patterns can be determined.
-
By examining the existence of a rectangle around each pattern, drawing a box around each pattern is determined.
-
If the distance between two surrounding boxes for two patterns is more or less than the default value, it means that the distance of drawing patterns is near or far.
-
Pattern rotation is detected according to the length and width of the surrounding box for each pattern.
-
Extra drawing occurs if some patterns are not categorized into nine specified classes.
In some patterns, the circle recognition function in OpenCV, imported in Python, is applied to detect dots or dashes instead of circles.
Inferential statistics evaluate OBGET dataset samples running SPSS version 21. The OBGET dataset is evaluated using inferential statistics, including hypothetical tests [103]. Chi-square analysis [103] evaluates descriptive statistics and relationships between variables considered for participants.
Values of all variables are not numerical, so Chi-square is used. Chi-square analysis examines the existence or absence of relationships between variables related to participants when all variables are not numerical.
4 Evaluation
4.1 Evaluating OBGET Pattern Classification Accuracy
Precision, Recall, and mAP criteria evaluated the proposed object detection and pattern classification on the OBGET dataset. The following results are obtained after implementing the proposed modified YOLO V5 on the OBGET dataset. Equations (1) and (2) explain the meaning of Precision and Recall in this research.
Also, the proposed method evaluates Precision and sensitivity (Recall) based on Eqs. 1 and 2.
In object detection, mAP diagrams are obtained from Eq. (3), which provides important results.
4.2 Evaluating OBGESS Accuracy
In this section, the automatic scoring phase of OBGESS is evaluated. This research implements automatic scoring by associated rules according to standard criteria scoring of the Original Bender Gestalt Test applying Open CV.
The diameter element values of the confusion matrix for each column are divided into the total number of tests according to Eq. (4) to calculate the OBGESS accuracy for detecting each mental disorder.
where i and j are counters, aij (i = j) is the elements on the diameter of the matrix, d is the number of rows or columns in the squared matrix, or, in other words, the number of detectable mental disorders by OBGESS mentioned above.
The Eq. \(\frac{{a}_{ij }, (i=j)}{\sum_{i=1}^{d}aij}\times 100\) calculates the OBGESS accuracy for detecting each mental disorder (The percentage of each column) and Eq. (4) calculates OBGESS's total accuracy for detecting all mental disorders detectable by OBGESS (the percentage of all columns). According to Eq. (4), to calculate the total OBGESS accuracy of mental disorder detection, the accuracy of each 15 mental disorders detection accuracy is averaged.
5 Result and Discussion
Segmentation and object detection are performed simultaneously (Fig. 7). YOLO V5 is a real-time approach that processes up to 45 images per second. After running the proposed modified YOLO V5 network on the OBGET dataset and reviewing the results, pattern detection accuracy is 95%. Six hundred fifty-four images were selected for the training and 163 for the testing phases. Figure 8 shows the pattern detection of OBGET dataset samples deploying the proposed modified YOLO V5.
The mental disorder detection phase was performed after pattern detection for each test using the Spider environment in Python. After the classification step, patterns are scored automatically, applying associated rules according to the standard criteria of the Original Bender Gestalt Test.
OBGESS extracts meaningful visual concepts about humans from the Original Bender Gestalt test accurately. This section presents the final phase of OBGESS and associated rules for automatic scoring. These rules are based on standard criteria for mental disorders for adult and children participants described in Tables 4 and 5.
This section discusses descriptive statistics of OBGET, OBGET pattern classification accuracy, and OBGESS accuracy.
5.1 OBGET Inferential Statistics
According to the following results:
-
A significant relationship exists between participants' age and test results (Fig. 9a).
-
There is no significant relationship between participants' gender, income status, marital status, number of children in the family, presence of parents, and test results.
-
There is a significant relation between education and test results variables (Fig. 9b).
5.2 Comparing the Proposed Object Detection Results
In this section, the proposed MYOLO V5 object detection accuracy results are compared with ResNet 50 on the proposed OBGET dataset.
Figure 10 shows the confusion matrix for the proposed MYOLO V5 method.
mAP@0.5 is good for accurately detecting objects. In Fig. 11, mAP@0.5 is 95% and acceptable, indicating powerful object detection. Precision and Recall diagrams at best status are 95%.
Figure 11 shows Precision, Recall, and mAP diagrams obtained from implementing the proposed modified YOLO V5 on the OBGET dataset with 817 data samples.
Also, OBGET dataset samples are labeled manually by a psychologist. OBGESS results and psychology labels can be calculated by comparing OBGESS result's accuracy (Fig. 12).
As shown in Fig. 12, mental disorder detection accuracy was evaluated manually with psychologist detection results and is 90% for the test dataset.
OBGESS accuracy on the OBGET dataset is evaluated. The Confusion Matrix evaluates it. A 7 × 7 square matrix is deployed to evaluate OBGESS accuracy using the proposed MYOLO V5 method (Fig. 13).
Each row and column of the matrix is one of the mental disorders mentioned in Sect. 3.2.
We compared the proposed method MYOLO V5 with other methods in Table 6.
6 Conclusions and Future Work
Mental disorders affect the neural system and patients' daily physical and mental activities. The Original Bender Gestalt drawing test detects psychosomatic disorders. Automatic scoring of visual psychological tests can help detect mental disorders early. Early detection of mental disorders can prevent disorders from getting worse. OBGESS is a psychologist-assistant visual system. Interactive visual systems for detecting mental disorders need a comprehensive dataset of psychological tests. This work presents a comprehensive OBGET dataset.
Pre-processing and the proposed semi-automatic labeling phase were executed on 817 OBGET dataset samples. Chi-square is used to analyze data and consider whether there is a relation between variables for participants. The proposed modified YOLO V5 classifies patterns into nine classes. Each class includes one of the nine patterns from the Original Bender Gestalt drawing test. Pattern classification accuracy is 95%.
For graphical pattern detection and classification, Precision and Recall achieved 95%. Association rules apply for automatic mental disorders detection according to standard criteria. OBGESS accuracy is 90%. For future studies, we will try to automate the process of detecting mental disorders in the training phase at the same time as pattern detection. This is not in a separate phase. We will also attempt to increase OBGET dataset samples.
Availability of Data and Materials
Data are available on demand.
Notes
Clock Drawing Test.
Dataset URL: http://yann.lecun.com/exdb/mnist.
Cross Stage Partial.
References
Bahrami H. (2006) Psychological Tests (Foundations & Practical Techniques) 3trd Edition. In: Tehran, Allame Tabataba’i University Pres 148/3, Iran National Library M77–18239, 150/15195, Bf, 39,/ B 9 A 4, ISBN: 964–8415–45–5
Jiang, Y., Sun, J., Ren, X., Wang, H.: Nutritional Risk Assessment and Countermeasures for Stroke Patients Based on Deep Learning and Multimodal MRI Images. International Journal of Computational Intelligence Systems 16, 72 (2023)
Xia, Y., Yao, Z., Ye, Q., Cheng, N.: A Dual-Modal Attention-Enhanced Deep Learning Network for Quantification of Parkinson’s Disease Characteristics. IEEE Trans. Neural Syst. Rehabil. Eng. 28, 42–51 (2020)
Jiang, C., Li, Y., Tang, Y., Guan, C.: Enhancing EEG-Based Classification of Depression Patients Using Spatial Information. IEEE Trans. Neural Syst. Rehabil. Eng. 29, 566–575 (2021)
Klepl, D., He, F., Wu, M., Blackburn, D.J., Sarrigiannis, P.: EEG-based Graph Neural Network Classification of Alzheimer’s Disease: An Empirical Evaluation of Functional Connectivity Methods. IEEE Trans. Neural Syst. Rehabil. Eng. (2022). https://doi.org/10.1109/TNSRE.2022.3204913
Shao, Z., Cheng, G., Ma, J., Wang, Z., Wang, J., Li, D.: Real-Time and Accurate UAV Pedestrian Detection for Social Distancing Monitoring in COVID-19 Pandemic. IEEE Trans. Multimedia 24, 2069–2083 (2021). https://doi.org/10.1109/TMM.2021.3075566
Moetesum, M., Siddiqi, I., Vincent, N., Cloppet, F.: Assessing Visual Attributes of Handwriting for Prediction of Neurological Disorders - A Case Study on Parkinson’s Disease. Elsevier Journal on Pattern Recognition Letters 121, 19–27 (2019)
Trotzek M, Koitka S, Friedrich CM (2020) Utilizing Neural Networks and Linguistic Metadata for Early Detection of Depression Indications in Text Sequences, IEEE Transactions on Knowledge and Data Engineering, Vol. 32, No. 3, P. 588–601, 1 March 2020, doi: https://doi.org/10.1109/TKDE.2018.2885515
Moetesum, M., Siddiqi, I., Ehsan, S., Vincent, N.: Deformation modeling and classification using deep convolutional neural networks for computerized analysis of neuropsychological drawings. Neural Comput. Appl. 32, 12909–12933 (2020)
Lin H, Jia J, Qiu J, Zhang Y, Shen G, Xie L, Tang J, Feng L, Chua T (2017) Detecting Stress Based on Social Interactions in Social Networks, IEEE Transactions on Knowledge and Data Engineering, Vol. 29, No. 9.
Savargiv M, Bastanfard A (2013) Text material design for fuzzy emotional speech corpus based on persian semantic and structure, 2013 International Conference on Fuzzy Theory and Its Applications (iFUZZY), Taipei, Taiwan, 2013, pp. 380–384, doi: https://doi.org/10.1109/iFuzzy.2013.6825469
Seifallahi, M., Mehraban, A.H., Galvin, J.E., Ghoraani, B.: Alzheimer’s Disease Detection Using Comprehensive Analysis of Timed Up and Go Test via Kinect vol 2 Camera and Machine Learning. IEEE Trans. Neural Syst. Rehabil. Eng. 30, 1589–1600 (2022). https://doi.org/10.1109/TNSRE.2022.3181252
Movahedi, Z., Bastanfard, A.: Toward competitive multi-agents in Polo game based on reinforcement learning. Multimedia Tools and Applications 80, 26773–26793 (2021)
Wohlfahrt-Laymann, J., Hermens, H., Villalonga, C., Vollenbroek-Hutten, M., Banos, O.: Mobile Cogni Tracker: A mobile experience sampling tool for tracking cognitive behavior. J. Ambient. Intell. Humaniz. Comput. 10, 2143–2160 (2019)
Thakur, S., Roy, R.B.: Predicting mental health using smart-phone usage and sensor data. J. Ambient. Intell. Humaniz. Comput. 12, 9145–9161 (2021)
Giuntini, F.T., Cazzolato, M.T., Reis, M., Campbell, A.T., Traina, A.J., Ueyama, J.: A review on recognizing depression in social networks: challenges and opportunities. J. Ambient. Intell. Humaniz. Comput. 11, 4713–4729 (2020)
Wu, Y.: The impact of criminal psychology trend prediction based on deep learning algorithm and three-dimensional convolutional neural network. J. Ambient. Intell. Humaniz. Comput. (2021). https://doi.org/10.1007/s12652-021-03455-8
American Psychiatric Association, (2013) Diagnostic and statistical manual of mental disorders: DSM-5 May 18, 2013, American Psychiatric Association Publishing, 5th edition, Dewey Decimal: 616.89'075, ISBN: 978–0–89042–554–1, LC Class: RC455.2.C4
Geldard D (1989) Basic Personal Counseling: A Training Manual for Counselors, Charles C Thomas Pub Ltd, ISBN-13: 978–0398055400
Halgin R, Whitbourne P, Krauss S (2009) Abonormal psychology: clinical perspectives on psychological disorders, 6th eddition, McGraw Hill Higher Education Publisher, ISBN-13: 978–0070165076
Groth-Marnat G, Wright J (2016) Handbook of Psychological Assessment, 6th Edition, Wiley Publisher, ISBN-13: 978–1118960646
Moetesum M, Zeeshan O, Siddiqi I (2018) Multi-object sketch segmentation using convolutional object detectors, Proc. SPIE 11069, Tenth International Conference on Graphics and Image Processing (ICGIP 2018), 1106929 (6 May 2019); https://doi.org/10.1117/12.2524293
Chang W, Yang Y, Choi S (2019) Bender Gestalt Test Image Recognition with Convolutional Neural Network, Journal of Korea Multimedia Society, Vol. 22, P. 455–462, 1229–7771(pISSN) / 2384–0102(eISSN), https://doi.org/10.9717/kmms.2019.22.4.455
Kumar M, Rana P, Modi S, Tyagi R, Kaur P, Kanwar R, Sekhri T, D'souza M, Khushu S (2019) Aberrant intra and inter network resting state functional connectivity in thyrotoxicosis, Journal of Neuroendocrinology, Vol. 31, https://doi.org/10.1111/jne.12683
Boostani, R., Asadi, F., Mohammadi, N.: Diagnosis of the OCD Patients using Drawing Features of the Bender Gestalt Shapes. Journal of Biomedical Physics Engineering 7, 59–68 (2017)
Vazquez DR, Alonso GR, Gurrola LCG, García RC, Reyes FM (2020) Exploring Convolutional Neural Networks Architectures for the Classification of Hand-Drawn Shapes in Learning Therapy Applications, Computación y Sistemas, Vol. 24, https://doi.org/10.13053/cys-24-4-3359
Caitlin H, William R (2019) Using Convolutional Neural Networks for the Automated Scoring of the Bender-Gestalt Test Ii", South Carolina Junior Academy of Science, 296, https://scholarexchange.furman.edu/scjas/2019/all/296
Zeeshan MO, Siddiqi I, Moetesum M (2021) Two-Step Fine-Tuned Convolutional Neural Networks for Multi-label Classification of Children’s Drawings, Lladós, J., Lopresti, D., Uchida, S. (eds) Document Analysis and Recognition – ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science(), Vol 12822. Springer, Cham. https://doi.org/10.1007/978-3-030-86331-9_21
Golgoneh, A., Tarvirizadzadeh, B.: Development of a Mechatronics System to Real-Time Stress Detection Based on Physiological Signals. Journal of Signals and Data Processing 37, 59–74 (2018). https://doi.org/10.29252/jsdp.15.3.59
Pourmohammadi, S., Maleki, A.: A Fuzzy c-means Clustering Approch for Continuous Stress Detection during Driving. Journal of Signals and Data Processing 14, 129–142 (2017). https://doi.org/10.29252/jsdp.14.4.129
Fathi Ahmadsaraei M, Bastanfard A, Amini A (2021) A survey of Datasets, Data Classification Algorithms, and Automatic detection of mental disorders Systems in Psychological Drawing Test, An Official Publication of the Iranian Society of Machine Vision and Image Processing, https://jmvip.sinaweb.net/article_129851.html, http://jmvip.sinaweb.net/article_129851_403766c0f0c571c335d886cdecd060d9.pdf.
Likforman-Sulem, L., Esposito, A., Faundez-Zanuy, M., Clemencon, S., Cordasco, G.: EMOTHAW: A Novel Database for Emotional State Recognition from Handwriting and Drawing. IEEE Transactions on Human-Machine Systems 47, 273–284 (2017)
Yao, X., She, D., Zhang, H., Yang, J., Cheng, M., Wang, L.: Adaptive Deep Metric Learning for Affective Image Retrieval and Classification. IEEE Trans. Multimedia 23, 1640–1653 (2020)
Yao, X., Zhao, S., Lai, Y., She, D., Liang, J., Yang, J.: APSE: Attention-aware Polarity-Sensitive Embedding for Emotion-based Image Retrieval. IEEE Trans. Multimedia 23, 4469–4482 (2020)
Wang, H., Tang, P., Li, Q., Cheng, M.: Emotion Expression with Fact Transfer for Video Description. IEEE Trans. Multimedia 24, 715–727 (2021)
Lee, I., Kim, D., Lee, S.: 3D Human Behavior Understanding using Generalized TS-LSTM Networks. IEEE Trans. Multimedia 23, 415–428 (2020)
Liny, S., Baiy, M., Liu, F., Shen, L., Zhou, Y.: Orthogonalization-Guided Feature Fusion Network for Multimodal 2D+3D Facial Expression Recognition. IEEE Trans. Multimedia 23, 1581–1591 (2020)
Koelstra, S., Muhl, C., Soleymani, M., Lee, J., Yazdani, A., Ebrahimi, T.: DEAP: A Database for Emotion Analysis, Using Physiological Signals. IEEE Trans. Affect. Comput. 3, 18–31 (2011)
Hermes, M., Albers, F., Böhnke, J.R., Huelmann, G., Maier, J., Stelling, D.: Measurement and structural invariance of cognitive ability tests after computer-based training. Comput. Hum. Behav. 93, 370–378 (2019). https://doi.org/10.1016/j.chb.2018.11.040
Hansen, T.I., Lehn, H., Evensmoen, H.R., Håberg, A.K.: Initial Assessment of Reliability of a Self-Administered Web-Based Neuropsychological Test Battery. Comput. Hum. Behav. 63, 91–97 (2016). https://doi.org/10.1016/j.chb.2016.05.025
Lv M, Liu H, Zhou W, Zheng C, (2020) Efficiency model of micro-course study based on cognitive psychology in the college, Computers in Human Behavior, Vol. 107, https://doi.org/10.1016/j.chb.2019.05.024
Farzanfar, R., Frishkopf, S., Friedman, R., Ludena, K.: Evaluating an automated mental health care system: making meaning of human–computer interaction. Comput. Hum. Behav. 23, 1167–1182 (2007). https://doi.org/10.1016/j.chb.2004.11.015
Thompson, S.B.N., Ennis, E., Coffin, T., Farman, S.: Design and Evaluation of a Computerised Version of the Benton Visual Retention Test. Comput. Hum. Behav. 23, 2383–2393 (2007). https://doi.org/10.1016/j.chb.2006.03.014
Leutner, D., Leopold, C., Sumfleth, E.: Cognitive load and science text comprehension: Effects of drawing and mentally imagining text content. Elsevier Journal of Computers in Human Behavior 25, 284–289 (2009)
Yang H, Guo X, Peng Z, Lai KH (2022) Patient empowerment in an online health platform: Exploring the quadratic effects of patients’ conscious-competence on perceived health status, Computers in Human Behavior, Vol. 136, 107346, https://doi.org/10.1016/j.chb.2022.107346
Herrero, J., Meneses, J.: Short Web-based versions of the perceived stress (PSS) and Center for Epidemiological Studies-Depression (CESD) Scales: a comparison to pencil and paper responses among Internet users. Comput. Hum. Behav. 22, 830–846 (2006). https://doi.org/10.1016/j.chb.2004.03.007
Jin, S.A.: The effects of incorporating a virtual agent in a computer-aided test designed for stress management education: The mediating role of enjoyment. Comput. Hum. Behav. 26, 443–451 (2010). https://doi.org/10.1016/j.chb.2009.12.003
Ruengchaijatuporn N, Chatnuntawech I, Teerapittayanon S, Sriswasdi S, Itthipuripat S, Hemrungrojn S, Bunyabukkana P, Petchlorlian A, Chunamchai S, Chotibut T, Chunharas C (2022) An explainable self-attention deep neural network for detecting mild cognitive impairment using multi-input digital drawing tasks, Alzheimer's Research & Therapy, Vol. 14, https://doi.org/10.1186/s13195-022-01043-2
Furukawa, S., Hirano, S., Yamamoto, T., Asahina, M., Uchiyama, T., Yamanaka, Y., Nakano, Y., Ishikawa, A., Kojima, K., Abe, M., Uji, Y., Higuchi, Y., Horikoshi, T., Uno, T., Kuwabara, S.: Decline in drawing ability and cerebral perfusion in Parkinson’s disease patients after subthalamic nucleus deep brain stimulation surgery. Elsevier Journal of Parkinsonism & Related Disorders 70, 60–66 (2020). https://doi.org/10.1016/j.parkreldis.2019.12.002
Alissa, M., Lones, M.A., Cosgrove, J., Alty, J.E., Jamieson, S., Smith, S.L., Vallejo, M.: Parkinson’s disease diagnosis using convolutional neural networks and figure-copying tasks. Neural Comput. Appl. 34, 1433–1453 (2022). https://doi.org/10.1007/s00521-021-06469-7
Youn YC, Pyun JM, Ryu N, Baek MJ, Jang JW, Park YH, Ahn SW, Shin HW, Park KY, Kim SY (2021) Use of the Clock Drawing Test and the Rey–Osterrieth Complex Figure Test-copy with convolutional neural networks to predict cognitive impairment, Alzheimer's Research & Therapy, Vol. 13, https://doi.org/10.1186/s13195-021-00821-8
Valenzuela SE, Calabrese JB, Ortiz-Medina J, Sánchez CN, (2020) Convolutional neural networks for detection of hand-written drawings, 2020 IEEE ANDESCON, Quito, Ecuador, P. 1–5, doi: https://doi.org/10.1109/ANDESCON50619.2020.9272066
Cheah, W., Hwang, J., Hong, S., Fu, L., Chang, Y., Chen, T., Chen, I., Chou, C.: (2022) A Digital Screening System for Alzheimer Disease Based on a Neuropsychological Test and a Convolutional Neural Network: System Development and Validation. JMIR Med. Inform. 10(3), e31106 (2022). https://doi.org/10.2196/31106PMID:35262497PMCID:8943541
Estella-Nonay E, Bachiller-Mayoral M, Valladares-Rodriguez S, Rincón M (2022) Automatic Diagnosis of Mild Cognitive Impairment Using Siamese Neural Networks. Springer Artificial Intelligence in Neuroscience: Affective Analysis and Health Applications, IWINAC - Lecture Notes in Computer Science, Vol 13258, https://doi.org/10.1007/978-3-031-06242-1_41
Sivasangari, A., Ajitha, P., Rajkumar, I., Poonguzhali, S.: Emotion recognition system for autism disordered people. J. Ambient. Intell. Humaniz. Comput. (2019). https://doi.org/10.1007/s12652-019-01492-y
Bastanfard A, Amirkhani D, Naderi S (2020) A Singing Voice Separation Method from Persian Music Based on Pitch Detection Methods, 2020 6th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS), Mashhad, Iran, 2020, P. 1–7, doi: https://doi.org/10.1109/ICSPIS51611.2020.9349583
Bastanfard A, Fazel M, Kelishami AA, Aghaahmadi M (2010) The Persian Linguistic Based Audio-Visual Data Corpus, AVA II, Considering Coarticulation. In: Boll, S., Tian, Q., Zhang, L., Zhang, Z., Chen, YP.P. (eds) Advances in Multimedia Modeling. MMM 2010. Lecture Notes in Computer Science, vol 5916. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11301-7_30
Savargiv, M., Bastanfard, A.: Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques. Journal of Computer & Robotics 7, 19–25 (2014)
Savargiv, M., Bastanfard, A.: Real-time speech emotion recognition by minimum number of features, 2016 Artificial Intelligence and Robotics (IRANOPEN). Qazvin, Iran 2016, 72–76 (2016). https://doi.org/10.1109/RIOS.2016.7529493
Keshtkar M, Bastanfard A (2015) Determining the best proportion of music genre to be played in a radio program, 2015 7th Conference on Information and Knowledge Technology (IKT), Urmia, Iran, P. 1–7, doi: https://doi.org/10.1109/IKT.2015.7288794
Bastanfard A, Amirkhani D (2020) Detect Hidden Message in Reverse Timestamp in Farsi, 2020 25th International Computer Conference, Computer Society of Iran (CSICC), Tehran, Iran, P. 1–8, doi: https://doi.org/10.1109/CSICC49403.2020.9050082
Vargas A, Díaz P, Zarraonandia T (2020) Using Virtual Reality and Music in Cognitive Disability Therapy, AVI '20: Proceedings of the International Conference on Advanced Visual InterfacesSeptember 2020 Article, No. 35, P. 1–9, https://doi.org/10.1145/3399715.3399916
Bastanfard A, Rezaei NA, Mottaghizadeh M, Fazel M (2010) A Novel Multimedia Educational Speech Therapy System for Hearing Impaired Children, Qiu, G., Lam, K.M., Kiya, H., Xue, XY., Kuo, CC.J., Lew, M.S. (eds) Advances in Multimedia Information Processing - PCM 2010. PCM 2010. Lecture Notes in Computer Science, vol 6298. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15696-0_65
Minoofam, A.H., Bastanfard, A., Keyvanpour, M.: RALF: an adaptive reinforcement learning framework for teaching dyslexic students. Multimedia Tools and Applications 81, 6389–6412 (2022)
Schouten, K., Frasincar, F.: Survey on Aspect-Level Sentiment Analysis. IEEE Trans. Knowl. Data Eng. 28, 813–830 (2016)
Rawassizadeh, R., Momeni, E., Dobbins, C., Gharibshah, J., Pazzani, M.: Scalable Daily Human Behavioral Pattern Mining from Multivariate Temporal Data. IEEE Trans. Knowl. Data Eng. 28, 3098–3112 (2016)
Hashemi, J., Dawson, G., Carpenter, K., Campbell, K., Qiu, Q., Espinosa, S., Marsan, S., Baker, J.P., Egger, H.L., Sapiro, G.: Computer Vision Analysis for Quantification of Autism Risk Behaviors. IEEE Trans. Affect. Comput. 12, 215–226 (2018)
Xu, P., Joshi, C.K.: Bresson X (2022) Multigraph Transformer for Free-Hand Sketch Recognition. IEEE Trans. Neural Netw. Learn. Syst. 33(10), 5150–5161 (2022). https://doi.org/10.1109/TNNLS.2021.3069230
Souillard-Mandar, W., Davis, R., Rudin, C., Au, R., Libon, D.J., Swenson, R., Price, C., Lamar, M., Penney, D.L.: Learning classification models of cognitive conditions from subtle behaviors in the digital Clock Drawing Test. Springer Journal of Machine Learning 102, 393–441 (2016). https://doi.org/10.1007/s10994-015-5529-5
Im YH, Oh SG, Chung MJ, Yu JH, Lee HS, Chang JK, Park DH (2010) A KFD web database system with an object-based image retrieval for family art therapy assessments, Elsevier Journal of the Arts in Psychotherapy, Vol. 37, P.163–171
Kim, S., Kang, H., Kim, K.: Computer determination of placement in a drawing for art therapy assessments. Elsevier Journal of the Arts in Psychotherapy 35, 49–59 (2008)
Kim, S., Bae, J., Lee, Y.: A computer system to rate the color-related formal elements in art therapy assessments. Elsevier Journal of the Arts in Psychotherapy 34, 223–237 (2007)
Gauthier, B., Véronique, P., Lageix, P.: Exploring the dynamics of design fluency in children with and without ADHD using artificial neural networks. Journal on Normal and Abnormal Development in Childhood and Adolescence 22, 238–246 (2016)
Wang, F., Lin, S., Luo, X., Wu, H., Wang, R., Zhou, F.: A Data-Driven Approach for Sketch-Based 3D Shape Retrieval via Similar Drawing-Style Recommendation, Wiley & Sons Ltd. Journal of Computer Graphics Forum 36, 157–166 (2017)
Harbi Z, Hicks Y, Setchi R (2016) Clock Drawing Test Digit Recognition Using Static and Dynamic Features, 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems, Vol. 96, P. 1221–1230, ISSN 1877–0509, https://doi.org/10.1016/j.procs.2016.08.166
Gnanadesigan, N.S., Dhanasegar, N., Ramasamy, M.D., Muthusamy, S., Mishra, O.P., Pugalendhi, G.K., Sundararajan, S.C.M., Ravindaran, A.: An integrated network topology and deep learning model for prediction of Alzheimer disease candidate genes. Soft. Comput. 27, 14189–14203 (2023). https://doi.org/10.1007/s00500-023-08390-8
Mishra, S.D., Dutta, M.: Image-Fused: improved Alzheimer’s disease diagnosis using efficient deep neural network. Soft. Comput. (2023). https://doi.org/10.1007/s00500-023-08108-w
Sekhar, B.V.D.S., Jagadev, A.K.: Efficient Alzheimer’s disease detection using deep learning technique. Soft. Comput. 27, 9143–9150 (2023). https://doi.org/10.1007/s00500-023-08434-z
Alwindawi, A.F., Uçan, O.N., Ibrahim, A.A., Yusuf, A.: Novel semi-supervised learning approach for descriptor generation using artificial neural networks. Soft. Comput. 26, 7709–7720 (2022). https://doi.org/10.1007/s00500-022-06742-4
Sangkloy P, Burnell N, Ham C, Hays J (2016) The Sketchy Database: Learning to Retrieve Badly Drawn Bunnies, Journal of ACM Transactions on Graphics (TOG), Vol. 35, P. 1–12
Lai, C.H., Peng, P.Y.: A Hybrid Deep Learning Method to Extract Multi-features from Reviews and User-Item Relations for Rating Prediction. International Journal of Computational Intelligence Systems 16, 109 (2023)
Salih, N., Ksantini, M., Hussein, N., Ben Halima, D., Abdul Razzaq, A., Ahmed, S.: Prediction of ROP Zones Using Deep Learning Algorithms and Voting Classifier Technique. International Journal of Computational Intelligence Systems 16, 8 (2023)
Moetesum M, Aslam T, Saeed H, Siddiqi I, Masroor U, (2017) Sketch-based Facial Expression Recognition for Human Figure Drawing Psychological Test, 2017 International Conference on Frontiers of Information Technology (FIT), Islamabad, Pakistan, P. 258–263, doi: https://doi.org/10.1109/FIT.2017.00053
Liu, L., Dong, H., Huang, H., Alan, B.C.: No-reference image quality assessment in curvelet domain, Elsevier Journal of. Signal Processing: Image Communication 29, 494–505 (2014)
Liang, Y., Fairhurst, M., Guest, R., Potter, J.: A Learning Model for the Automated Assessment of Hand-Drawn Images for Visuo-Spatial Neglect Rehabilitation. IEEE Trans. Neural Syst. Rehabil. Eng. 18, 560–570 (2010)
Jyotiyana, M., Kesswani, N., Kumar, M.: A deep learning approach for classification and diagnosis of Parkinson’s disease. Springer journal of Soft Computing 26, 9155–9165 (2022). https://doi.org/10.21203/rs.3.rs-254647/v1
Moetesum M, Siddiqi I, Masroor U, Djeddi C, (2015) Automated scoring of Bender Gestalt Test using image analysis techniques, 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, Tunisia, P. 666–670, doi: https://doi.org/10.1109/ICDAR.2015.7333845
Demisse, G., Aouada, D., Ottersten, B.: Deformation Based Curved Shape Representation. IEEE Trans. Pattern Anal. Mach. Intell. 40, 1338–1351 (2018)
Kim, S., Kang, H., Kim, Y.: A computer system for art therapy assessment of elements in structured mandala. Elsevier Journal of the Arts in Psychotherapy 36, 19–28 (2009)
Nazar H, Moetesum M, Ehsan S, Siddiqi I, Khurshid K, Vincent N, McDonald-Maier KD (2017) Classification of Graphomotor Impressions using Convolutional Neural Networks: An Application to Automated Neuro-psychological Screening Tests, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan, 2017, pp. 432–437, doi: https://doi.org/10.1109/ICDAR.2017.78
Li, H., Huang, G., Li, Y., Zhang, X., Wang, Y., Li, J.: SEML: Self-Supervised Information-Enhanced Meta-learning for Few-Shot Text Classification. International Journal of Computational Intelligence Systems 16(1), 111 (2023)
Harbi Z, Hicks Y, Setchi R (2017) Clock Drawing Test Interpretation System, International Conference on Knowledge Based and Intelligent Information and Engineering Systems, KES2017, Vol. 112, P. 1641–1650, https://doi.org/10.1016/j.procs.2017.08.259
Guha A, Kim H, Yi-Luen E (2010) Automated Clock Drawing Test through Machine Learning and Geometric Analysis, Proceedings of the 16th International Conference on Distributed, Multimedia Systems, DMS 2010, October 14–16, 2010, Hyatt, Lodge at McDonald's Campus, Oak Brook, Illinois, USA., Knowledge Systems Institute 2010, ISBN 1–891706–28–4, P. 311–314
Kim H, Cho SY, Yi-Luen E (2011) Computational Clock Drawing Analysis for Cognitive Impairment Screening, ACM Trans., TEI '11: Proceedings of the fifth international conference on Tangible, embedded, and embodied interaction January 2011, P. 297–300, https://doi.org/10.1145/1935701.1935768
Harbi Z, Hicks Y, Setchi R, Bayer A (2015) Segmentation of Clock Drawings Based on Spatial and Temporal Features, 19th International Conference on Knowledge Based and Intelligent Information and Engineering Systems, Vol. 60, P. 1640–1648, https://doi.org/10.1016/j.procs.2015.08.274
Kim, H., Hsiao, C., Yi-Luen, E.: Home-based computerized cognitive assessment tool for dementia screening. Journal of Ambient Intelligence and Smart Environments 4, 429–442 (2012)
Kim H, Young CS, Yi-Luen E (2010) Context-bounded Refinement Filter Algorithm: Improving Recognizer Accuracy of Handwriting in Clock Drawing Test, AAAIWS'10–07: Proceedings of the 7th AAAI Conference on Visual Representations and Reasoning, January 2010, P. 53–60
Afifi, M.: 11K Hands: Gender recognition and biometric identification using a large dataset of hand images. Springer, Multimedia Tools and Applications 78, 20835–20854 (2019)
Matsui, Y., Ito, K., Aramaki, Y., Fujimoto, A., Ogawa, T., Yamasaki, T., Aizawa, K.: Sketch-based manga retrieval using manga109 dataset. Springer, Multimedia Tools and Applications 76, 21811–21838 (2017)
Delavar A. (2001) Applied Probabilities and Statistics in Psychology and Educational Sciences. In: Tehran, Roshd Publications, Iran National Library M78–9.23, 150/15195, BF39/D8M7, ISBN: 968–6115–23–3
Wang, X., He, J., Jin, Z., Yang, M., Wang, Y., Qu, H.: M2Lens: Visualizing and Explaining Multimodal Models for Sentiment Analysis. IEEE Trans. Visual Comput. Graphics 28, 802–812 (2022)
Xiao, Z., Chen, Y., Dou, W., Tao, Z., Chen, L.: MES-P: An Emotional Tonal Speech Dataset in Mandarin with Distal and Proximal Labels. IEEE Trans. Affect. Comput. 13, 408–425 (2022)
Thangakrishnan, M.S., Ramar, K.: RETRACTED ARTICLE: Automated Hand-drawn sketches retrieval and recognition using regularized Particle Swarm Optimization based deep convolutional neural network. J. Ambient. Intell. Humaniz. Comput. 12, 6407–6419 (2021)
Borkin, M.A., Bylinskii, Z., Kim, N.W., Bainbridge, C.M., Yeh, S., Borkin, D., Pfister, H., Oliva, A.: Beyond Memorability: Visualization Recognition and Recall. IEEE Trans. Visual Comput. Graphics 22, 519–528 (2016)
Xu, X., Liu, C., Zheng, Y.: 3D Tooth Segmentation and Labeling Using Deep Convolutional Neural Networks. IEEE Trans. Visual Comput. Graphics 25, 2336–2348 (2019)
Obaidullah, M.S., Halder, C., Santosh, C.K., Das, N., Roy, K.: PHDIndic_11: page-level handwritten document image dataset of 11 official Indic scripts for script identification. Spring Multimed. Tools Appl. 77, 1643–1678 (2018). https://doi.org/10.1007/s11042-017-4373-y
Dong, S., Wang, P., Abbas, K.: A survey on deep learning and its applications. Computer Science Review 40, 100379 (2021)
Zhao, Z., Xu, P., Scheidegger, C., Ren, L.: Human-in-the-loop Extraction of Interpretable Concepts in Deep Learning Models. IEEE Trans. Visual Comput. Graphics 28, 780–790 (2022)
Zhao X, Ni Y, Jia H (2017) Modified Object Detection Method Based on YOLO, Computer Vision. CCCV 2017, Communications in Computer and Information Science, Vol. 773, Springer, Singapore, https://doi.org/10.1007/978-981-10-7305-2_21
Huang Z, Yin Z, Ma Y, Fan C, Chai A (2021) Mobile phone component object detection algorithm based on improved SSD, Procedia Computer Science, 10th International Conference of Information and Communication Technology (ICICT-2020), Vol. 183, P. 107–114, https://doi.org/10.1016/j.procs.2021.02.037
Wang, C., Liu, Q., Li, Y., Gao, M.: LightCSPNet: A Lightweight Network for Image Classification and Objection Detection. International Journal of Computational Intelligence Systems 16, 46 (2023)
Murthy, C.B., Hashmi, M.F., Bokde, N.D.: Geem ZW (2020) Investigations of Object Detection in Images/Videos Using Various Deep Learning Techniques and Embedded Platforms — A Comprehensive Review. Appl. Sci. 10, 3280 (2020). https://doi.org/10.3390/app10093280
Chen, C., Wu, J., Wang, X., Xiang, S., Zhang, S., Tang, Q., Liu, S.: Towards Better Caption Supervision for Object Detection. IEEE Trans. Visual Comput. Graphics 28, 1941–1954 (2022)
Davis W.S., Yen, D.C. (1998) The Information System Consultant's Handbook: Systems Analysis and Design (1st ed.) CRC Press. ISBN-13: 978–0849370014, https://doi.org/10.1201/9781420049107
Fathi, A.M., Bastanfard, A., Amini, A.: Child Psychological Drawing Pattern Detection on OBGET Dataset, a case study on accuracy Based on MYOLO v5 and MResNet 50. Multimedia Tools and Applications (2023). https://doi.org/10.1007/s11042-023-17112-z
Acknowledgements
We gratefully acknowledge Dr. Narges Fathi Ahmadsaraei's cooperation as a psychologist in running and interpreting the Original Bender Gestalt Drawing Test.
Funding
Funding information is not applicable/no funding was received.
Author information
Authors and Affiliations
Contributions
All three authors contributed to writing and reviewing the manuscript. The first author (M. Fathi Ahmadsaraei) has a major contribution in data collection and proposing new algorithm and architecture. The first (M. Fathi Ahmadsaraei) and second (A. Bastanfard) authors have more contributions in writing the manuscript, examining and analyzing the data. The first (M. Fathi Ahmadsaraei), second (A. Bastanfard) and third (A. Amini) authors have contributions in revising the manuscript. All three authors contributed equally to the revision of the manuscript, read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare no conflicts of interest. All authors certify that they have no affiliation with or involvement in any organization or entity with any financial or non-financial interest in the subject matter or materials discussed in this manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fathi Ahmadsaraei, M., Bastanfard, A. & Amini, A. OBGESS: Automating Original Bender Gestalt Test Based on One Stage Deep Learning. Int J Comput Intell Syst 16, 178 (2023). https://doi.org/10.1007/s44196-023-00353-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s44196-023-00353-z