Abstract
Intimate partner violence (IPV) is a problem that has been studied by different researchers to determine the factors that influence its occurrence, as well as to predict it. In Peru, 68.2% of women have been victims of violence, of which 31.7% were victims of physical aggression, 64.2% of psychological aggression, and 6.6% of sexual aggression. Therefore, in order to predict psychological, physical and sexual intimate partner violence in Peru, the database of denouncements registered in 2016 of the “Ministerio de la Mujer y Poblaciones Vulnerables” was used. This database is comprised of 70510 complaints and 236 variables concerning the characteristics of the victim and the aggressor. First of all, we used Chi-squared feature selection technique to find the most influential variables. Next, we applied the SMOTE and random under sampling techniques to balance the dataset. Then, we processed the balanced dataset using cross validation with 10 folds on Multinomial Logistic Regression, Random Forest, Naive Bayes and Support Vector Machines classifiers to predict the type of partner violence and compare their results. The results indicate that the Multinomial Logistic Regression and Support Vector Machine classifiers performed better on different scenarios with different feature subsets, whereas the Naïve Bayes classifier showed inferior. Finally, we observed that the classifiers improve their performance as the number of features increased.
A. Guerrero and V. H. Ayma—Authors contributed equally to this manuscript
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abramsky, T., Watts, C.H., Garcia-Moreno, C., Devries, K., Kiss, L., Ellsberg, M., Heise, L: What factors are associated with recent intimate partner violence? Findings from the WHO multi-country study on women’s health and domestic violence. BMC Pub. Health 11(1), 109 (2011)
Babu, B.V., Kar, S.K.: Domestic violence in Eastern India: factors associated with victimization and perpetration. Pub. Health 124(3), 136–148 (2010)
Belgiu, M., Drăguţ, L.: Random Forest in remote sensing: a review of applications and future directions. ISPRS J. Photogr. Remote Sens. 114, 24–31 (2016)
Berk, R.A., Sorenson, S.B., Barnes, G.: Forecasting domestic violence: a machine learning approach to help inform arraignment decisions. J. Empir. Legal Stud. 13(1), 94–115 (2016)
Bengio, Y., Delalleau, O., Le Roux, N.: The curse of dimensionality for local kernel machines. Technical report, 1258 (2005)
Breiman, L.: Machine Learning, 45(1), 5–32 (2001)
Brignone, L., Gomez, A.M.: Double jeopardy: predictors of elevated lethality risk among intimate partner violence victims seen in emergency departments. Prevent. Med. 103, 20–25 (2017)
Chandrashekar, G., Sahin, F.: A survey on feature selection methods. Comput. Electr. Eng. 40(1), 16–28 (2014)
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
Clark, C.J., Alonso, A., Everson-Rose, S.A., Spencer, R.A., Brady, S.S., Resnick, M.D., Borowsky, I.W., Connett, J.E., Krueger, R.F., Nguyen-Feng, V.N., Feng, S.L., Feng, S.L.: Intimate partner violence in late adolescence and young adulthood and subsequent cardiovascular risk in adulthood. Preventive Med. 87, 132–137 (2016)
Genuer, R., Poggi, J.-M., Tuleau-Malot, C.: Variable selection using Random Forests. Pattern Recogn. Lett. 31(14), 2225–2236 (2010)
Ghosh, D.: Predicting vulnerability of Indian women to domestic violence incidents. Res. Pract. Soc. Sci. 3(1), 48–72 (2007)
Goel, E., Abhilasha, E.: Random Forest: a review. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 7(1), 251–257 (2017)
Hsieh, T.C., Wang, Y.-H., Hsieh, Y.-S., Ke, J.-T., Liu, C.-K., Chen, S.-C.: Measuring the unmeasurable—a study of domestic violence risk prediction and management. J. Technol. Hum. Serv. 36(1), 56–68 (2018). https://doi.org/10.1080/15228835.2017.1417953
Hu, S., Liang, Y., Ma, L., He, Y.: MSMOTE: improving classification performance when training data is imbalanced. In: 2009 Second International Workshop on Computer Science and Engineering (2009). https://doi.org/10.1109/wcse.2009.756
Instituto Nacional de Estadística e informática: Perú: Indicadores de violencia familiar y sexual, 2000–20017 (2017)
Ismi, D.P., Panchoo, S., Murinto, M.: K-means clustering based filter feature selection on high dimensional data. Int. J. Adv. Intell. Inf. 2(1), 38–45 (2016)
Iverson, K., Litwack, S., Pineles, S., Suvak, M., Vaughn, R., Resick, P.: Predictors of intimate partner violence revictimization: the relative impact of distinct PTSD symptoms, dissociation, and coping strategies. J. Traumat. Stress 26(1), 102–110 (2013)
Izmirli, G., Sonmez, Y., Sezik, M.: Prediction of domestic violence against married women in southwestern Turkey. Int. J. Gynecol. Obstet. 127(3), 288–292 (2014)
Jewker, R.: Intimate partner violence causes and prevention. The Lancet- 359(9315), 1423–1429 (2002)
Jia, J., Liu, Z., Xiao, X., Liu, B., Chou, K.-C.: pSuc-Lys: predict lysine succinylation sites in proteins with PseAAC and ensemble random forest approach. J. Theor. Biol. 394, 223–230 (2016). https://doi.org/10.1016/j.jtbi.2016.01.020
Jung, H., Herrenkohl, T.I., Skinner, M.L., Lee, J.O., Klika, J.B., Rousson, A.N.: Gender differences in intimate partner violence: a predictive analysis of IPV by child abuse and domestic violence exposure during early childhood. Violence Against Women 25(8), 903–924 (2019)
Kecman, V.: Support vector machines–an introduction. In Support Vector Machines: Theory and Applications, pp. 1–47. Springer, Heidelberg (2005)
Koning, M., Smith, C.: Decision Trees and Random Forests: A Visual Introduction for Beginners: A Simple Guide to Machine Learning with Decision Trees. Seattle (2017)
Kranjčić, N., Medak, D., Župan, R., Rezo, M.: Machine learning methods for classification of the green infrastructure in city areas. ISPRS Int. J. Geo-Inf. 8, 463 (2019)
Laeheem, K., Boonprakarn, K.: Factors predicting domestic violence among Thai Muslim married couples in Pattani province. Kasetsart J. Soc. Sci. 38(3), 352–358 (2017)
Leonardsson, M., San Sebastian, M.: Prevalence and predictors of help-seeking for women exposed to spousal violence in India–a cross-sectional study. BMC Women’s Health 17(1), 99 (2017)
Longadge, R., Dongre, S.: Class imbalance problem in data mining review. arXiv preprint arXiv:1305.1707 (2013)
Mansilla, M.: Etapas del desarrollo humano. Revista de investigación en Psicología 3(2), 105–116 (2000)
Ministerio de la Mujer y Poblaciones Vulnerables: Impacto y consecuencias de la violencia contra las mujeres. Lima (2017)
Moraes, C.L., de Tavares da Silva, T.S., Reichenheim, M.E., Azevedo, G.L., Dias Oliveira, A.S., Braga, J.U.: Physical violence between intimate partners during pregnancy and postpartum: a prediction model for use in primary health care facilities. Paediatr. Perinat. Epidemiol. 25(5), 478–486 (2011)
Moyano, N., Monge, F.S., Sierra, J.C.: Predictors of sexual aggression in adolescents: Gender dominance vs. rape supportive attitudes. Eur. J. Psychol. Appl. Legal Context 9(1), 25–31 (2017)
Nitze, I., Schulthess, U., Asche, H.: Comparison of machine learning algorithms random forest, artificial neural network and support vector machine to maximum likelihood for supervised crop type classification. In: Fourth International Conference on Geographic Object-Based Image Analysis (GEOBIA), 035, Rio de Janeiro, 7–9 May 2012 (2012)
Parsian, M.: Data Algorithms: Recipes for Scaling Up with Hadoop and Spark. O’Reilly Media, Inc., Sebastopol (2015)
Phan, T.-N., Kappas, M.: Comparison of random forest, k-nearest neighbor, and support vector machine classifiers for land cover classification using sentinel-2 imagery. Sensors 18, 18 (2017). https://doi.org/10.3390/s18010018
Pueyo, A., Redondo Illescas, S.: Predicción de la violencia: Entre la peligrosidad y la valoración del riesgo de violencia. Papeles del Psicólogo 157–173 (2007)
Rachburee, N., Punlumjeak, W.: A comparison of feature selection approach between greedy, IG-ratio, Chi-square, and mRMR in educational mining. In: 7th International Conference on Information Technology and Electrical Engineering (ICITEE) (2015)
Raschka, S.: Naive Bayes and text classification i-introduction and theory. arXiv preprint arXiv:1410.5329 (2014)
Saeys, Y., Inza, I., Larrañaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23(19), 2507–2517 (2007)
Saile, R., Neuner, F., Ertl, V., Catani, C.: Prevalence and predictors of partner violence against women in the aftermath of war: a survey among couples in Northern Uganda. Soc. Sci. Med. 86, 17–25 (2013)
Schafer, K.R., Brant, J., Gupta, S., Thorpe, J., Winstead-Derlega, C., Pinkerton, R., Laughon, K., Ingersoll, K., Dillingham, R.: Intimate partner violence: a predictor of worse HIV outcomes and engagement in care. AIDS Patient Care STDs 26(6), 356–365 (2012)
Sheridan, R.P.: Using random forest to model the domain applicability of another random forest model. J. Chem. Inf. Model. 53(11), 2837–2850 (2013)
Silva, J., Aleman, E.G., Acuña, G.C., Bilbao, O.R., Hernandez-P.H., Castro, B.L., Meléndez, P.A., Neira, D.: Use of artificial neural networks in determining domestic violence predictors. In: International Conference on Swarm Intelligence, pp. 132–141. Springer, Cham, July 2019
Suthar, B., Patel, H., Goswami, A.: A survey: classification of imputation methods in data mining. Int. J. Emerg. Technol. Adv. Eng. 2(1), 309–312 (2012)
Swartout, K.M., Cook, S.L., White, J.W.: Trajectories of intimate partner violence victimization. West. J. Emerg. Med. 13(3), 272 (2012)
Tan, P.N., Steinbach, M., Kumar, V.: Introduction to Data Mining. Pearson Education India (2016)
Ting, K.M.: Confusion Matrix. Encyclop. Mach. Learn. Data Min. 260–260 (2017). https://doi.org/10.1007/978-1-4899-7687-1_50
Tjaden, P., Thoennes, N.: Prevalence, Incidence, and Consequences of Violence Against Women: Findings from the National Violence Against Women Survey. National Institute of Justice Centers for Disease Control and Prevention. Research in Brief (1998)
Wang, Y.: A multinomial logistic regression modeling approach for anomaly intrusion detection. Comput. Secur. 24(8), 662–674 (2005)
Wijenayake, S., Graham, T., Christen, P.: A decision tree approach to predicting recidivism in domestic violence. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 3–15. Springer, Cham, June 2018
Xiang, Y., Xie, Y.: Imbalanced data classification method based on ensemble learning. In: International Conference in Communications, Signal Processing, and Systems, pp. 18–24. Springer, Singapore, July 2018
Xing, E.P., Jordan, M.I., Karp, R.M.: Feature selection for high-dimensional genomic microarray data. In: ICML, vol. 1, pp. 601–608, June 2001
Yin, M., Zeng, D., Gao, J., Wu, Z., Xie, S.: Robust multinomial logistic regression based on RPCA. IEEE J. Sel. Top. Sig. Process. 12(6), 1144–1154 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Guerrero, A., Cárdenas, J.G., Romero, V., Ayma, V.H. (2021). Comparison of Classifiers Models for Prediction of Intimate Partner Violence. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Proceedings of the Future Technologies Conference (FTC) 2020, Volume 2 . FTC 2020. Advances in Intelligent Systems and Computing, vol 1289. Springer, Cham. https://doi.org/10.1007/978-3-030-63089-8_30
Download citation
DOI: https://doi.org/10.1007/978-3-030-63089-8_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63088-1
Online ISBN: 978-3-030-63089-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)