Skip to main content
Log in

A method for automatic classification of gender based on text- independent handwriting

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Handwriting recognition is used for the prediction of various demographic traits such as age, gender, nationality, etc. Out of all the applications gender prediction is mainly admired topic among researchers. The relation between gender and handwriting can be seen from the physical appearance of the handwriting. This research work predicts gender from handwriting using the landmarks of differences between the two genders. We use the shape or visual appearance of the handwriting for extracting features of the handwriting such as slanteness (direction), area (no of pixels occupied by text), perimeter (length of edges), etc. Classification is carried out using the Support Vector Machine (SVM) as a classifier which transforms the nonlinear problem into linear using its kernel trick, logistic regression, KNN and at the end to enhance the classification rates we use Majority Voting. The experimental results obtained on a dataset of 282 writers with 2 samples per writer shows that the proposed method attains appealing performance on writer detection and text-independent environment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22

Similar content being viewed by others

References

  1. Ahmed M, Ghulam A, Afzal H, Siddiqi I (2017) Improving handwriting based gender classification using ensemble classifiers. Expert Syst Appl 85:158–168

    Article  Google Scholar 

  2. Akbari Y, Nouri K, Sadri J, Djeddi C, Siddiqi I (2017) Wavelet-based gender detection on off-line handwritten documents using probabilistic finite state automata. IMAVIS (Image and Vision Computing) 59(C):7–30

    Google Scholar 

  3. Al M S, Hassaïne A (2012) ICFHR2012 Competition on writer identification - challenge 2 : arabic scripts. 2012 Int Conf Front Handwriting Recognition 831–836. https://doi.org/10.1109/ICFHR.2012.218

  4. Al Maadeed S, Ayouby W, Hassaïne A, Mohamad Aljaam J (2012) QUWI: an Arabic and English handwriting dataset for offline writer identification. IIEEE:nternational Conference on Frontiers in Handwriting Recognition QUWI 95(15):742–747

    Google Scholar 

  5. Al Maadeed S, Hassaine A (2014) Automatic prediction of age, gender, and nationality in offline handwriting. Eurasip J Image Vid Process:1–10

  6. Altman N, Krzywinski M (2015) Simple linear regression. BMJ (Online) 346(7904):999–1000

    Google Scholar 

  7. Atanasiu V, Likforman-sulem L, Vincent N (2011) Writer retrieval — exploration of a novel biometric scenario using perceptual features derived from script orientation. International Conference on Document Analysis and Recognition 1:628–632

  8. Atmaja RD, Murti MA, Halomoan J, Suratman FY (2016) An image processing method to convert RGB image into binary. Indones J Electr Eng Comput Sci 3(2):377–382

    Article  Google Scholar 

  9. Bangare SL, Dubal A, Bangare PS, DSTP (2015) Reviewing Otsu ’ s method for image Thresholding. Int J Appl Eng Res 10(August 2016):21777–21783

    Article  Google Scholar 

  10. Beech JR,Mackintosh IC (2005) Do differences in sex hormones affect handwriting style? Evidence from digit ratio and sex role identity as determinants of the sex of handwriting. Pers Individ Differ 39(2):459–468

  11. Benchamardimath B (2015) A study on the importance of image processing and its applications. Int J Res Eng Technol 03(15):155–160

  12. Bouadjenek N, Nemmour H, Chibani Y (2017) Writer’s gender classification using HOG and LBP features. Recent advances in electrical engineering and control applications. ICEECA. Lect Notes Electr En 411:317–325. https://doi.org/10.1007/978-3-319-48929-2_24

  13. Boyadzhieva D, Gluhchev G (2014) A combined method for on-line signature verification. Cybern Inf Technol 14(2):92–97. https://doi.org/10.2478/cait-2014-0022

  14. Bradley AP (1997) The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recogn 30(7):1145–1159

    Article  Google Scholar 

  15. Burr V (2002) Judging gender from samples of adult handwriting: accuracy and use of cues. J Soc Psychol 142(6):691–700. https://doi.org/10.1080/00224540209603929

  16. Chanda S, Franke K, Pal U (2012) Text independent writer identification for Oriya script. 10th IAPR International Workshop on Document Analysis Systems 369–373. https://doi.org/10.1109/DAS.2012.86

  17. Chanda S, Franke K, Pal U, Wakabayashi T (2010) text independent writer identification for Bengali script. Int Conf Pattern Recognition Text 2005–2008

  18. Cutler A (2010) Random forests for survival, Regression Classification 1–129

  19. Dinehart LH (2015) Handwriting in early childhood education: current research and future implications. Journal of Early Childhood Literacy 15(1):97–118

  20. Djeddi C (2015) ICDAR2015 competition on multi-script writer identification and gender classification using ’ QUWI ’ database. 13th international conference on document analysis and recognition (ICDAR) 1191–1195

  21. Djeddi C, Gattal A, Souici-Meslati L, Siddiqi I, Chibani Y, El Abed H (2014) LAMIS-MSHD: a multi-script offline handwriting database. Proceedings of international conference on Frontiers in handwriting recognition, ICFHR 2014-Decem (September):93–97

  22. Duda RO, Hart PE (1972) Use of the Hough transformation to detect lines and curves in pictures. Commun ACM 15(1):11–15

    Article  Google Scholar 

  23. Fakhr MW (2011) Arabic optical character recognition (OCR) systems overview. Doi: https://doi.org/10.13140/2.1.3898.3682

  24. Fornes A, Llados J, Sanchez G (2008) Writer identification in old handwritten music scores. Work Doc Anal Syst 347–353

  25. Gattal A, Djeddi C, Bensefia A, Ennaji A (2020) Handwriting based gender classification using cold and hinge features. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) doi: https://doi.org/10.1007/978-3-030-51935-3_25

  26. Gattal A, Djeddi C, Siddiqi I, Chibani Y (2018) Gender classification from offline multi-script handwriting images using oriented basic image features (oBIFs). Expert Syst Appl 99(February):155–167

    Article  Google Scholar 

  27. Gupta B, Uttarakhand P, Rawat IA (2017) Analysis of various decision tree algorithms for classification in data mining. Int J Comput Appl 163(8):975–8887

    Google Scholar 

  28. Hadavandkhani F, Bahrami H (2008) Handwriting difficulties: introducing an instrument. Iran Rehabil 6(7):39–46

    Google Scholar 

  29. Hamid S, Loewenthal KM (1996) The journal of social inferring gender from handwriting in Urdu and English. J Soc Psychol 136(January 2015):37–41

    Google Scholar 

  30. Hartley J (1991) Sex. Differ Handwriting : Comment \Spear 17(2):141–145

    MathSciNet  Google Scholar 

  31. Hassaïne A, Maadeed S Al, Aljaam J, Jaoua A (2013) ICDAR2013 - Competition on Gender Prediction from Handwriting. doi: https://doi.org/10.1109/ICDAR.2013.286

  32. Hayes WN (1996) Identifying sex from handwriting. Percept Mot Skills 83:791–800

    Article  Google Scholar 

  33. Hazra TK, Sarkar R, Kumar A (2016) Handwritten English character recognition using logistic regression and neural network. Int J Sci Res (IJSR) 5(6):750–754

    Article  Google Scholar 

  34. Huanrui H (2016) New mixed kernel functions of SVM used in pattern recognition. Cybern Inf Technol 16(5):5–14

    MathSciNet  Google Scholar 

  35. Jaeger S, Jaeger S, Manke S, Reichert J, Waibel A (2001) Online handwriting recognition : the NPen ++ recognizer. Int J Doc Anal Recognit 3(May 2017):169–180

    Article  Google Scholar 

  36. Jeyalaksshmi S, Prasanna S (2017) Measuring distinct regions of grayscale image using pixel values. Int J Eng Technol 7(1.1):121–124

  37. Jung Y, Hu J (2015) A K-fold averaging cross-validation procedure. J Nonparametric Stat 27(2):167–179

  38. Kaira A (2017) The discovery of Beta-blockers. J Practice Cardiovasc Sci 3:36–38

    Article  Google Scholar 

  39. Kameya H, Mori S, Oka R (2006) A segmentation-free biometric writer verification method based on continuous dynamic programming. Pattern Recogn Lett 27(6):567–577

    Article  Google Scholar 

  40. Karlik B (2015) Performance analysis of various activation functions in generalized MLP architectures of neural networks. Int J Artif Intell Expert Syst 1(4):111–122

    Google Scholar 

  41. Koppenhaver KM (2007) Forensic document examination. Principles and practice 207–208

  42. Kumar G, Bhatia PK (2014) A detailed review of feature extraction in image processing systems. International conference on advanced computing and communication technologies, ACCT 5–12

  43. Kumar T, Verma K (2010) A theory based on conversion of RGB image to Gray image. Int J Comput Appl 7(2):5–12

    Google Scholar 

  44. Liwicki M, Schlapbach A, Bunke H (2010) Automatic gender detection using on-line and off-line information. Pattern Anal Applic 14(1):87–92

  45. Liwicki M, Schlapbach A, Bunke H, Bengio S, Mariéthoz J, Richiardi J (2006) Writer identification for smart meeting room systems. Lect Notes Comput Sci 186–195

  46. Liwicki M, Schlapbach A, Loretan P, Bunke H (2007) Automatic detection of gender and handedness from on-line handwriting. J Soc Psychol (march):179–183

  47. Mahanta LB, Deka A (2013) Skew and slant angles of handwritten signature. 2030–2034

  48. Mahmoud SA, Ahmad I, Al-Khatib WG, Alshayeb M, Tanvir Parvez M, Märgner V, Fink GA (2014) KHATT: an open Arabic offline handwritten text database. Pattern Recogn 47(3):1096–1112

    Article  Google Scholar 

  49. Maken P, Gupta A (2019) Review of gender prediction based on handwriting recognition using computer vision algorithms. Int J Sci Res 6(5):206–211

  50. Maken P, Gupta A, Gupta MK (2019) A Study on various techniques involved in gender prediction system: a comprehensive review. Cybern Inf Technol 19(2):51–73

    Google Scholar 

  51. Marti U-V, Bunke H (1999) A full English sentence database for off-line handwriting recognition. International Conference on Document Analysis and Recognition 705–708

  52. Marti UV, Bunke H (2003) The IAM-database: an English sentence database for offline handwriting recognition. Int J Doc Anal Recognit 5(1):39–46

    Article  Google Scholar 

  53. McLachlan GJ (1999) Mahalanobis distance. Resonance 4(6):20–26. https://doi.org/10.1007/BF02834632

  54. Mente R, Dhandra BV, Mukarambi G (2014) Image recognition using shape descriptor: eccentricity and color. BMRD’s J Manag Res 3(1):210–216

  55. Mirza A, Moetesum M, Siddiqi I (2016) ICFHR2016 Competition on Multi-script Writer Demographics Classification Using “ QUWI “. 15th International Conference on Frontiers in Handwriting Recognition ICFHR2016. , pp 602–606

  56. Mirza A, Moetesum M, Siddiqi I, Djeddi C (2017) Gender classification from offline handwriting images using textural features. Proceedings of international conference on Frontiers in handwriting recognition 395–398

  57. Pali V, Goswami S, Bhaiya LP (2014) An extensive survey on feature extraction techniques for facial image processing. International Conference on Computational Intelligence and Communication Networks 142–148. https://doi.org/10.1109/CICN.2014.43

  58. Patil AB, Shaikh J (2016) OTSU Thresholding method for flower image segmentation. Int J Comput Eng Sci 6(5):1–6

  59. Purkait P, Kumar R, Chanda B (2010) Writer Identification for Handwritten Telugu Documents using Directional Morphological Features. International Conference on Frontiers in Handwriting Recognition 658–663

  60. Raid A, Khedr W, El-dosuky M, Aoud M (2014) Image restoration based on morphological operations. Int J Comput Sci Eng Inf Technol 4(3):9–21

    Google Scholar 

  61. Rastogi P, Pillai MKRA study of fingerprints in relation to gender and blood group. J Indian Acad Forensic Med 32(1):11–14

  62. Razak Z, Zulkiflee K, Yamani M, Idris I, Tamil EM, Noorzaily M, Noor M, Salleh R, Yaacob M, Yusof ZM (2008) Off-line handwriting text line segmentation: a review. Int J Comput Sci Netw Secur 8(7):12–20

    Google Scholar 

  63. Sapp M, Obiakor F, Scholze S (2007) Mahalanobis distance : a multivariate of effect in hypnosis research. Sleep hynosis 9(2):67–70

    Google Scholar 

  64. Saravanan C (2010) Color image to grayscale image conversion. 2010 2nd international conference on computer engineering and applications. ICCEA 2010(2):196–199

    Google Scholar 

  65. Sathya R, Abraham A (2013) Comparison of supervised and unsupervised learning algorithms for pattern classification. Int J Adv Res Artif Intell 2(2):34–38

    Article  Google Scholar 

  66. Schomaker L, Franke K, Bulacu M (2006) Using codebooks of fragmented connected-component contours in forensic and historic writer identification. Pattern Recognit Lett 28(6):719–727

  67. Matjaz M, Zoran B (2013) ROC analysis of classifiers in machine learning: a survey May. Intell Data Anal 17(3):531–558. https://doi.org/10.3233/IDA-130592

  68. Ghosha S, Shivakumarab P, Roya P, Pala U Luc T (2020) Graphology based handwritten character analysis for human behavior identification. CAAI Transactions on Intelligence Technology 5(1):1–11. https://doi.org/10.1049/trit.2019.0051

  69. Shabani S, Norouzi Y, Fariborz M (n.d.) Handwritten objects recognition using regularized logistic regression and feedforward neural

  70. Sharma V (2014) Object counting using MATLAB. Int J Sci Eng Res 5(3):614–616

  71. Sharma A, Khare S, Chavan S (2017) A review on handwritten character recognition 1 1,2,3. 8491:71–75

  72. Siddiqi I, Cloppet F (September 2014) Vincent N (2009) contour based features for the classification of ancient manuscripts. Advances Graphonomics: Proceedings of IGS 2009:226–229

    Google Scholar 

  73. Siddiqi I, Djeddi C, Raza A (2015) Automatic analysis of handwriting for gender classification. Pattern Anal Applic 18(4):887–899

    Article  MathSciNet  Google Scholar 

  74. Sinwar D, Kaushik R (2014) Study of Euclidean and Manhattan distance metrics using simple K-means clustering. Int J Res Appl Sci Eng Technol 2(V):270–274

    Google Scholar 

  75. Spear MG (1989) Differences between the written work of boys and girls. Br Educ Res J 15(3):271–277. https://www.jstor.org/stable/1501263

  76. Strecht P, Cruz L, Soares C, Mendes-Moreira J, Abreu R (2015) PER-08: a comparative Study of classification and regression algorithms for modelling students’ academic performance. Portugal 392–395

  77. Tanvir Parvez M, Mahmoud SA (2013) Arabic handwriting recognition using structural and syntactic pattern attributes. Pattern Recogn 46(1):141–154

    Article  Google Scholar 

  78. Tett RP, Palmer CA (1997) The validity of handwriting elements in relation to self-report personality trait measures. Personal Individ Differ 22(1):11–18

    Article  Google Scholar 

  79. Ting KM (2017) Confusion matrix. Encyclopedia Mach Learn Data Min 260–260

  80. Upadhyay S, Singh J, Shukla SK (2017) Determination Sex Through Handwriting Character 9(13):11–18

    Google Scholar 

  81. Visa Sofia D (2011) Confusion matrix-based feature selection Sofia Visa. ConfusionMatrix-based Feature Selection Sofia 710(May 2014):8

    Google Scholar 

  82. Wang X, Ding X (2004) An effective writer verification algorithm using negative samples. Ninth International Workshop on Frontiers in Handwriting Recognition 509–513. https://doi.org/10.1109/IWFHR.2004.14

  83. Xie Q, Xu Q (2013) Gender prediction from handwriting data mining course project -fall 2013. International Conference on Document Analysis and Recognition 10–13. https://www.kaggle.com/c/icdar2013-gender-prediction-from-handwriting

  84. Yilmaz MB, Yanikoglu B, Tirkaz C, Kholmatov A (2011) Offline signature verification using classifier combination of HOG and LBP features. 2011 international joint conference on biometrics, IJCB 2011. doi: https://doi.org/10.1109/IJCB.2011.6117473

Download references

Funding

Not received.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abhishek Gupta.

Ethics declarations

Conflicts of interest/Competing interests

Authors declare that there is no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Maken, P., Gupta, A. A method for automatic classification of gender based on text- independent handwriting. Multimed Tools Appl 80, 24573–24602 (2021). https://doi.org/10.1007/s11042-021-10837-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-10837-9

Keywords

Navigation