Abstract
In this chapter, we examine the k-fold cross-validation for small sample method (Method 1) by combining the resampling technique with k-fold cross-validation. By this breakthrough, we obtain the error rate means, M1 and M2, for the training and validation samples, respectively, and the 95 % CI of the discriminant coefficient and the error rate. Moreover, we propose a straightforward and powerful model selection procedure where we select the model with minimum M2 as the best model. We apply Method 1 and model selection procedure to the pass/fail determination using examination scores. By setting the intercept to one for seven LDFs, we obtain several good results, as follows: (1) M2 of Fisher’s LDF is over 4.6 % worse than Revised IP-OLDF. (2) The soft-margin SVM (S-SVM) for penalty c = 1 (SVM) is worse than other five MP-based LDFs and logistic regression. (3) We obtain the 95 % CI of the discriminant coefficients. If we select the coefficient median of seven LDFs, with the exception of Fisher’s LDF, the coefficient median is almost the same as the trivial LDF for the linearly separable model. (4) Although these datasets are LSD and show the natural feature-selection, we do not introduce this theme because there are only four independent variables.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cox DR (1958) The regression analysis of binary sequences (with discussion). J Roy Stat Soc B 20:215–242
Fisher RA (1936) The use of multiple measurements in taxonomic problems. Ann Eugenics 7:179–188
Fisher RA (1956) Statistical methods and statistical inference. Hafner Publishing Co, New Zealand
Friedman JH (1989) Regularized discriminant analysis. J Am Stat Assoc 84(405):165–175
Goodnight JH (1978) SAS technical report—the sweep operator: its importance in statistical computing—(R100). SAS Institute Inc, UK
Lachenbruch PA, Mickey MR (1968) Estimation of error rates in discriminant analysis. Technometrics 10:1–11
Miyake A, Shinmura S (1976) Error rate of linear discriminant function. In: Dombal FT, Gremy F (ed). North-Holland Publishing Company, The Netherland, pp 435–445
Miyake A, Shinmura S (1979) An algorithm for the optimal linear discriminant functions. Proceedings of the international conference on cybernetics and society, pp 1447–1450
Miyake A, Shinmura S (1980) An algorithm for the optimal linear discriminant function and its application. Jpn Soc Med Electron Biol Eng 18(6):452–454
Nomura Y, Shinmura S (1978) Computer-assisted prognosis of acute myocardial infarction. MEDINFO 77, In: Shires, W (ed) IFIP. North-Holland Publishing Company, The Netherland, pp 517–521
Sall JP (1981) SAS regression applications. SAS Institute Inc., USA. (Shinmura S. translate Japanese version)
Sall JP, Creighton L, Lehman A (2004) JMP start statistics, third edition. SAS Institute Inc., USA. (Shinmura S. edits Japanese version)
Schrage L (1991) LINDO—an optimization modeling systems. The Scientific Press, UK. (Shinmura S. & Takamori, H. translate Japanese version)
Schrage L (2006) Optimization modeling with LINGO. LINDO Systems Inc., USA. (Shinmura S. translates Japanese version)
Shimizu T, Tsunetoshi Y, Kono H, Shinmura S (1975). Classification of subjective symptoms of junior high school Students affected by photochemical air pollution. J Jpn Soc Atmos Environ 9/4:734–741. Translated for NERCLibrary, EPA, from the original Japanese by LEO CANCER Associates, P.O.Box 5187 Redwood City, California 94063, 1975 (TR 76-213)
Shinmura S, Miyake A (1979) Optimal linear discriminant functions and their application. COMPSAC 79:167–172
Shinmura S (1984) Medical data analysis, model, and OR. Oper Res 29(7):415–421
Shinmura S, Iida K, Maruyama C (1987) Estimation of the effectiveness of cancer treatment by SSM using a null hypothesis model. Inf Health Social Care 7(3):263–275. doi:10.3109/14639238709010089
Shinmura S (1998) Optimal linear discriminant functions using mathematical programming. J Jpn Soc Comput Stat 11/2:89–101
Shinmura S, Tarumi T (2000) Evaluation of the optimal linear discriminant functions using integer programming (IP-OLDF) for the normal random data. J Jpn Soc Comput Stat 12(2):107–123
Shinmura S (2000a) A new algorithm of the linear discriminant function using integer programming. New Trends Prob Stat 5:133–142
Shinmura S (2000b) Optimal linear discriminant function using mathematical programming. Dissertation, 200:1–101, Okayama University, Japan
Shinmura S, Suzuki T, Koyama H, Nakanishi K (1983) Standardization of medical data analysis using various discriminant methods on a theme of breast diseases. MEDINFO 83, In: Van Bemmel JH, Ball MJ, Wigertz O (ed). North-Holland Publishing Company, The Netherland, pp 349–352
Shinmura S (2001) Analysis of effect of SSM on 152,989 cancer patient. ISI2001, pp 1–2. doi:10.13140/RG.2.1.30779281
Shinmura S (2003) Enhanced algorithm of IP-OLDF. ISI2003 CD-ROM, pp 428–429
Shinmura S (2004) New algorithm of discriminant analysis using integer programming. IPSI 2004 pescara VIP conference CD-ROM, pp 1–18
Shinmura S (2005) New age of discriminant analysis by IP-OLDF—beyond Fisher’s linear discriminant function. ISI2005, pp 1–2
Shinmura S (2007) Overviews of discriminant function by mathematical programming. J Jpn Soc Comput Stat 20(1-2):59–94
Shinmura S (2010a) The optimal linearly discriminant function (Saiteki Senkei Hanbetu Kansuu). Union of Japanese Scientist and Engineer Publishing, Japan
Shinmura S (2010b) Improvement of CPU time of Revised IP-OLDF using linear programming. J Jpn Soc Comput Stat 22(1):39–57
Shinmura S (2011a) Beyond Fisher’s linear discriminant analysis—new world of the discriminant analysis. 2011 ISI CD-ROM, pp 1–6
Shinmura S (2011b) Problems of discriminant analysis by mark sense test data. Jpn Soc Appl Stat 40(3):157–172
Shinmura S (2013) Evaluation of optimal linear discriminant function by 100-fold cross-validation. ISI CD-ROM, pp 1–6
Shinmura S (2014a) End of discriminant functions based on variance-covariance matrices. ICORE2014, pp 5–16
Shinmura S (2014b) Improvement of CPU time of linear discriminant functions based on MNM criterion by IP. Stat Optim Inf Comput 2:114–129
Shinmura S (2014c) Comparison of linear discriminant functions by K-fold cross-validation. Data Anal 2014:1–6
Shinmura S (2015a) The 95 % confidence intervals of error rates and discriminant coefficients. Stat Optim Inf Comput 2:66–78
Shinmura S (2015b) A trivial linear discriminant function. Stat Optim Inf Comput 3:322–335. doi:10.19139/soic.20151202
Shinmura S (2015c) Four serious problems and new facts of the discriminant analysis. In: Pinson E, Valente F, Vitoriano B (ed) Operations research and enterprise systems, pp 15–30. Springer, Berlin (ISSN: 1865-0929, ISBN: 978-3-319-17508-9, doi:10.1007/978-3-319-17509-6)
Shinmura S (2015d) Four problems of the discriminant analysis. ISI 2015:1–6
Shinmura S (2016a) The best model of Swiss banknote data. Stat Optim Inf Comput 4:118–131. International Academic Press (ISSN: 2310-5070 (online) ISSN: 2311-004X (print), doi: 10.19139/soic.v4i2.178)
Shinmura S (2016b) Matroska feature selection method for microarray data. Biotechno 2016:1–8
Shinmura S (2016c) Discriminant analysis of the linear separable data—Japanese automobiles. J Stat Sci Appl X, X:0–14
VapnikV (1995) The nature of statistical learning theory. Springer, Berlin
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media Singapore
About this chapter
Cite this chapter
Shinmura, S. (2016). Pass/Fail Determination Using Examination Scores. In: New Theory of Discriminant Analysis After R. Fisher. Springer, Singapore. https://doi.org/10.1007/978-981-10-2164-0_5
Download citation
DOI: https://doi.org/10.1007/978-981-10-2164-0_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2163-3
Online ISBN: 978-981-10-2164-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)