Abstract
This paper reviews Prof. Kai-Tai Fang’s contributions in data mining. Since the 1970s, Prof. Fang has been committed to applying statistical ideas and methods to deal with large amounts of data in practical projects. By analyzing more than 400,000 pieces of data, he found representative clothing indicators and established the first adult clothing standard in China; through cleaning and modeling steel-making data from steel mills all over the country, he revised the national standard for alloy structural steel; by studying various data in chemometrics, he introduced many new advanced statistical methods to improve the identification and classification of chemical components, established more effective models for the relationship between quantitative structure and activity, and promoted the application of the traditional Chinese medicine (TCM) fingerprint in TCM quality control. Professor Fang and his team’s research achievements in data mining have been highly appreciated by relevant experts. This article is written to celebrate the 80th birthday of Prof. Kaitai Fang.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Atkinson, A.C., Bogacka, B., Bogacki, M.B.: \(D\)- and \(T\)-optimum designs for the kinetics of a reversible chemical reaction. Chemom. Lab. Syst. 43, 185–198 (1998)
Cox, D.R.: Note on grouping. J. Am. Stat. Assoc. 52, 543–547 (1957)
Fang, K.T.: Using conditional distributions to formulate a national clothing standard. J. Appl. Math. 2, 63–74 (1976). (Chinese)
Fang, K.T.: Uniform design—application of number theory method in experimental design. J. Appl. Math. 3, 363–372 (1980). (Chinese)
Fang, K.T.: Endless and unbending journey to statistics research: the oral autobiography of Kai-tai Fang. Hunan Education Press (2016) (in Chinese)
Fang, K.T., He, S.: The problem of selecting a given number of representative points in a normal population and a generalized mill’s ratio. Technical report, No. 5, Department of Statistics, Stanford University, USA (1982)
Fang, K.T., Wang, Y.: Number-Theoretic Methods in Statistics. Chapman and Hall, London (1994)
Fang, K.T., Wu, C.Y.: A probability extremum problem. J. Appl. Math. 2, 132–148 (1979) (in Chinese)
Fang, K.T., Zhang Y.T.: Introduction to Multivariate Statistical Analysis. Science Press (1982) (in Chinese)
Fang, K.T., Bentler, P.M., Yuan, K.H.: Applications of number-theoretic methods to quantizers of elliptically contoured distributions. In: Multivariate Analysis and Its Applications, IMS Lecture Notes—Monograph Series, pp. 211–225 (1994)
Fang, K.T., Liang, Y.Z., Yin, X.L., Chen, K., Lu, G.H.: Critical value determination on similarity of fingerprints. Chemom. Intell. Lab. Syst. 82, 236–240 (2006)
Fang, K.T., Liang, Y.Z., Yu, R.Q. (eds.): Data Mining and Bioinformatics in Chemistry and Chinese Medicines. Hong Kong Baptist University (2003)
Fang, K.T., Liang, Y.Z., Yu, R.Q. (eds.): Data Mining and Bioinformatics in Chemistry and Chinese Medicines, vol. 2, Hong Kong Baptist University (2004)
Fang, K.T., Xiang, K.F., Liu, G.Y.: Precision of Test Method. China Standard Press, Beijing (1988)
Fang, K.T., Yin, H., Liang, Y.Z.: New approach by Kriging methods to problems in QSAR. J. Chem. Inform. Model. 44, 2106–2113 (2004)
Fang, K.T., Zhou, M., Wang, W.J.: Applications of the representative points in statistical simulations. Sci. China Ser. A 57, 2609–2620 (2014)
He, P., Fang, K.T., Xu, C.J.: The classification tree combined with SIR and its applications to classification of mass spectra. J. Data Sci. 1, 425–445 (2003)
He, P., Fang, K.T., Liang, Y.Z., Li, B.Y.: A Generalized boosting algorithm and its application to two-class chemical classification problem. Analytica Chimica Acta 543, 181–191 (2005)
He, P., Xu, C.J., Liang, Y.Z., Fang, K.T.: Improving the classification accuracy in chemistry via boosting technique. Chem. Intell. Lab. Syst. 70, 39–46 (2004)
Hu, Q.N., Liang, Y.Z., Fang, K.T.: The matrix expression, topological index and atomic attribute of molecular topological structure. J. Data Sci. 1, 361–389 (2003)
Hu, Q.N., Liang, Y.Z., Peng, X.L., Yin, H., Fang, K.T.: Structural interpretation of a topological index. 1. External factor variable connectivity index (EFVCI). J. Chem. Inf. Comput. Sci. 44, 437–446 (2004)
Hu, Q.N., Liang, Y.Z., Yin, H., Peng, X.L., Fang, K.T.: Structural interpretation of a topological index. 2. The molecular connectivity index, the Kappa index, and the atom-type E-state index. J. Chem. Inf. Comput. Sci. 44, 1193–1201 (2004)
Lee, A.W.M., Chan, W.F., Yuen, F.S.Y., Tse, P.K., Liang, Y.Z., Fang, K.T.: An example of a sequential uniform design: application in capillary electrophoresis. Chem. Intell. Lab. Syst. 39, 11–18 (1997)
Liang, Y.Z., Fang, K.T.: Robust multivariate calibration algorithm based on least median squares and sequential number theoretic optimization method. Anal. Chem. 121, 1025–1029 (1996)
Liang, Y.Z., Fang, K.T., Xu, Q.S.: Uniform design and its applications in chemistry and chemical engineering. Chem. Intell. Lab. Syst. 58, 43–57 (2001)
Peng, X.L., Hu, Q.N., Liang, Y.Z.: Variable selection via nonconcave penalty function in structure-boiling points correlations. J. Mol. Struct.: THEOCHEM, 714, 235–242 (2005)
Peng, X.L., Yin, H., Li, R., Fang, K.T.: The application of Kriging and empirical Kriging based on the variables selected by SCAD. Analytica Chimica Acta 578, 178–185 (2006)
Tang, Y., Liang, Y.Z., Fang, K.T.: Data mining in chemometrics: sub-structures learning via peak combinations searching in mass spectra. J. Data Sci. 1, 481–496 (2003)
Varmuza, K., He, P., Fang, K.T.: Boosting applied to classification of mass spectral data. J. Data Sci. 1, 391–404 (2003)
Xu, Q.S., Liang, Y.Z., Fang, K.T.: The effects of different experimental designs on parameter estimation in the kinetics of a reversible chemical reaction. Chem. Intell. Lab. Syst. 52, 155–166 (2000)
Xu, Q.S., Massart, D.L., Liang, Y.Z., Fang, K.T.: Two-step multivariate adaptive regression spline for modeling a quantitative relationship between gas chromatography retention indices and molecular descriptors. J. Chromatogr. A 998, 155–167 (2003)
Xu Q.S., Xu Y.D., Li L., Fang K.T.: Uniform experimental design in Chemometrics. J. Chem. e3020 (2018). https://doi.org/10.1002/cem.3020
Yin, X.L., Fang, K.T., Liang, Y.Z., Wong, R.N.S., Ha, A.W.Y.: Assessing phylogenetic relationships of Lycium samples using RAPD and entropy theory. Acta Pharmacologica Sinica 26, 1217–1224 (2005)
Ying, H., Li, R.Z., Fang, K.T., Liang, Y.Z.: Empirical Kriging models and their applications to QSAR. J. Chem. 20, 1–10 (2007)
Zhang, L., Liang, Y.Z., Yu, R.Q., Fang, K.T.: Sequential number-theoretic optimization (SNTO) method applied to chemical quantitative analysis. J. Chem. 11, 267–281 (1997)
Zhang, L., Liang, Y.Z., Jiang, J.H., Yu, R.Q., Fang, K.T.: Uniform design applied to nonlinear multivariate calibration by ANN. Analytica Chimica Acta 370, 65–77 (1998)
Acknowledgments
This work was partially supported by Guangdong Natural Science Foundation (No. 2018A0303130231) and Guangdong Innovation and Enhancement Project: Education Research Programme (R5201920).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
He, P., Peng, X., Xu, Q. (2020). From “Clothing Standard” to “Chemometrics”. In: Fan, J., Pan, J. (eds) Contemporary Experimental Design, Multivariate Analysis and Data Mining. Springer, Cham. https://doi.org/10.1007/978-3-030-46161-4_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-46161-4_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-46160-7
Online ISBN: 978-3-030-46161-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)