Skip to main content

From “Clothing Standard” to “Chemometrics”

Review of Prof. Kai-Tai Fang’s Contributions in Data Mining

  • Chapter
  • First Online:
Contemporary Experimental Design, Multivariate Analysis and Data Mining

Abstract

This paper reviews Prof. Kai-Tai Fang’s contributions in data mining. Since the 1970s, Prof. Fang has been committed to applying statistical ideas and methods to deal with large amounts of data in practical projects. By analyzing more than 400,000 pieces of data, he found representative clothing indicators and established the first adult clothing standard in China; through cleaning and modeling steel-making data from steel mills all over the country, he revised the national standard for alloy structural steel; by studying various data in chemometrics, he introduced many new advanced statistical methods to improve the identification and classification of chemical components, established more effective models for the relationship between quantitative structure and activity, and promoted the application of the traditional Chinese medicine (TCM) fingerprint in TCM quality control. Professor Fang and his team’s research achievements in data mining have been highly appreciated by relevant experts. This article is written to celebrate the 80th birthday of Prof. Kaitai Fang.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Atkinson, A.C., Bogacka, B., Bogacki, M.B.: \(D\)- and \(T\)-optimum designs for the kinetics of a reversible chemical reaction. Chemom. Lab. Syst. 43, 185–198 (1998)

    Google Scholar 

  2. Cox, D.R.: Note on grouping. J. Am. Stat. Assoc. 52, 543–547 (1957)

    MATH  Google Scholar 

  3. Fang, K.T.: Using conditional distributions to formulate a national clothing standard. J. Appl. Math. 2, 63–74 (1976). (Chinese)

    Google Scholar 

  4. Fang, K.T.: Uniform design—application of number theory method in experimental design. J. Appl. Math. 3, 363–372 (1980). (Chinese)

    Google Scholar 

  5. Fang, K.T.: Endless and unbending journey to statistics research: the oral autobiography of Kai-tai Fang. Hunan Education Press (2016) (in Chinese)

    Google Scholar 

  6. Fang, K.T., He, S.: The problem of selecting a given number of representative points in a normal population and a generalized mill’s ratio. Technical report, No. 5, Department of Statistics, Stanford University, USA (1982)

    Google Scholar 

  7. Fang, K.T., Wang, Y.: Number-Theoretic Methods in Statistics. Chapman and Hall, London (1994)

    MATH  Google Scholar 

  8. Fang, K.T., Wu, C.Y.: A probability extremum problem. J. Appl. Math. 2, 132–148 (1979) (in Chinese)

    Google Scholar 

  9. Fang, K.T., Zhang Y.T.: Introduction to Multivariate Statistical Analysis. Science Press (1982) (in Chinese)

    Google Scholar 

  10. Fang, K.T., Bentler, P.M., Yuan, K.H.: Applications of number-theoretic methods to quantizers of elliptically contoured distributions. In: Multivariate Analysis and Its Applications, IMS Lecture Notes—Monograph Series, pp. 211–225 (1994)

    Google Scholar 

  11. Fang, K.T., Liang, Y.Z., Yin, X.L., Chen, K., Lu, G.H.: Critical value determination on similarity of fingerprints. Chemom. Intell. Lab. Syst. 82, 236–240 (2006)

    Google Scholar 

  12. Fang, K.T., Liang, Y.Z., Yu, R.Q. (eds.): Data Mining and Bioinformatics in Chemistry and Chinese Medicines. Hong Kong Baptist University (2003)

    Google Scholar 

  13. Fang, K.T., Liang, Y.Z., Yu, R.Q. (eds.): Data Mining and Bioinformatics in Chemistry and Chinese Medicines, vol. 2, Hong Kong Baptist University (2004)

    Google Scholar 

  14. Fang, K.T., Xiang, K.F., Liu, G.Y.: Precision of Test Method. China Standard Press, Beijing (1988)

    Google Scholar 

  15. Fang, K.T., Yin, H., Liang, Y.Z.: New approach by Kriging methods to problems in QSAR. J. Chem. Inform. Model. 44, 2106–2113 (2004)

    Google Scholar 

  16. Fang, K.T., Zhou, M., Wang, W.J.: Applications of the representative points in statistical simulations. Sci. China Ser. A 57, 2609–2620 (2014)

    MathSciNet  MATH  Google Scholar 

  17. He, P., Fang, K.T., Xu, C.J.: The classification tree combined with SIR and its applications to classification of mass spectra. J. Data Sci. 1, 425–445 (2003)

    Google Scholar 

  18. He, P., Fang, K.T., Liang, Y.Z., Li, B.Y.: A Generalized boosting algorithm and its application to two-class chemical classification problem. Analytica Chimica Acta 543, 181–191 (2005)

    Google Scholar 

  19. He, P., Xu, C.J., Liang, Y.Z., Fang, K.T.: Improving the classification accuracy in chemistry via boosting technique. Chem. Intell. Lab. Syst. 70, 39–46 (2004)

    Google Scholar 

  20. Hu, Q.N., Liang, Y.Z., Fang, K.T.: The matrix expression, topological index and atomic attribute of molecular topological structure. J. Data Sci. 1, 361–389 (2003)

    Google Scholar 

  21. Hu, Q.N., Liang, Y.Z., Peng, X.L., Yin, H., Fang, K.T.: Structural interpretation of a topological index. 1. External factor variable connectivity index (EFVCI). J. Chem. Inf. Comput. Sci. 44, 437–446 (2004)

    Google Scholar 

  22. Hu, Q.N., Liang, Y.Z., Yin, H., Peng, X.L., Fang, K.T.: Structural interpretation of a topological index. 2. The molecular connectivity index, the Kappa index, and the atom-type E-state index. J. Chem. Inf. Comput. Sci. 44, 1193–1201 (2004)

    Google Scholar 

  23. Lee, A.W.M., Chan, W.F., Yuen, F.S.Y., Tse, P.K., Liang, Y.Z., Fang, K.T.: An example of a sequential uniform design: application in capillary electrophoresis. Chem. Intell. Lab. Syst. 39, 11–18 (1997)

    Google Scholar 

  24. Liang, Y.Z., Fang, K.T.: Robust multivariate calibration algorithm based on least median squares and sequential number theoretic optimization method. Anal. Chem. 121, 1025–1029 (1996)

    Google Scholar 

  25. Liang, Y.Z., Fang, K.T., Xu, Q.S.: Uniform design and its applications in chemistry and chemical engineering. Chem. Intell. Lab. Syst. 58, 43–57 (2001)

    Google Scholar 

  26. Peng, X.L., Hu, Q.N., Liang, Y.Z.: Variable selection via nonconcave penalty function in structure-boiling points correlations. J. Mol. Struct.: THEOCHEM, 714, 235–242 (2005)

    Google Scholar 

  27. Peng, X.L., Yin, H., Li, R., Fang, K.T.: The application of Kriging and empirical Kriging based on the variables selected by SCAD. Analytica Chimica Acta 578, 178–185 (2006)

    Google Scholar 

  28. Tang, Y., Liang, Y.Z., Fang, K.T.: Data mining in chemometrics: sub-structures learning via peak combinations searching in mass spectra. J. Data Sci. 1, 481–496 (2003)

    Google Scholar 

  29. Varmuza, K., He, P., Fang, K.T.: Boosting applied to classification of mass spectral data. J. Data Sci. 1, 391–404 (2003)

    Google Scholar 

  30. Xu, Q.S., Liang, Y.Z., Fang, K.T.: The effects of different experimental designs on parameter estimation in the kinetics of a reversible chemical reaction. Chem. Intell. Lab. Syst. 52, 155–166 (2000)

    Google Scholar 

  31. Xu, Q.S., Massart, D.L., Liang, Y.Z., Fang, K.T.: Two-step multivariate adaptive regression spline for modeling a quantitative relationship between gas chromatography retention indices and molecular descriptors. J. Chromatogr. A 998, 155–167 (2003)

    Google Scholar 

  32. Xu Q.S., Xu Y.D., Li L., Fang K.T.: Uniform experimental design in Chemometrics. J. Chem. e3020 (2018). https://doi.org/10.1002/cem.3020

    Google Scholar 

  33. Yin, X.L., Fang, K.T., Liang, Y.Z., Wong, R.N.S., Ha, A.W.Y.: Assessing phylogenetic relationships of Lycium samples using RAPD and entropy theory. Acta Pharmacologica Sinica 26, 1217–1224 (2005)

    Google Scholar 

  34. Ying, H., Li, R.Z., Fang, K.T., Liang, Y.Z.: Empirical Kriging models and their applications to QSAR. J. Chem. 20, 1–10 (2007)

    Google Scholar 

  35. Zhang, L., Liang, Y.Z., Yu, R.Q., Fang, K.T.: Sequential number-theoretic optimization (SNTO) method applied to chemical quantitative analysis. J. Chem. 11, 267–281 (1997)

    Google Scholar 

  36. Zhang, L., Liang, Y.Z., Jiang, J.H., Yu, R.Q., Fang, K.T.: Uniform design applied to nonlinear multivariate calibration by ANN. Analytica Chimica Acta 370, 65–77 (1998)

    Google Scholar 

Download references

Acknowledgments

This work was partially supported by Guangdong Natural Science Foundation (No. 2018A0303130231) and Guangdong Innovation and Enhancement Project: Education Research Programme (R5201920).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ping He .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

He, P., Peng, X., Xu, Q. (2020). From “Clothing Standard” to “Chemometrics”. In: Fan, J., Pan, J. (eds) Contemporary Experimental Design, Multivariate Analysis and Data Mining. Springer, Cham. https://doi.org/10.1007/978-3-030-46161-4_3

Download citation

Publish with us

Policies and ethics