Transfer learning for predicting human skin sensitizers
- 292 Downloads
Computational prioritization of chemicals for potential skin sensitization risks plays essential roles in the risk assessment of environmental chemicals and drug development. Given the huge number of chemicals for testing, computational methods enable the fast identification of high-risk chemicals for experimental validation and design of safer alternatives. However, the development of robust prediction model requires a large dataset of tested chemicals that is usually not available for most toxicological endpoints, especially for human data. A small training dataset makes the development of effective models difficult with insufficient coverage and accuracy. In this study, an ensemble tree-based multitask learning method was developed incorporating three relevant tasks in the well-defined adverse outcome pathway (AOP) of skin sensitization to transfer shared knowledge to the major task of human sensitizers. The results show both largely improved coverage and accuracy compared with three state-of-the-art methods. A user-friendly prediction server was available at https://cwtung.kmu.edu.tw/skinsensdb/predict. As AOPs for various toxicity endpoints are being actively developed, the proposed method can be applied to develop prediction models for other endpoints.
KeywordsAdverse outcome pathway Allergic contact dermatitis Alternative method Multitask learning Skin sensitization ExtraTrees
This work was supported by Ministry of Science and Technology of Taiwan [MOST104-2221-E-037-001-MY3, MOST107-2221-E-037-005-MY3]; National Health Research Institutes [NHRI-107A1-EMCO-0318184]; and Research Center for Environmental Medicine in Kaohsiung Medical University from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan. This work was initiated in Kaohsiung Medical University and completed in Taipei Medical University. The funding agencies play no role in the study design, data analysis and manuscript preparation.
Compliance with ethical standards
Conflict of interest
The authors declare no conflict of interest.
- Jaworska JS, Natsch A, Ryan C, Strickland J, Ashikaga T, Miyazawa M (2015) Bayesian integrated testing strategy (ITS) for skin sensitization potency assessment: a decision support system for quantitative weight of evidence and adaptive testing strategy. Arch Toxicol 89(12):2355–2383. https://doi.org/10.1007/s00204-015-1634-2 CrossRefGoogle Scholar
- Li Y, Pan D, Liu J et al (2007a) Categorical QSAR Models for skin sensitization based upon local lymph node assay classification measures part 2: 4D-fingerprint three-state and two-2-state logistic regression models. Toxicol Sci 99(2):532–544. https://doi.org/10.1093/toxsci/kfm185 CrossRefGoogle Scholar
- Netzeva TI, Worth A, Aldenberg T et al (2005) Current status of methods for defining the applicability domain of (quantitative) structure-activity relationships. The report and recommendations of ECVAM Workshop 52. Altern Lab Anim 33(2):155–173Google Scholar
- OECD (2018) OECD QSAR Toolbox, https://www.qsartoolbox.org/. Accessed 13 Sep 2018
- Patlewicz GY, Basketter DA, Pease CK et al (2004) Further evaluation of quantitative structure–activity relationship models for the prediction of the skin sensitization potency of selected fragrance allergens. Contact Dermatitis 50(2):91–97. https://doi.org/10.1111/j.0105-1873.2004.00322.x CrossRefGoogle Scholar
- Simm J, Magrans De Abril I, Sugiyama M (2014) Tree-Based Ensemble Multi-Task Learning Method for Classification and Regression. IEICE Transactions on Information Systems E97D(6):1677–1681 https://doi.org/10.1587/transinf.E97.D.1677