Introduction

An important technique for innovative materials research is high-energy X-ray diffractometry, available at synchrotron beamlines, where diffraction patterns are usually recorded on a 2D area detector [1]. These 2D XRD patterns, in which each reflection of a grain in a specific phase produces a distinctive spot on a ring, offer a wealth of information for unraveling the intricacies of the atomistic structure and microstructure of materials. However, comprehensively analyzing all the information contained in these diffraction patterns is a tedious and time-consuming process, as current conventional methods rely on laborious manual operations. Recent advancements in computational methods [2, 3] and deep-learning techniques, especially convolutional neural networks (CNNs), have shown outstanding capabilities in addressing the complexities associated with XRD pattern analysis. Hongyang et al. [4] devised a CNN-based model capable of predicting precise scale factors, lattice parameters, and crystallite size maps for multi-phase catalytic material systems. Similarly, WB et al. [5] employed CNNs to classify crystal structures across approximately 150,000 powder XRD patterns. Hin-Woong et al. [6] constructed a CNN that identifies the constituent phases in multi-phase inorganic compounds. Furthermore, Jerardo et al. [7] extended the utility of CNNs to classify a broad spectrum of materials’ crystal systems and space groups under both simulated and experimental conditions.

While much of this research has centered on 1D XRD patterns, the inherent strengths of CNNs, especially their capacity to autonomously learn hierarchical features and discern patterns, make them particularly well-suited for extracting information from image data [8]. Additionally, 2D patterns contain more detailed information than their 1D counterparts. In our previous work [9], we demonstrated the efficacy of CNNs in accurately predicting phase fractions from 2D high-energy synchrotron XRD patterns. In this context, the “ground truth” of a 2D XRD pattern’s attributes plays an important role in developing models. For a deep-learning model to learn the important features in a set of data, those features must first be identified by experts on a sufficiently representative data set so that the computer can be properly trained. Such data are referred to as “ground truth” in the artificial intelligence (AI) and deep-learning community. The utilization of simulated data stands out as a noteworthy strategy. By employing a Kinematic-Diffraction Simulator (KDS) [10] to generate simulated 2D XRD patterns, researchers gain a unique advantage: precise control over the attributes of interest, ensuring accurate ground truth. This allows for the creation of diverse datasets tailored to specific materials, enhancing the model’s adaptability and performance.

To showcase the efficacy of this approach, we focused our study on the Ti–6Al–4V (Ti64) alloy, a material that is widely used in industry [11]. By leveraging simulated 2D XRD patterns for this alloy (assuming Ti64 contains only \(\upalpha\)- and \(\upbeta\)-phases), we aimed to train a CNN to predict the \(\upbeta\)-phase volume fraction within each 2D XRD pattern. Notably, unlike typical CNN training cases, we used exclusively pure \(\upbeta\)- and pure \(\upalpha\)-phase simulated 2D XRD patterns for training. The trained models were then employed to predict the \(\upbeta\)-phase volume fraction of 2D XRD patterns containing a mixture of \(\upalpha\)- and \(\upbeta\)-phases. This approach, which trains on pure-phase XRD data that are easily generated in real-world experiments, enhances the model’s practical utility. This targeted and controlled strategy ensures that the model learns intricate features specific to the Ti64 alloy, highlighting the potential of simulated data in advancing the accuracy and efficiency of XRD pattern analysis.

Methodology

Data set description

The KDS program we employed applies the theory of diffraction physics to generate simulated 2D XRD patterns [10]. This model, written in the Wolfram language [12], operates on the fundamental principles of diffraction, specifically utilizing Fraunhofer diffraction theory [13] to produce diffractograms based on the lattice parameters of the \(\upalpha\)- and \(\upbeta\)-phases. To generate these patterns, the model requires metadata from the diffractometer (beamline), including the wavelength of the primary beam, energy spread, and beam divergence. On the sample side, the model requires information about the phases present, their crystal structures, mole fractions, and grain-size distributions. Additionally, users have the flexibility to introduce texture or strain into the microstructure. The KDS also accommodates specification of the number of grains within the illuminated volume. Further details on the intricacies of the KDS can be found in [10]. We employed KDS to simulate a total of 6,372 2D XRD patterns, each with a size of \(2048 \times 2048\) pixels. These patterns exclusively consisted of two phases of titanium: the \(\upalpha\)-phase (hexagonal close-packed, HCP) and the \(\upbeta\)-phase (body-centered cubic, BCC). The volume fractions of the \(\upalpha\)- and \(\upbeta\)-phases always sum to 1. To generate continuous rings, we used 100,000 grains. Generating each pattern takes about 15 minutes on a single core and 4 GB of memory, so we farmed these jobs out to a High-Performance Computing (HPC) cluster [14] to produce the patterns at scale. The package generates a “.tiff” file for each simulated diffractogram; the “.tiff” format was chosen because it is lossless and preserves the original data quality. Further information on the simulation parameters is provided in Table 2 in the Appendix. Of all the data, 5,372 2D XRD patterns containing either \(\upalpha\)- or \(\upbeta\)-phases only (i.e., a \(\upbeta\)-phase volume fraction of either 0 or 1) were used to train the CNN models. The remaining 1,000 2D XRD patterns constituted our testing set, with each pattern representing a specific \(\upbeta\)-phase volume fraction, incrementally increasing in steps of 0.05 from 0 to 1. Approximately 75 XRD patterns exist for every 0.1 change in \(\upbeta\)-phase volume fraction, and 25 patterns for each 0.05 change (Fig. 1).
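
As a minimal sketch of how such a dataset could be assembled for model training, the snippet below loads the simulated “.tiff” diffractograms and attaches the known \(\upbeta\)-phase volume fraction as the ground-truth label. The directory layout and file names are hypothetical and are not part of the KDS package.

```python
import glob
import numpy as np
import tifffile  # reads the uncompressed .tiff diffractograms as NumPy arrays

def load_patterns(pattern_dir, beta_fraction):
    """Load all simulated 2048x2048 patterns in a directory and attach
    the known beta-phase volume fraction as the ground-truth label."""
    images, labels = [], []
    for path in sorted(glob.glob(f"{pattern_dir}/*.tiff")):
        images.append(tifffile.imread(path).astype(np.float32))
        labels.append(beta_fraction)
    return np.stack(images), np.array(labels, dtype=np.float32)

# Hypothetical layout: pure-phase patterns are used for training, while the
# mixed-phase patterns (beta fractions 0.00-1.00 in 0.05 steps) are held out for testing.
x_alpha, y_alpha = load_patterns("simulated/pure_alpha", beta_fraction=0.0)
x_beta, y_beta = load_patterns("simulated/pure_beta", beta_fraction=1.0)
x_train = np.concatenate([x_alpha, x_beta])
y_train = np.concatenate([y_alpha, y_beta])
```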

Fig. 1

Examples of 2D XRD patterns generated by KDS. \(\upbeta\)-phase volume fractions from left to right are 0, 0.05, 0.5, 0.95, and 1.00, respectively

CNN implementation

The input to the CNN models is the pixel intensities of a 2D XRD pattern, and the output is the \(\upbeta\)-phase volume fraction. To limit pixel intensities to values between 0 and 1, the pixel intensities of each 2D XRD pattern were divided by the maximum value among all intensities. The CNN models were trained using the TensorFlow framework [15, 16] implemented in the HPC computing environment CRADLE [14]. For the training of the CNN models, the 5,372 diffraction patterns were split into a training set comprising 80% of the data and a validation set comprising the remaining 20%. The training set is used for model learning, while the validation set permits monitoring of the model’s performance during training. We employed the mean squared error (MSE) as the loss function; it is computed as the average of the squared differences between the predicted values and the ground truth. The goal is to identify a model that minimizes the MSE, signifying that its predicted values closely align with the ground truth.
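
A minimal sketch of this preprocessing and loss setup is shown below, assuming the pure-phase patterns have already been loaded into `x_train` and `y_train` as in the earlier sketch. The per-pattern normalization shown here is one reading of the description above; a single global maximum could equally be used.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Scale each 2D XRD pattern so that its pixel intensities lie in [0, 1].
x_norm = x_train / x_train.max(axis=(1, 2), keepdims=True)

# 80% of the pure-phase patterns for training, 20% for validation.
x_tr, x_val, y_tr, y_val = train_test_split(
    x_norm, y_train, test_size=0.2, random_state=42)

def mse(y_true, y_pred):
    """Mean squared error between predicted and ground-truth beta-phase fractions."""
    return float(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2))
```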

The process of training the model involves the important task of selecting optimal hyperparameters to achieve the best model performance, the importance of which is described in references [9, 17–20]. Hyperparameters that control the architecture of the CNN include the number of convolutional layers, max-pooling layers, and fully connected layers [21]. In general, a model with more convolutional and fully connected layers has the capacity to handle larger datasets and capture more intricate features. However, the trade-off is an increase in training time. It is also important to note that an overly complex model may overfit, performing well on the training set but failing to generalize to new, unseen data. In addition, we tuned various other hyperparameters that affect the training process. The learning rate [22], a hyperparameter that governs the scale of parameter adjustments during training, holds substantial importance. The batch size [23] dictates the volume of data processed on the GPU in a single iteration; it plays a crucial role in balancing computational efficiency and memory requirements during the training of the neural network. Moreover, we carefully considered the selection of activation functions. As each pixel value of a 2D XRD pattern is input to the CNN model, these functions empower the neural network to capture intricate non-linear relationships within these pixel values. Without activation functions, the neural network would behave as a linear model, limited to capturing linear relationships in these pixel values.
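
The sketch below illustrates one way such an architecture could be parameterized in Keras, exposing the number of convolutional/max-pooling blocks, the fully connected layers, and the activation function as hyperparameters. The `build_cnn` helper, filter counts, and node sizes are illustrative assumptions rather than the exact configurations listed in Table 1.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_cnn(n_conv=5, n_fc=2, fc_nodes=(128, 64, 32), activation="relu"):
    """Assemble a CNN mapping a 2048x2048 pattern to a single beta-phase volume fraction."""
    model = models.Sequential([layers.Input(shape=(2048, 2048, 1))])
    for i in range(n_conv):
        # Each convolutional layer is followed by a max-pooling layer.
        model.add(layers.Conv2D(8 * 2 ** min(i, 3), kernel_size=3,
                                padding="same", activation=activation))
        model.add(layers.MaxPooling2D(pool_size=2))
    model.add(layers.Flatten())
    for nodes in fc_nodes[:n_fc]:
        model.add(layers.Dense(nodes, activation=activation))
    model.add(layers.Dense(1))  # predicted beta-phase volume fraction
    return model
```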

In this study, two types of activation functions, ReLU and Leaky-ReLU [24], were explored to evaluate their impact on the model’s performance. The ReLU and Leaky-ReLU functions are defined as follows:

$$\begin{aligned} \text{ReLU}(x) = \max (0, x) \end{aligned}$$
(1)
$$\begin{aligned} \text{LeakyReLU}(x) = \max (cx, x) \end{aligned}$$
(2)

where x is the input numeric value and c is a small positive constant. We trained a total of 33 CNNs with different combinations of these hyperparameters. Each model was allocated 64 GB of memory, with the 32 GB of a V100 GPU [25] dedicated to training. Given that our models’ structures are generally not overly complex, each epoch requires approximately 3 minutes. We set 20 epochs for each model, resulting in roughly 1 hour of training time per model.
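
A brief sketch of how the two activation functions and the remaining training hyperparameters might be wired together is given below, reusing the hypothetical `build_cnn` helper from the previous sketch. The Adam optimizer is an assumption (the optimizer is not specified here), and the `alpha` argument of `tf.nn.leaky_relu` plays the role of the constant c in Equation 2.

```python
import tensorflow as tf

# Leaky-ReLU with slope c = 0.01 for negative inputs (c corresponds to alpha here).
leaky = lambda x: tf.nn.leaky_relu(x, alpha=0.01)

model = build_cnn(n_conv=5, n_fc=2, activation=leaky)  # or activation="relu"
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
              loss="mse")

# 20 epochs with a batch size of 16, monitoring the validation set after each epoch.
history = model.fit(x_tr[..., None], y_tr,
                    validation_data=(x_val[..., None], y_val),
                    batch_size=16, epochs=20)
```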

Results

Table 1 All 33 models trained on pure \(\upalpha\)- and \(\upbeta\)-phase patterns, along with their hyperparameters and performance metrics

All details regarding model performance and hyperparameters employed are comprehensively outlined in Table 1. The table furnishes the following information:

  1. Architecture of each model: the number of convolutional layers (#CL) ranges from 4 to 7, the number of fully connected layers (#FC) ranges from 1 to 3 with varying numbers of nodes (128, 64, 32), and the number of max-pooling layers (#MP) varies between 3 and 5.

  2. Learning rate: the range of learning rates utilized is documented.

  3. Batch size: batch sizes employed during training are detailed, ranging from 8 to 32.

  4. Activation functions: the activation function used in each model is presented, featuring both ReLU and Leaky-ReLU. For Leaky-ReLU, the slope for negative values is also provided.

  5. Model performance: reported via the MSE observed on the testing data set.

We observed that different activation functions yield distinct trends in the predicted results. We therefore plotted the predicted-result curves for the top 5 models using ReLU and Leaky-ReLU to visualize model performance. Figure 2a and b provide visual insights into the model performance with the ReLU activation function. Figure 2a depicts the models’ predicted results versus the ground truth for each XRD pattern in the testing set. Figure 2b illustrates the differences between the ground truth and the predicted values, where a smaller difference signifies superior model performance. Figure 2c and d display the same comparisons, but with the Leaky-ReLU activation function. The patterns are ordered by \(\upbeta\)-phase volume fraction from 0 to 1.
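
As an illustration of how plots such as Fig. 2 can be produced from a trained model, the sketch below predicts on the testing set and draws the prediction and difference curves. The arrays `x_test` and `y_test`, holding the 1,000 mixed-phase patterns and their \(\upbeta\)-phase fractions, are assumed to have been loaded and normalized in the same way as the training data.

```python
import numpy as np
import matplotlib.pyplot as plt

y_pred = model.predict(x_test[..., None]).ravel()
order = np.argsort(y_test)  # order patterns from 0 to 1 beta-phase volume fraction

fig, (ax1, ax2) = plt.subplots(2, 1, figsize=(6, 6), sharex=True)
ax1.plot(y_test[order], label="ground truth")
ax1.plot(y_pred[order], label="predicted")
ax1.set_ylabel("beta-phase volume fraction")
ax1.legend()
ax2.plot(y_pred[order] - y_test[order])  # difference curve, as in Fig. 2b and d
ax2.set_ylabel("predicted - ground truth")
ax2.set_xlabel("2D XRD pattern index (sorted)")
plt.show()
```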

Fig. 2

a and b visualize the performance of the best 5 models using the ReLU activation function. In (a), the ground truth is compared to the models’ predicted values for the testing dataset. b displays the differences between the ground truth and the models’ predicted values for each 2D XRD pattern. The 2D XRD patterns are ordered from 0 to 1 based on their \(\upbeta\)-phase volume fraction. Similar visualizations for the best 5 models using the Leaky-ReLU activation function are presented in (c) and (d)

The performance curves of all models employing the Leaky-ReLU activation function exhibit a bow shape (shown in Fig. 2c), indicating that the predictions for the “end points” (pure \(\upbeta\)-phase or pure \(\upalpha\)-phase 2D XRD patterns) are highly accurate, with accuracy gradually declining toward the central positions. On the other hand, the performance curves for models utilizing ReLU activation are sigmoidal (shown in Fig. 2a). Again, the end points are accurate, and an additional region of nearly accurate predictions appears near the middle, around the 0.50 phase fraction position. Model #5, characterized by 5 convolutional layers, 5 max-pooling layers, and 2 fully connected layers, and employing the ReLU activation function, achieves the lowest MSE among all models, registering a value of 9.4\(\times 10^{-4}\) (visualized in Fig. 3).

Fig. 3

Model #5 generated the lowest MSE (\(9.4 \times 10^{-4}\)) in the testing data set. The top graph shows its predicted values versus ground truth, and the bottom graph displays the differences between the ground truth and the predicted values for each 2D XRD pattern

Discussion

Strategy of hyperparameter tuning

Numerous hyperparameters significantly influence the final results of the models. Effectively determining the optimal hyperparameters remains one of the most complex challenges in the AI community [9, 19, 20]. Due to resource constraints, exploring all conceivable combinations of the hyperparameters is impractical, which makes the appropriate selection of hyperparameters critical.

In our previous work [9], we trained 168 different CNN models to predict the \(\upbeta\)-phase volume fraction using experimental Ti64 XRD data sets [26]. From these, we selected the models that exhibited superior performance and used their architectures as our initial models. In Table 1, the performance of these models (#1, 2, 3, 10, 11, 12, 13, and 14) is presented. The models share several settings: they all employ a learning rate of \(1 \times 10^{-3}\), a batch size of 8, and ReLU as the activation function. We observed that models #1, 2, and 3, each including 4 or 5 convolutional layers, with each followed by a max-pooling layer, tended to deliver better performance. To evaluate the influence of previously unexplored hyperparameters, namely the learning rate and batch size, on the training process, we selected the architecture of model #3 as our baseline. This architecture, characterized by fewer parameters compared to models #1 and #2, resulted in reduced training time. We experimented with batch sizes of 16 and 32, and learning rates of 5\(\times 10^{-5}\), 1\(\times 10^{-4}\), 1\(\times 10^{-3}\), and 5\(\times 10^{-3}\). The results are presented as models #4 through #9 in Table 1. After evaluation, model #5 achieved the top performance (shown in Fig. 3), specifically with a learning rate of 1\(\times 10^{-4}\) and a batch size of 16.
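
A minimal sketch of this learning-rate and batch-size sweep around the baseline architecture is shown below; the loop covers the full grid, slightly more than the six combinations actually reported as models #4 through #9, and the record-keeping is illustrative.

```python
import tensorflow as tf

results = {}
for lr in (5e-5, 1e-4, 1e-3, 5e-3):
    for batch_size in (16, 32):
        model = build_cnn(n_conv=5, n_fc=2, activation="relu")  # baseline-like architecture
        model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=lr), loss="mse")
        model.fit(x_tr[..., None], y_tr,
                  validation_data=(x_val[..., None], y_val),
                  batch_size=batch_size, epochs=20, verbose=0)
        # Evaluate on the held-out mixed-phase testing patterns.
        results[(lr, batch_size)] = model.evaluate(x_test[..., None], y_test, verbose=0)

best = min(results, key=results.get)
print("best (learning rate, batch size):", best, "-> test MSE:", results[best])
```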

We also observed that, for an input 2D XRD pattern with a \(\upbeta\)-phase volume fraction of 0 (pure \(\upalpha\)-phase), the output of the last layer of the CNN consists entirely of negative values. Since we used a ReLU activation function, these negative values were adjusted to 0 (see Equation 1), so that the final predicted value would be very close to 0, which is the ground truth for the input 2D XRD pattern. We also sought to investigate the effect of these negative values, opting not to merely filter them out. To do so, we implemented the Leaky-ReLU activation function, which multiplies negative input values by a small, non-zero constant c (see Equation 2) to prevent a node from becoming inactive during training.

For models trained with Leaky-ReLU, we maintained a consistent learning rate of 1\(\times 10^{-4}\) and a batch size of 16 (the same settings used for model #5). We initially explored architectures with fewer max-pooling and convolutional layers, progressively increasing them. We noted that model #20 demonstrates good performance (an MSE of 6.4\(\times 10^{-3}\)) when employing 5 convolutional layers and 4 max-pooling layers. The architecture of model #20 closely resembles the optimal model (#5) trained with ReLU. Building on this, we further refined both architectures: one with 5 convolutional layers and 5 max-pooling layers, and the other with 5 convolutional layers and 4 max-pooling layers. We trained models #20 to 27, experimenting with different fully connected layer configurations and Leaky-ReLU constants c (0.1, 0.01, and 0.005). Among these, models #21, 22, 25, and 27 demonstrated improved results, with smaller MSE than the original model (#20).

Performance of CNN models in analyzing 2D XRD patterns

Current research on deep learning tends to treat neural networks as ‘black boxes.’ While achieving good results is crucial, understanding what a CNN specifically learns from 2D XRD patterns is equally important. Upon observing the feature maps following the convolutional layers, we found that these layers extract ‘rings’ from the input 2D XRD patterns [9, 27]. The fully connected layers then ‘learn’ the significance of each pixel position within these extracted rings by adjusting the associated weights. During the learning process, the model refines these weights based on the intensity of the rings. To generate the final output, the model aggregates the weights multiplied by the intensity values of these rings. For example, when the model is trained with labels of \(\upbeta\)-phase fractions, the weights corresponding to \(\upbeta\) rings will be notably high. As a result, if there is significant intensity in these regions, the model will produce a higher output value.
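
One common way to inspect what the convolutional layers extract is to build an intermediate model that exposes their feature maps. The sketch below assumes the hypothetical Keras model from the earlier sketches and simply visualizes one feature map of the first convolutional layer for a single test pattern.

```python
import tensorflow as tf
import matplotlib.pyplot as plt

# Collect the outputs of every convolutional layer of the trained model.
conv_outputs = [layer.output for layer in model.layers
                if isinstance(layer, tf.keras.layers.Conv2D)]
feature_model = tf.keras.Model(inputs=model.inputs, outputs=conv_outputs)

# Feature maps for one 2D XRD pattern from the testing set.
feature_maps = feature_model(x_test[:1, ..., None])
first = feature_maps[0][0].numpy()  # maps produced by the first convolutional layer

plt.imshow(first[..., 0], cmap="gray")  # one filter's response: the extracted 'rings'
plt.title("Feature map of the first convolutional layer")
plt.show()
```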

A noteworthy point is the potential overlap of \(\upalpha\) and \(\upbeta\) rings in the 2D XRD patterns. We hypothesize that such overlapping rings might not adversely affect the CNN’s performance, because during training the model allocates lower weights to the overlapping regions. Therefore, even if the intensity in these overlapping areas is high, the contribution to the final output, such as the \(\upbeta\)-phase fraction, remains minimal due to the smaller weights assigned. However, it is important to highlight that our dataset comprises only \(\upalpha\)- and \(\upbeta\)-phases, resulting in limited ring-overlap scenarios. In cases where there is extensive ring overlap between two phases, we anticipate that it might introduce more dispersion in the model’s predictions.

In our case, Fig. 3 shows the performance of model #5 in this task. The model clearly distinguishes 2D XRD patterns with different \(\upbeta\)-phase volume fractions ranging from 0 to 1. Model #5 achieved an MSE of \(9.4 \times 10^{-4}\). This result underscores the model’s accuracy, corresponding to an average error of approximately \(\pm 0.03\) for each XRD pattern compared with its ground truth. The other models depicted in Fig. 2 exhibit similar performance, indicating that the performance of model #5 is not accidental. Therefore, compared to our previous work [9], we assert that it is feasible to train effective models exclusively with pure \(\upalpha\)- and \(\upbeta\)-phases. The trained model can then be utilized to predict 2D XRD patterns with mixed phases.
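
The \(\pm 0.03\) figure follows directly from the root of the mean squared error:

$$\begin{aligned} \sqrt{\text{MSE}} = \sqrt{9.4 \times 10^{-4}} \approx 0.03 \end{aligned}$$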

We also believe that exploring more hyperparameter combinations could lead to the discovery of a better model [17, 18, 21]. From the current tuning process, we infer bounding ranges for certain hyperparameters relevant to this task:

  • The optimal number of convolutional and max-pooling layers falls within the range of 4 to 5, while the number of fully connected layers can be limited to 1 or 2.

  • Optimal learning rates for training are observed at \(1\times 10^{-4}\) or \(1\times 10^{-3}\), while batch sizes of 8 or 16 are recommended.

  • The ReLU curve is sigmoidal, while the Leaky-ReLU curve takes on a bow shape. We attribute this dissimilarity to the fact that Leaky-ReLU does not completely discard negative values, leading to smaller predicted values. We also observe that models exhibiting superior performance with Leaky-ReLU often employ a relatively small constant c; as this value approaches 0, the behavior becomes increasingly similar to that of ReLU. As depicted in Fig. 2, regardless of the activation function employed, the models can effectively discern different \(\upbeta\)-phase volume fractions in XRD patterns, even when there is some divergence in their predictions. Therefore, for this specific task, we consider ReLU to remain the optimal activation function.

Conclusion

In this study, we employed simulated 2D XRD patterns to ensure accurate ground truth for the \(\upbeta\)-phase volume fraction and used them to train CNN models. We expanded on our previous work by exploring additional hyperparameters and demonstrated their impact on model performance. Through our hyperparameter tuning strategy, we showed that multiple models produce predictions close to the ground truth. These models, even when trained solely on pure-phase data, exhibit the ability to predict \(\upbeta\)-phase volume fractions for 2D XRD patterns with mixed phases. Generating pure-phase data is easily achievable in real-world scenarios, and our work highlights the potential of using pure-phase XRD data to train models, enhancing accuracy while conserving resources. This also underscores the feasibility of deep-learning techniques for XRD pattern analysis and provides a robust approach for future materials research endeavors.