1 Introduction

Slag density in the blast furnace ironmaking process is dependent on a complex set of parameters which govern the thermodynamics and kinetics of the slag formation and transport, respectively.[1] The slag-metal separation rate is directly influenced by the slag density which affects the productivity of the process.[2] The network structure of the Al2O3-MgO-CaO-SiO2 quaternary slag system incorporating minor constituents[3,4,5,6] can be a prime factor responsible for the high density of the slag. However, no quantified correlation of the structure of the slag with density can be directly established. The more fundamental quantities like the enthalpy of mixing[7,8,9,10] can be correlated to the slag density to make an understanding. This would enable cross-correlating the slag density with the structure, which is related to the enthalpy of mixing.[11] Exothermic enthalpy of mixing would mean a denser slag system being energetically favoured and vice-versa. Another crucial thermophysical quantity is the viscosity of the slag which has been a subject of study for a long period of time in the quaternary systems.[12,13,14,15,16,17] Slag viscosity is not an independent governing quantity, but rather a dependent property which is a function of many fundamental parameters. The key importance of viscosity is to govern the slag transport and removal rate,[18,19,20] which strongly impacts productivity. The understanding of the inter-property correlations as mentioned above, is crucial to modern blast furnace ironmaking which is undergoing a transition where the utilization of low-grade iron ores is posing significant challenges to the slag practice.[21,22,23,24,25] High alumina slag[26] where Al2O3 content is more than 15 wt.% is responsible for the formation of highly dense and viscous slags which affect productivity.[26,27,28,29,30] The complex alumino-silicate structure of the slag[31] is responsible for increasing the slag viscosity which needs to be counteracted. Here, the correlation between the density of the slag and slag viscosity is essential as both quantities are influenced by the underlying slag structure. The industrial-level engineering of the slag chemistry via dilution or minor constituent addition[31,32,33,34,35] can then be targeted for a specific property without independently measuring both. For instance, the slag density can be determined using suitable experimental techniques or models and its relationship with slag viscosity can be established henceforth, without using tedious steps to measure the viscosity simultaneously.

The measurement of slag density for different quaternary slag compositions and temperatures has been experimentally performed by various scholarly groups.[1,2,36,37] The relationship of slag density (molar volume) with the slag composition and temperature was established using the CALPHAD approach by Moharana et al. and the group,[1] which is also the core input to this work. The mathematical model developed in the work showed density trends for different slag systems with composition variation and as well as temperature variation. Density measurements using the Archimedes principle have been performed by Muhmood and Seetharaman which provided insights into variation for the low silica quaternary slag systems.[38]

The correlation between viscosity and slag composition has been theoretically studied by Hazra et al. which demonstrates that high alumina content primarily increases the viscosity of the slag thereby affecting productivity.[16] In another study, Pal et al. used machine learning models to derive insights into the role of MgO, CaO/SiO2 (C/S) ratio and Al2O3 content on the slag viscosity.[17] Slag viscosity has also been determined using molecular dynamics to get atomic-level insights into the structure-property correlation.

Despite the availability of adequate literature on the individual properties of the slag system, there is a significant gap in the study of the inter-property correlation of the quaternary slag. To the best of our knowledge, no study has been done experimentally or computationally which correlates the slag density and viscosity to establish a relationship.

The objective of this study is to establish correlations between the slag density and mixing enthalpy as well as slag density and slag viscosity. It involves the quantification of slag density as a function of known quantities like composition of constituents, temperature, and enthalpy of mixing while taking slag density as an independent variable when correlating with viscosity. The correlations would enable a better understanding of the role of these quantities in governing the slag transport dynamics. Additionally, a more physical relationship can be derived using the following relationship of molar volume and density to extend the correlations made with density:

$$\rho = {\raise0.7ex\hbox{${\left( {\mathop \sum \limits_{i = 1}^{c} xiMi} \right)}$} \!\mathord{\left/ {\vphantom {{\left( {\mathop \sum \limits_{i = 1}^{c} xiMi} \right)} {Vm}}}\right.\kern-0pt} \!\lower0.7ex\hbox{${Vm}$}}$$

The study makes use of machine learning (ML) models to correlate these quantities. Due to its capacity to manage the complicated interaction of many factors inherent in the slag domain, machine learning (ML) is highly favoured for creating inter-property correlations in the slag domain. The slag system has various compositional, temperature, and structural factors that interact in complex ways, making typical analytical techniques difficult and time-consuming. ML excels in capturing nonlinear connections and identifying patterns in large datasets, which is critical for dealing with complicated slag compositions and their associated characteristics. For these reasons, ML is chosen over the standard CALPHAD approach to make the correlations.

In the subsequent sections, the model development, deployment and predictions are discussed to derive insights into the slag systems.

2 Model Development

The machine learning models were developed to predict the slag density, enthalpy of mixing and viscosity as a function of slag composition and temperature. A total of 4 models were developed and deployed, with the final model incorporating density as an input feature along with composition and temperature, to predict the viscosity. The training data for the models was obtained in three steps: (a) The density data were obtained using the model developed by Moharana et al.[1] (b) The mixing enthalpy and viscosity data were obtained by calculating the raw composition vs temperature data in FactSage to make the hybrid dataset (c) The viscosity data were also obtained by utilizing the model developed by Pal et al.[17] to populate the fitting data. All the models were tuned using hyperparameter optimization to obtain the best-fit model. Python's scikit-learn package[39] was used to import all the necessary models. The distribution of data and correlation matrix for the first model is shown in Fig. 1 and the correlation matrices for the other 3 models are shown as a part of Fig. 2.

Fig. 1
figure 1

(a) Initial data distribution for enthalpy of mixing (b) Correlation matrix for ML model with target feature- enthalpy of mixing

Fig. 2
figure 2

Correlation matrices/heatmaps for (a) Density prediction model (b)Viscosity prediction model (c) Hybrid model

The negative correlation of enthalpy of mixing with the slag constituents is depicted in Fig. 1(b). The heatmap also shows a negative correlation of enthalpy with temperature indicating the enthalpy of mixing increases in positive magnitude, which is expected for endothermic reactions at high temperatures.

The correlation matrix as shown in Fig. 2(a) depicts a negative correlation between density and temperature suggesting that as the temperature increases, the density tends to decrease. For the slag components, the correlation with density is positive for Al2O3 and CaO but negative for MgO and SiO2. On the other hand, Figure 2(b) shows a negative correlation of the slag components except for SiO2 which is due to the network-breaking and network-forming tendencies of the constituents respectively for the existing quaternary-composition space. This verifies the non-linear relationship of density and viscosity which is investigated in the further sections.

The hyperparameter optimization was done by iterating over the following models- Linear regression, Ridge regression, K nearest neighbours, Decision tree, Support vector machine, Random Forest, Adaboost and ExtraTrees regression. The training procedure was carried out by splitting the data into 80:20 train:test split. For choosing the best-fit model, two metrics, namely R2 score and mean square error (MSE) were chosen. The R2 score of any model validation represents the proportion of variation in the dependent variable and ranges from 0 to 1. A high R2 value indicates a more dependable and accurate model. The following equation is used to get the R2 score.

$${\text{R}}^{2} = \frac{{\mathop \sum \nolimits_{1}^{{\text{n}}} \left( {{\text{measured}}_{{\text{i}}} - {\text{predicted}}_{{\text{i}}} } \right)}}{{\mathop \sum \nolimits_{1}^{{\text{n}}} \left( {{\text{measured}}_{{\text{i}}} - {\text{average}}\left( {{\text{measured}}} \right)} \right)^{2} }}\user2{ }$$
(1)

Similarly, the mean square error value is described as:

$${\text{M.S.E}} = \frac{{\mathop \sum \nolimits_{1}^{n} \left( {{\text{predicted}}_{i} - {\text{actual}}_{i} } \right)^{2} }}{n}$$
(2)

where n is the number of data points used in the model construction.

The result of the hyperparameter optimization implied that ExtraTrees regression is the best fit model for all the four models. However, the model parameters were different for all four cases in order to maximize the R2 score and minimizing the M.S.E. The performance metrics for the aforementioned models is shown in Table 1.

Table 1 R2 score and M.S.E values for the ML prediction models

The ExtraTrees regressor is an ensemble algorithm for learning that makes predictions by combining numerous decision trees.[40] ExtraTrees, unlike regular decision trees or random forests, use a high degree of randomness throughout the training process. ExtraTrees successfully decreases variance and overfitting by employing random subsets of the data and randomly picking features for node splitting. This improves the generalization performance. With the provided hyperparameters, the ExtraTrees regressor model is the overall best choice for prediction. The model's ensemble nature, along with randomization and hyperparameter optimization, allows it to properly manage the dataset's complexities and details. The ExtraTrees regressor demonstrates improved accuracy and generalization capabilities by utilizing the power of multiple decision trees and feature randomization, making it a great choice for slag property prediction in this case. However, it is important to highlight that model selection may be dependent on the unique characteristics of the dataset, and subsequent assessments on varied datasets should be undertaken to confirm its robustness.

The four ML models utilizing this algorithm were now provided with multiple test datasets to make suitable predictions in the regime of interest. The detailed description and analysis of these predictions is given in the subsequent section.

3 Results and Discussion

3.1 Enthalpy of Mixing of Quaternary Slag System

The Al2O3-MgO-CaO-SiO2 quaternary slag system has a negative enthalpy of mixing indicating exothermicity at a given temperature as predicted by FactSage. To establish this relationship, the absolute enthalpy of mixing (ΔHmix) is selected as the basis for training the model. This is done to follow the practice that is often used in the literature.

To understand the dependence of ΔHmix on the slag composition, three different test sets were chosen. In the first case, the variation of Al2O3 was correlated with ΔHmix for a fixed MgO content of 10 wt.% and fixed CaO/SiO2 or C/S ratio of 1. The curve showing this relationship at three different temperatures, T=1773K, 1823K and 1873K, is presented in Fig. 3(a).

Fig. 3
figure 3

Enthalpy of mixing as a function of (a) Al2O3 wt.% (b) C/S ratio and (c) MgO wt.% at three different temperatures

In the quaternary slag system, there is a significant increase in the ΔHmix (in the negative direction) as the Al2O3 content increases. The relationship is depicted in Fig. 3(a), which shows that ΔHmix increases almost linearly with increasing Al2O3 content. This is to be expected, as the content of Al2O3 increases the complexity of the slag growth (alumina network), resulting in an increase in enthalpy due to stronger atomic interactions. This is an indication of higher exothermicity.

Notably, the lines show ΔHmix overlap at 1773K and 1823K, indicating a constant enthalpy shift at these temperatures across various Al2O3 contents. The dominance of particular interactions that remain relatively consistent within this temperature range can be attributed to this phenomenon. However, at a higher temperature, i.e., 1873K, the curve shifts upward showing significant deviation from the previous two temperatures. This is indicative of a change in the slag structure, possibly the breakage of the alumino-silicate network at a higher temperature resulting in smaller chains and clusters.

A similar but reverse trend is observed for the slag system where the C/S ratio is varied from 0.9 to 1.2 in steps of 0.05 to study the variation of ΔHmix. For studying the effect of the C/S ratio, slag composition is chosen with 18 wt.% fixed Al2O3, 10 wt.% fixed MgO and three different temperatures as in the previous case. As shown in Fig. 3(b), the ΔHmix values decrease (in the negative direction) as the C/S ratio is increased with the observation for temperatures seeming consistent with the previous case. The drop in ΔHmix observed as the C/S ratio increases can be due to the preferred generation of energetically favourable phases of CaO dissolving the SiO2 and Al2O3 to form calcium silicates/aluminates, resulting in decreased enthalpy and higher stability in the quaternary slag system. Again, the higher temperature would mean a more depolymerised structure, which would have a relatively higher enthalpy of mixing than the lower-temperature counterparts.

A rather more complex relationship is shown between the MgO content and ΔHmix as shown in Fig. 3(c). The observed behaviour of ΔHmix with increasing MgO wt.% reveals a two-stage behaviour. The almost steady ΔHmix trend is shown with slight deviations upto 14 wt.% MgO, followed by a sharp increase in the negative direction. There is an indication of compound formation at temperatures below the liquidus temperature, corresponding to an increase in the exothermicity.

The enthalpy of mixing relationship with slag composition and temperature would be further used to make correlations with slag density in the subsequent subsections.

3.2 Density Variation with Composition and Temperature

The density predictor model is run on the same input datasets to make predictions on slag density as Al2O3, MgO, C/S and temperature are varied. Density being a physical property shows a more direct relationship without complexity with the slag composition and temperature, as compared to the enthalpy of mixing. The density of the slag system is shown to increase with increasing Al2O3 content in Fig. 4(a). The increase in density is indicative of a more polymerised structure which is resulting with the addition of Al2O3. This trend may be divided into three separate regions: a quick increase at low Al2O3 contents, a steady increase at higher contents, and a rapid increase at even higher contents. As the temperature rises from 1773 K to 1873 K, the curve moves downward. This behaviour indicates that densities are lower at higher temperatures.

Fig. 4
figure 4

Slag density as a function of (a) Al2O3 wt.%, (b) C/S ratio, and (c) MgO wt.% at three different temperatures

Thermal energy disrupts the atomic configurations and lowers the overall packing density of the slag particles at high temperatures. As a result, when the temperature rises, the density of the slag system drops.

A direct and linear relationship between the C/S ratio and slag density can be observed in Fig. 4(b). The slag density increases linearly as the C/S ratio increases from 0.9 to 1.2. The variations in the slag system caused by altering the C/S ratio can explain this behaviour. Lower C/S ratios (about 0.9) result in a silica-rich slag system, whereas higher C/S ratios result in an increased amount of CaO. Denser calcium silicate phases form inside the slag matrix when the C/S ratio rises, leading to an apparent increase in slag density. Furthermore, the tendency of greater slag density at lower temperatures is consistent with the preceding case. Similarly, the increase in MgO content, also increase the slag density as shown in Fig. 4(c), which can be attributed to the same phenomenon of basic oxide phases forming in the slag resulting in higher density of the slag.

3.3 Correlation of Density of Slag with Enthalpy of Mixing

The density and mixing enthalpy prediction models using the same inputs, enable the establishment of a correlation between the slag density and enthalpy of mixing.

As shown in Fig. 5, the exothermicity and density are parabolically correlated upto 15 wt.% Al2O3 after which they start to deviate. The deviation is more pronounced at higher temperatures due to the complexity of the slag structure which the model captures to some extent. Higher exothermicity implies greater density which falls under the argument that stronger bonds lead to denser phases. The correlation is also affected by the nature of the Al2O3 bonding as there can be changes in the network forming ability at higher contents that might affect the relationship of density with the exothermicity.

Fig. 5
figure 5

Density of the slag as a function of enthalpy of mixing for increasing Al2O3 content at three different temperatures

3.4 Viscosity Variation with Composition and Temperature

It has been noted that viscosity responds differently to different compositional parameters. In particular, because of Al2O3 network-forming qualities, which improve covalent bonding and structural rigidity, an increase in Al2O3 content is linked to an increase in viscosity as shown in Fig 6(a). When the C/S ratio (CaO/SiO2) rises to a certain threshold, the fluxing effect of CaO causes the network to break, which is why the viscosity drops. Beyond this point, though, the extra CaO could lead to the creation of additional phases and a subsequent increase in viscosity which is demonstrated in Fig 6(b). Furthermore, Figure 6(c) shows that the addition of MgO tends to reduce viscosity, serving as a flux and network moderator to improve the transport and fluidity.

Fig. 6
figure 6

Viscosity of the slag as a function of (a) Al2O3 wt.% (b) C/S ratio and (c) MgO wt.% at three different temperatures

3.5 Viscosity and Density Correlation for the Slag

Figure 7 shows the relationship between the density and viscosity of the slags as the Al2O3 content is increased from 5 to 20 wt.% and MgO, C/S ratio are kept fixed at 10 wt.% and 1 respectively. The viscosity increases slowly with the increase in density at 1773K whereas the rise is more rapid at higher temperatures. The relationship established between the viscosity and density by the machine learning model is a direct one. The direct correlation between the viscosity and density has been pre-established using the absolute reaction-rate theory proposed by Glasstone et al.[41] This is given by the equation:

$$\eta = \left( {hN/Vm} \right) * exp\left( { - \Delta G* / RT} \right) = \left( {hN\rho /M} \right) * exp\left( { - \Delta G* / RT} \right)$$

where h= Planck’s constant, N= Avogadro’s number, Vm= Molar volume, G* = Gibbs energy of activation per mole.

Fig. 7
figure 7

Viscosity of the slag as a function of the density of the slag for increasing Al2O3 content at three different temperatures

Our model validates the above empirical relationship although it is data driven. This can be counted as the first ever attempt to the best of our knowledge. The relationship between viscosity and density could be extrapolated to an even complex set of conditions by re-training the model with more features. This falls under the scope of a future work with an advantage of high-throughput correlations that can be used for complex environments like the blast furnace.

4 Conclusion

The density of the blast furnace slag systems was found to be intrinsically correlated to the enthalpy of mixing and viscosity. The complex interplay of slag structure in the high alumina regime and the governing parameters like temperature and constituent compositions were modelled using machine learning and correlated to obtain direct correlations between density and enthalpy of mixing. It was verified that higher exothermicity or more negative enthalpy of mixing, resulted in a higher density with a deviation in the trend at 15 wt.% Al2O3. The viscosity of the slag increased with higher densities of slag as Al2O3 was varied from 5 to 20 wt.%. This established a direct correlation between the viscosity and the density of slag using machine learning that was validated against the absolute reaction-rate theory proposed by Glasstone et al.[41] The relationship was more linear at higher temperatures and parabolic at low temperatures. The work has a scope to be used for more complex systems with minor additions to better understand the impact of slag density on blast furnace productivity.