Introduction

Sand production is a critical issue, especially prevalent in weaker, unconsolidated formations. Its significance in the industry is underscored by the multifaceted challenges it presents, ranging from equipment erosion and increased maintenance requirements to environmental concerns. The dynamics of sand production exhibits varied patterns in the field, often characterized initially by a burst, followed by potential stabilization or decline. This transient nature of sand production hinges on key parameters such as the peak production rate and timing of its occurrence. Traditionally, the industry's approach to managing sand production has revolved around reactive measures post-occurrence. However, recent advancements in predictive analytics, particularly the application of machine learning (ML) algorithms, have opened new frontiers in proactive management.

Various sand production patterns can be observed in the field, as noted by Fjaer et al. (2008): Initially, when a well is started, there is a surge in sand production due to the release of perforation debris with reservoir fluid. At certain flow rates, equilibrium is established, leading to transient sand production. The key parameters of interest in transient sand production are the peak value and declining rate. Accurate prediction of these parameters is crucial for planning sand disposal measures. For instance, if the peak value is not critical and the declining rate is high, indicating a small amount of sand over a short period, there may be no need for long-term sand storage. Conversely, a high peak value with a low declining rate necessitates preparation for significant sand production over an extended period, requiring specific sand disposal sites. Therefore, effective prediction of peak sand production and declining rates forms the foundation of a cost-effective sand management strategy.

Comprehensive research has been conducted to predict sanding onset, and as a result, there are many analytical, numerical, and empirical models which can predict critical production conditions triggering sand production to a certain extent (Morita et al. 1989; Kessler et al., 1993; Weingarten and Perkins, 2007; Wang and Dusseault, 2010; Han et al., 2011; Al-Shaaibi et al., 2013; Fuh and Morita, 2013; Wu et al. 2016; Papamichos and Furui, 2019). On the other hand, predicting not only sanding onset but also its volume is crucial for sand management strategies justification. However, sand volume prediction models are susceptible to specific parameters and assumptions and therefore can be only applied to certain field conditions. Moreover, sand volume prediction is particularly essential because it affects the sand control solutions planned for a specific well.

The principle of analytical sand volume prediction models is to establish a mathematical formulation for the constitutive material behavior and sand detachment mechanism (Geilikman et al., 1994; Al-Shaaibi et al., 2013; Papamichos and Furui 2013; Gholami et al. 2016; Hayavi and Abdideh 2017; Papamichos and Furui 2019; Shabdirova et al., 2019). Hole stability analysis is then conducted by introducing failure criteria to a constitutive model. This is followed by the analytical expression for the sand detachment process. Analytical models are fast and easy to use; however, they mostly simulate very simplified conditions. Because any model employs simplifying assumptions, analytical models can be used where a quick and reliable sand prediction tool is required.

More complex conditions can be considered if the numerical prediction methods are utilized (Yi, 2003; Nouri et al. 2006b; Kim et al., 2011; Chen, 2012; Cui et al. 2016; Han and Cundall 2017; Li et al., 2018; Wang et al., 2018, 2019a, 2019b; Garolera et al., 2019; Khamitov et al., 2022). Numerical models can be grouped under continuum and discontinuum methods. In the continuum method, rock is treated as a continuous material, and constitutive laws are employed when describing its behavior. The main disadvantage of continuum models is that they cannot describe local discontinuities as particle detachment. In this case, discontinuum methods are used. Three-dimensional discrete element modeling (DEM) can capture the motion of individual grains and therefore provide information on sanding mechanism at micro-scale. Considering each individual particle makes DEM computationally expensive and time-consuming, which prevents using DEM for large-scale problems.

Laboratory experiments are very useful for studying the mechanisms of sand production, although their results are strongly influenced by boundary conditions and they are not always straightforward to apply to field-scale problems (Skjaerstein et al. 1997; Nouri et al. 2006a; van den Hoek et al. 2007; Fattahpour et al., 2012; Wu et al. 2016; Kozhagulova et al. 2021; Shabdirova et al., 2022). Experimental results are mostly used to calibrate and validate analytical and numerical sand production models.

Despite advancements in analytical, numerical, and experimental techniques for predicting sand production, these methodologies are often limited by their reliance on simplifying assumptions, computational intensity, and the challenges of scaling from laboratory conditions to field applications. ML offers a promising alternative by leveraging vast amounts of data to identify complex patterns and relationships that traditional models may overlook. By integrating ML with existing predictive models, the industry can achieve a more comprehensive and robust understanding of sand production dynamics, leading to better-informed decision-making and reduced operational risks.

The prediction of sanding onset can be viewed as a classification task with two outcomes—whether sanding will or will not occur under given conditions. Several studies have successfully applied ML algorithms to predict sanding onset and analyze the significance of input variables. Khamehchi et al. (2014) presented an analysis of sand production onset in 23 field datasets from the North Adriatic Sea, utilizing simple regression and artificial neural network (ANN) algorithms to predict critical total drawdown (CTD). Gharagheizi et al. (2017) applied support vector machine (SVM) to predict sanding onset using data from 31 wells in the Northern Adriatic Basin. The algorithm, evaluated using receiver operating characteristics (ROC) curve, demonstrated high predictive capability. Ketmalee and Bandyopadhyay (2018) employed ANN to generate synthetic logs for sand prediction tools in the Bongkot field, Thailand, successfully validating the model with real well data. Ngwashi et al. (2021) compared the performance of ANN with backpropagation and SVM in predicting sanding onset in the Niger Delta region, reporting 80% and 100% accuracy, respectively. Abdelghany et al. (2022) used a probabilistic neural network to infer reservoir properties and predict sand production onset, reporting improved accuracy compared to conventional models. Song et al. (2022) applied four ML algorithms to predict sand production from gas-hydrate sediments, recommending XGBoost for early-stage prediction.

While there are significant researches on using ML for classifying the onset of sand production, less focus has been on quantitatively predicting the amount of sand produced. Most studies aimed to guide the decision on whether to implement sand control measures during the well completion phase. However, the actual amount of sand produced is crucial for planning surface facilities and cost-effective management, indicating a need for more research in this area.

This manuscript builds upon our previous work published (Shabdirova et al., 2023), whereby we explored the potential of ML algorithms in predicting transient sand production in the Karazhanbas oilfield in Western Kazakhstan. Our initial study, based on data from 43 wells, established a foundational understanding of the applicability of the Random Forest (RF) algorithm for predicting sand production behaviors. Building on these findings, the current research, which now encompasses data from 2000 wells, placed greater emphasis on refining data preparation methodologies rather than exclusively focusing on ML techniques. This strategic shift allowed for a more thorough investigation into the intricacies of data preprocessing, aiming to improve the quality of input data. This expansion is instrumental in facilitating a more comprehensive analysis, enhancing predictive accuracy, and enabling the exploration of more complex aspects of sand production prediction.

This paper is structured as follows: The Oilfield Information section gives background information on the oilfield. The Methodology section outlines an approach to sand production prediction, incorporating data collection, ML algorithms, model training, and validation, as well as input variable analysis. The Results and Discussion section of the manuscript assesses various operational and geological factors influencing sand production in the Karazhanbas oilfield. It demonstrates the importance of data diversity and quality in improving prediction accuracy, highlights the significant impact of fault proximity on predictions, and introduces novel metrics for evaluating model performance, especially regarding peak sand production events. The manuscript concludes with a summary of the main findings, emphasizing their implications for the industry and suggesting directions for future research.

Oilfield Information

The Karazhanbas oilfield, located in the Ustyurt-Buzachi sedimentary basin in Western Kazakhstan, is characterized by a gentle, elongated anticlinal fold, flanked by western and eastern arches (Fig. 1). This field's structure is divided by a series of faults. Its reservoirs are found within the Lower Triassic, Middle Jurassic, and Lower Cretaceous strata. Despite the region's tectonic changes, these formations remain shallow and loosely consolidated. The field exhibits permeability of up to 0.5 DarciesFootnote 1 and oil viscosity reaching 600 mPa s. Currently, the oilfield is experiencing a drop in production, with reservoir pressure maintained through water injection. Sand production was observed in all Lower Cretaceous Formations (horizons A, B, D, G, and V) but not in Jurassic Formations (J1 and J2). The oilfield has more than 7000 wells, including injecting wells and abandoned wells. Production and geological data from 2000 producing wells were analyzed in the current study. Figure 2 shows the locations of wells (blue cross) in the field along with fault lines (yellow lines).

Figure 1
figure 1

Geological cross-section of the Karazhanbas oilfield (modified after Shabdirova et al. (2023))

Figure 2
figure 2

Location of the producing wells in the Karazhanbas oilfield

The properties of the formations are presented in Appendices 1 and 2. The data suggest significant differences in the composition of these geological layers. Horizon G shows a slightly lower quartz content and a similar range for feldspars and rock fragments compared to other Cretaceous Formations. Kaolinite in the G layer is noted in small amounts and is locally concentrated in the form of bundles in pore fillings, surrounding grains, and sometimes forming vermiculite booklets. In the A layer and in Jurassic deposits, medium-crystallized kaolinite predominates, covering and replacing grains. Layers A and B are characterized by relatively moderate gas content and saturation pressure, suggesting a less volatile oil type compared to deeper layers. Layers G and D exhibit a broader range of gas content and saturation pressure, pointing to variability in reservoir conditions or oil types, possibly indicating a more complex geological history or variations in organic matter input and thermal maturation. Lower Cretaceous Formations (A, B, D, G, and V) show high permeability values, particularly horizon G with values up to 4000 mDFootnote 2, suggesting highly conductive pathways for fluid flow, which could contribute to sand production under production stress.

The area of the oilfield is divided into four blocks, separated by tectonic faults. The wells situated closer to a fault are believed to have a boundary-dominated regime (Ortiz, 2013). However, flow regime analysis of the flowing wells in different blocks showed that even the wells located far from the faults also have a boundary-dominated regime (Fig. 3). The flow regime analysis was based on a logarithmic plot of the liquid rate over time of the flowing well, which in this case had a slope of -1, indicating that well production was affected by the boundary (Ortiz, 2013), which is a fault in this case.

Figure 3
figure 3

Example of the flow regime of the flowing well, located farthest from the fault: The log-log curve has an approximate slope of -1

Methodology

Data Collection

The production data from an oilfield come as a database containing monthly information for each well, such as production date, producing horizon, liquid rate, sand mass, reservoir pressure, and water cut. These data were refined by removing any missing or irrelevant information. Reservoir thickness and depth were sourced from well perforation intervals, while porosity and net-to-gross ratios were obtained from well logs. However, these logs were available for only about 10% of the wells. For each geological horizon, these values were averaged. For wells lacking specific data, the averaged values from their respective horizons were used as estimates. As a result, for ML modeling, selected inputs included date, liquid rate, water cut, reservoir thickness, depth, porosity, and net-to-gross ratio. The output variable chosen was sand production. A separate file with the wells’ coordinates and a map with faults were used to define the shortest distance of each well to a fault.

Machine Learning Algorithms

This research examined one classification task and four regression tasks. For the classification task, ML algorithms such as RF, SVM, decision tree (DT), and K-nearest neighbors (KNN) were utilized. In the case of regression tasks, the RF algorithm was chosen based on its superior performance in previous sand production prediction studies (Shabdirova et al. 2023). While prior research indicates the RF algorithm as the most effective for analyzing sand production, its advantage over alternative models is not markedly significant. More crucially, the quality of input data plays a vital role, underscoring the importance of data processing. Consequently, this investigation shifted the emphasis from algorithm selection to enhancing data analytics practices.

Model Training and Validation

In this study, we sought to address the following main research questions:

  1. 1.

    Is it possible to predict sand production in wells within a specific block using data from other blocks?

  2. 2.

    Can knowledge of sand production from certain geological horizons be applied to predict sand production in wells from different horizons?

  3. 3.

    What impact does the quality of input data have on prediction accuracy?

  4. 4.

    How does the distance of wells from faults influence sand production predictions?

To answer the first question, multiple ML simulations with various training and testing data combinations were conducted. For example, in one simulation, a model was trained using data from wells in Block 1 and then tested using data from wells in Block 2. Another simulation involved training with Block 1 data and testing with Block 3 data, and so forth.

Addressing the second question involved training and testing models with data segmented by geological horizons within the same block, specifically Block 2, chosen for its high well count. In an initial simulation for this scenario, the model was trained on data from horizon A and tested on data from horizon B within Block 2, followed by subsequent tests on data from other horizons like G and D.

For the third question, three simulations were conducted using the same set of testing data, which comprised data from a selection of wells in Block 2 representing 20% of the total. The training data varied: The first simulation included only operational data (liquid rate and water cut), the second used only geological data (reservoir thickness, depth, porosity, and net-to-gross ratio), and the third combined both operational and geological data.

In response to the fourth question, wells in Block 2 were categorized as ‘Near’ or ‘Distant’ based on their proximity to faults—less than 500 m and more than 500 m away, respectively. In the Karazhanbas field, no investigations have been conducted to assess the fault damage zone around the faults, and so the exact distance from the fault that remains unaffected by near-fault damages is unknown. For this study, we used a distance of 500 m based on the study by Choi et al. (2016), who analyzed the existing literature on fault damage zones for different types of reservoirs and found that the damage zone around faults does not exceed 500 m in all reported cases. The ML model training and testing were performed using different combinations of these categories. When models were trained and tested within the same category (‘Near’/’Near’ and ‘Distant’/’Distant’), the data were split such that 80% were used for training and 20% for testing.

For all tasks, the root mean square error (RMSE) served as the primary validation metric. Additionally, in the fourth task, the R2 (coefficient of determination) metric was also utilized alongside RMSE for a more comprehensive assessment of model performance.

Input Variable Analysis

An analysis of the input variables in terms of box-and-whisker plots and correlation plots is shown in Figure 4. The box-and-whisker plots (Fig. 4a) illustrate the diversity in the input variables. The plot’s upper and lower boundaries represent the third and first quartiles, respectively. The horizontal line within each plot is the median for each variable. These plots are beneficial for identifying outliers and verifying the data prior to modeling. In the current dataset, a notable number of outlier points were observed for the 'liquid rate' and 'sand' variables.

Figure 4
figure 4

Input variables analysis: (a) box-and-whisker plots and (b) correlation matrix

Correlation between variables can be assessed through the correlation coefficient, which ranges from -1 to +1. The strength of a relationship between two variables is shown by the absolute value of this coefficient. A positive or negative sign denotes a direct or inverse relationship, respectively. In this study, the overall correlation among variables appears to be low. The highest correlation, with coefficient of 0.37, was found between 'ntg' (net-to-gross ratio) and 'porosity.' It is important to highlight, however, that there is no inherent physical relationship between net-to-gross and porosity of a rock, suggesting that the observed correlation might not imply a direct causality. There was a positive correlation of 0.10 between liquid rate and sand production, suggesting that, as the liquid rate increases, sand production also tends to increase, albeit weakly. This finding contrasts with conventional analytical sand prediction models, which typically indicate a direct proportionality between liquid rate and sand production. Another unexpected result from this ML analysis was the negligible correlation of 0.011 between reservoir thickness and sand production. Traditional analytical and numerical models generally predict that larger reservoirs produce more sand. Similarly, while higher porosity is often associated with increased sand production in conventional models, our dataset shows a very weak correlation of 0.041 between porosity and sand production, suggesting minimal influence. Depth is another important parameter for sand production as it represents stress, which is crucial for rock failure. Additionally, the compartmentalized geology of the field results in different depths for the same formation, adding to the complexity. Despite the low linear correlation between depth and sand production, we can expect a more complex relationship that can be revealed through ML modeling.

Results and Discussion

The ML analysis started with application of different ML methods to classify the wells into sand-producing wells and no sand-producing wells based on a threshold value of 0.1 t/month, which was established by the operating company. Table 1 shows the accuracy (Forsyth, 2019) of each method, which is good for all ML methods.

Table 1 Classification results

Figure 5 shows the locations of the wells with sand production of more than 0.1 t/month, i.e., sand-producing wells. Most of them are located in near-fault regions. The color scale corresponds to the average sand production of each well. While most wells produce sand at less than 5 t/month, several wells have extremely high sand production (red and yellow dots in the plot). The well data suggest that all the wells produce from horizons A and/or G (Fig. 1) and with very high liquid rate up to 2000 t/day. Figure 6 plots the average flow rate values of randomly selected 100 wells. Red and blue bars correspond to sand-producing and no sand-producing wells, respectively. Thus, the results indicate that high sand production was associated with high flow rates.

Figure 5
figure 5

Locations of the sand-producing wells with sand production of more than 0.1 t/month

Figure 6
figure 6

Average liquid rate of the sand-producing and no sand-producing wells

Is it Possible to Predict Sand Production in Wells within a Specific Block using Data from Other Blocks?

Further analysis involved assessing the correlation between different blocks (Fig. 2) in terms of sand production prediction. Block 2 had the largest area and, correspondingly, the largest number of wells and production data. The simulations were conducted to evaluate whether the production data of all sand-producing wells in one block can be used to predict the data from another block. Figure 7 shows the RMSE of the ML modeling for different training and testing data combinations. Generally, models trained on a more diverse set of blocks (e.g., 2 + 3 + 4, 2 + 3) tended to perform better (lower RMSE) than those trained on fewer blocks. This suggests that including a wider range of data in the training set improves prediction accuracy.

Figure 7
figure 7

ML prediction results for different train and test blocks

The performance of the models varied significantly depending on the test block used. For instance, models tested on Block 1 generally had lower RMSEs compared to those tested on Blocks 3 or 4. This could indicate differences in the characteristics of each block or the quality and quantity of data available from each block. Higher RMSEs were consistently observed for certain test blocks (like Block 4) regardless of the training data combination. This might point to inherent challenges in modeling the data from these blocks, possibly due to complex geological features, variability in data quality, or other factors.

As expected, the best results were achieved for the cases with large amounts of training data. However, further studies demonstrated that the principle of 'more data yielding better outcomes' does not uniformly apply.

Can Knowledge of Sand Production from Certain Geological Horizons be Applied to Predict Sand Production in Wells from Different Horizons?

The study was then narrowed to Block 2 wells, and further analysis was focused on assessing the correlation between different horizons. The observations from Figure 8, however, present a complex picture; the data do not readily offer clear insights into the correlation between horizons, making it challenging to draw definitive conclusions. Notably, horizon G consistently emerged as the most problematic, frequently registering the highest RMSEs across all training sets, indicating its complexity. On a positive note, the analysis indicated that integrating multiple training horizons generally led to a reduction in RMSE. This trend underscores the potential benefits of diversifying training data to improve model accuracy. Looking ahead, incorporating the mechanical properties of each horizon as input data in future studies could provide a more nuanced understanding and potentially elevate the precision of the predictions.

Figure 8
figure 8

Prediction results for different combinations of training and testing horizons

What Impact does the Quality of Input Data Have on Prediction Accuracy?

As presented above, selected inputs included date, liquid rate, water cut, reservoir thickness, depth, porosity, and net-to-gross ratio. Liquid rate and water cut can be considered as operational input data, while reservoir thickness, depth, porosity and net-to-gross ratio represent geological features. ML simulations were conducted to assess impact of input data type on the prediction. Figure 9 presents exemplary outcomes based on a specific set of training data. The RMSE was the lowest (0.85) when the input data contained only operation data: flow rate and water cut. This can be attributed to the predominance of the quality rather than the quantity of input data: Using all available input data results in a higher RMSE of 1.12 compared to just operational data. This increase implies that including more types of data does not necessarily improve model performance and might introduce noise or irrelevant information. Thus, porosity and net-to-gross ratio values were sourced from logs, available for only one-tenth of the wells. These values were then averaged for each horizon. For the remaining wells without specific data, values were extrapolated from the corresponding horizon. Moreover, geological data remained unchanged over the production. Even in instances where it might vary, data for such changes were not available. These two factors introduce noise into the input data, resulting in diminished accuracy of the prediction results.

Figure 9
figure 9

Impact of input data type on the prediction performance

How does the Distance of Wells from Faults Influence Sand Production Predictions?

Further analysis was focused on assessing the impact of fault proximity on the predictive accuracy of the sand production prediction model. The analysis was categorized into six distinct cases, each representing a combination of training and testing scenarios based on the proximity of wells to geological faults. 'Distant' wells were situated more than 500 m away from all faults on the margins of Block 2, while 'Near' wells were located within 500 m of these faults. The simulations demonstrated the complexities inherent in model evaluation when multiple key metrics were used, specifically RMSE and R2 (Fig. 10). An exclusive reliance on R2 suggests a moderate level of model performance, with values clustering between 0.4 and 0.6 across different train–test combinations. Conversely, RMSE offers a more discerning perspective, indicating optimal prediction performance when training and testing datasets comprised wells of similar locations (0.05 for distant/distant train/test combination). This divergence underscores the inherent limitations of traditional metrics like RMSE and R2, particularly in capturing the critical nuances of sand production, such as the significance of peak values. These peak values are paramount, given their substantial operational implications.

Figure 10
figure 10

Impact of fault proximity on prediction performance

Given the operational importance, not only of the magnitude but also of the timing of peak sand production, it is essential to anticipate these high-risk events for effective operational planning and risk management. Consequently, we advocate for the adoption of new metrics tailored to the unique demands of sand production prediction. These proposed metrics, peak prediction accuracy (PPA) and peak time prediction accuracy (PTPA), were designed to precisely gauge a model's proficiency in forecasting these operationally pivotal events. By focusing on these aspects, PPA and PTPA offer a more operationally relevant assessment of model performance, aligning the evaluation more closely with the end-users' needs and the practical demands of sand production management. Thus:

$${\text{PPA}} = 100 - { }\frac{{\left| {{\text{actual \;peak \;sand \;mass}} - {\text{predicted \;peak \;sand \;mass}}} \right|{ }}}{{\text{actual \;peak \;sand \;mass}}} \times 100$$
$${\text{PTPA}} = 100 - { }\frac{{\left| {{\text{time\; at \;actual \;peak \;sand \;mass}} - {\text{time \;at \;predicted \;peak \;sand \;mass }}} \right|}}{{\text{time \;at \;actual \;peak \;sand \;mass}}} \times 100$$

Figure 11 shows bar charts with the new metrics. It becomes evident that predicting the timing of peak sand production events poses significant challenges. The data suggest a more reliable identification of sand production patterns in wells distant from faults. However, wells in proximity to faults seem to be subject to unique geological and operational influences that significantly affect sand production. This underscores the complexity introduced by fault proximity, necessitating tailored approaches in predictive modeling to account for these specific factors.

Figure 11
figure 11

Sand production prediction performance with new metrics. Note: Higher accuracy is associated with values closer to 100%

Incorporating stress-related domain-specific knowledge and relevant features into the model could improve prediction accuracy. In summary, despite the fact that all wells in the field are produced in a boundary-dominated regime, the proximity to faults appears to be a significant factor influencing the prediction of sand production events. Tailoring models to specific conditions, enriching training data with diverse scenarios, and incorporating domain-specific insights are key steps toward improving the prediction accuracy for both the magnitude and timing of sand production events.

Conclusions

This study presented an extensive analysis of sand production prediction in the Karazhanbas oilfield, leveraging ML techniques to improve operational efficiency and risk management. By employing a rich dataset from 2000 wells, the study delved into the intricate dynamics of sand production, highlighting challenges in the prediction of sand production including the peak sand mass and its occurrence. As an oilfield usually covers very large areas with thousands of wells over long production life spanning several decades, it produces a large amount of data. While operational data can be monitored continuously, geological data are much more limited with discrete samples, some parameters of which could only be estimated as average values over certain horizons and locations.

ML usually requires a large dataset for training to improve prediction accuracy. Geological and operational data, however, were not always measured leading to imbalanced dataset. Data processing was found very important for a meaningful prediction of sand production across the oilfield with the ML techniques. Different scenarios were conducted to investigate the sensitivity of the prediction accuracy depending on the location and depth of the data sources as well as on the proximity to the nearby major geological faults. Incorporating mechanical property of each horizon as input data in future studies could provide a more nuanced understanding and potentially elevate the precision of the predictions. This finding elucidates the complex interplay between geological and operational factors in sand production, advocating for models tailored to specific conditions and enriched with diverse data scenarios and domain-specific insights.

The investigation focused on several key research questions:

  • The analysis demonstrated that ML models trained on diverse datasets from multiple blocks performed better than those trained on fewer blocks, highlighting the importance of data diversity in improving prediction accuracy.

  • The study found that integrating data from multiple horizons generally reduced the prediction error, suggesting that a more comprehensive dataset enhances prediction accuracy. However, some horizons consistently showed higher RMSEs, indicating their complexity and the need for additional data or refined models.

  • The findings revealed that using only operational data (liquid rate and water cut) resulted in lower RMSE than using all available data, including geological features. This suggests that the quality of input data significantly impacts prediction accuracy, and introducing noise through less accurate geological data can diminish model performance.

  • The study underscored that wells' proximity to faults significantly affects sand production predictions. Models trained and tested on wells from similar locations (e.g., distant from faults) performed better, emphasizing the need for tailored approaches in predictive modeling to account for these geological complexities.

The applied ML methods demonstrated commendable classification accuracy in identifying sand-producing wells. The research introduced novel metrics—PPA and PTPA—tailored to the operational needs of sand production management. These metrics were instrumental in evaluating model performance more relevantly, especially in forecasting operationally pivotal events like peak sand production.

In conclusion, this research marks a significant stride in sand production prediction. It not only reinforces the potential of ML in enhancing understanding and forecasting of sand production, but it also paves the way for future studies to focus on refining prediction models. By incorporating stress-related domain-specific knowledge and enriching training data, future research can further improve the accuracy and reliability of sand production prediction, ensuring more informed decision-making and efficient operational management in oilfield operations.