Introduction

Indonesia lies at the confluence of three major tectonic plates, namely the Eurasian Plate, the Indo-Australian Plate, and the Pacific Plate1. Based on the records of the Meteorological, Climatological, and Geophysical Agency (BMKG), around 5000–6000 earthquakes were recorded between 2008 and 2018, and 15 destructive earthquakes occurred in 2019 alone2. Earthquake detection using accelerometer sensors has been carried out by several researchers3,4,5,6. However, the vibrations such a sensor detects are not only those caused by earthquakes but also other vibrations, such as those from heavy objects dropped on the floor, heavy vehicles passing by, explosions, or someone trying to break the box. Because earthquake waves resemble this seismic noise, earthquake early warning systems are sometimes triggered accidentally and issue false alerts. It is therefore necessary to classify earthquake and seismic-noise signals to avoid detection errors7,8.

Several researchers have also studied the use of machine learning in the seismic field. According to Nishita Narvekar9, the seismic signals recorded at earthquake stations are often mixed with noise, so the noise must be removed with filtering techniques before the data are fed to a machine learning algorithm. In addition, applying the Fast Fourier Transform (FFT), one of the methods widely used in seismology, to the seismic signal can reduce computation time10,11, and combining it with machine learning algorithms has been shown to give the best results. Furthermore, a comparison of SVM, DT, and RF algorithms for distinguishing earthquake vibrations from noise showed that the RF algorithm gives the better performance.

Other researchers12 proposed a seismic detection system that can be implemented at a seismic station, using ANN and SVM to classify local earthquakes and other possible vibrations. The data were collected from the PVAQ station in Portugal and split into 60% training data, 20% testing data, and 20% validation data. The models' performances show that the ANN achieved values greater than 95%, whereas the SVM obtained an almost perfect classification.

In another study13, DT was used to solve two classification problems involving signals. The purpose was to learn signal temporal logic (STL) formulas to find patterns in data that do not conform to the expected behavior (anomaly detection). The results show that DT provides good performance and can be interpreted over specific application domains. In another case14, DT was applied to classify the condition of wind turbine blades by evaluating the turbine vibration signal. Of the 600 data samples, 100 were from blades in good condition, and the DT classifier proved very effective for diagnosing this problem. These studies indicate that the DT algorithm is well suited to signal and vibration classification.

According to Saman Yaghmaei-Sabegh15, the characteristics of earthquake ground motion are inherently uncertain. His study classified earthquake ground motions using K-means clustering and the Self-Organizing Map (SOM) network, two powerful unsupervised clustering techniques, based on six different scalar frequency-content indicators. Both synthesized and real datasets were used. The results show that the T0 parameter (the smoothed spectral predominant period) performed best among all scalar indicators, and that K-means clustering outperformed SOM in the pattern recognition and classification procedure.

Among the machine learning algorithms deployed in recent studies across a wide range of engineering fields, the most reliable models are the Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), and Artificial Neural Network (ANN). This study therefore investigates the reliability of these models for earthquake detection and compares them to find the best algorithm for earthquake multi-classification. The acceleration dataset is based on seismic events in the Indonesian region, especially on Java Island, recorded at 3 different stations and obtained from the ESM (Engineering Strong Motion) database16. Because an accelerometer alone cannot capture the shaking pattern of the ground7, this study integrates the acceleration data to obtain velocity and displacement as additional features, which are expected to improve the performance of each algorithm so that earthquakes can be detected more accurately. Model performance is evaluated using accuracy, precision, recall, and F1 score.

Methodology

Study area and data collection

The study area is Indonesia, focused on Java Island. Java is considered the fourth largest island in Indonesia and has the highest population density. It is part of the complex convergence zone between the Eurasian plate and the Indo-Australian plate, and the region therefore experiences frequent seismic and volcanic activity. Between 2006 and 2020, earthquakes and other geohazards on volcano-dotted Java caused about 7000 deaths, and another 1.8 million people were injured, displaced, or left homeless17,18.

Seismic wave acceleration data were collected from the ESM database at 3 different stations, namely CISI, SMRI, and UGM, which are located on Java Island as shown in Fig. 1.

Figure 1. Java Island map with the location of the stations: (a) CISI, (b) SMRI, (c) UGM [Imagery © 2021 TerraMetrics, Map data © 2022 Google].

These stations recorded earthquake events that occurred around Java Island between 2006 and 2009. There are 33 records from CISI, 8 from SMRI, and 17 from UGM, giving a total of 58 earthquake events recorded around Java Island by these stations. Each record contains 3 channels, HLE, HLN, and HLZ, with acceleration seismic-wave information for each channel, as shown in Fig. 2. The acceleration seismic waves are integrated to obtain velocity and displacement seismic waves, which are used as additional features to improve the models' performance. The relation between acceleration, velocity, and displacement is described by19:

  • Acceleration:

    $$\text{Acceleration} = a(t)$$
    (1)
  • Velocity:

    $$v(t) = v_{0} + \int_{0}^{t} a\,dt$$
    (2)
  • Displacement:

    $$r(t) = r_{0} + \int_{0}^{t} v\,dt$$
    (3)

    where \(v_{0}\) is the initial velocity and \(r_{0}\) is the initial position at the initial time \(t_{0}\). The integration result for the acceleration seismic wave can be seen in Fig. 3.
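As an illustration of Eqs. (1)–(3), the sketch below integrates a single acceleration window numerically with the cumulative trapezoidal rule. The 0.005 s sampling interval follows the dataset description; the zero initial conditions and the random placeholder trace are assumptions, not the authors' processing code.

```python
import numpy as np
from scipy.integrate import cumulative_trapezoid

dt = 0.005                              # sampling interval in seconds (200 Hz)
acceleration = np.random.randn(200)     # placeholder for one 1 s acceleration window

# Eq. (2): v(t) = v0 + cumulative integral of a(t); v0 assumed to be 0.
velocity = cumulative_trapezoid(acceleration, dx=dt, initial=0.0)
# Eq. (3): r(t) = r0 + cumulative integral of v(t); r0 assumed to be 0.
displacement = cumulative_trapezoid(velocity, dx=dt, initial=0.0)
```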

Figure 2. Unprocessed dataset for each channel for 1 event.

Figure 3. Integration result.

Dataset processing

The data from the ESM database are ASCII files containing detailed event information as well as the acceleration seismic-wave data for each event. All the data go through an FFT process to obtain the frequency content of the seismic wave, which is then used in a filtering step with a Butterworth bandpass filter (filter order = 2, minimum frequency = 0.1 Hz, maximum frequency = 30 Hz) to reduce noise. Figure 4 shows the result of applying this filtering process to the data in Fig. 2.

Figure 4. Processed dataset for each channel for 1 event.
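A hedged sketch of this filtering step is shown below: a second-order Butterworth bandpass (0.1–30 Hz) applied with zero-phase filtering, assuming a 200 Hz sampling rate and a random placeholder trace. The exact filter implementation used for the published results may differ.

```python
import numpy as np
from scipy.signal import butter, filtfilt

fs = 200.0                              # sampling rate in Hz (0.005 s interval)
lowcut, highcut, order = 0.1, 30.0, 2   # filter settings stated in the text

# Design a second-order Butterworth bandpass with cutoffs normalised by Nyquist.
b, a = butter(order, [lowcut / (fs / 2), highcut / (fs / 2)], btype="bandpass")

raw_channel = np.random.randn(4000)     # placeholder for one raw HLE/HLN/HLZ trace
filtered = filtfilt(b, a, raw_channel)  # zero-phase filtered trace
```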

After the filtering process, the data are sampled. In this step, the acceleration seismic-wave data are split into earthquake and non-earthquake data. For each seismic event, the earthquake and non-earthquake data each contain 200 samples (equivalent to 1 s, since the sampling interval is 0.005 s): the earthquake samples are taken starting from the P-wave arrival, and the non-earthquake samples are taken from the beginning of the record up to the P-wave arrival. With a total of 58 seismic events, each of the earthquake and non-earthquake datasets has 3 columns (HLE, HLN, and HLZ) with 11,600 rows per column [11,600, 3]. The 3 columns are then merged into 1 [11,600, 1] by taking the resultant (vector magnitude), so only the acceleration amplitude is used as a feature (a minimal sketch of this step follows Fig. 5). The resultant amplitude over all seismic events can be seen in Fig. 5 for both earthquake and non-earthquake data. Finally, label information is added to the datasets: 0 represents non-earthquake data and 1 represents earthquake data.

Figure 5. Resultant result.
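The resultant step can be sketched as follows: the three channels are collapsed into a single amplitude column by taking the per-sample vector magnitude. The random placeholder values and the column names are illustrative assumptions.

```python
import numpy as np
import pandas as pd

# Placeholder [11600, 3] table of filtered acceleration samples (HLE, HLN, HLZ).
channels = pd.DataFrame(np.random.randn(11600, 3), columns=["HLE", "HLN", "HLZ"])

# Resultant amplitude: per-sample vector magnitude of the three components.
channels["amplitude"] = np.sqrt((channels[["HLE", "HLN", "HLZ"]] ** 2).sum(axis=1))
channels["label"] = 1                   # 1 = earthquake window, 0 = non-earthquake
```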

For the vandalism vibrations, 2 vandalism datasets were recorded with an accelerometer sensor and treated in the same way as the earthquake and non-earthquake datasets. The first vandalism dataset contains 11,600 samples labeled 2, collected by shaking the table while the sensor was on top of it (a made-up earthquake). The second contains 750 samples labeled 3, collected by the sensor while a heavy vehicle was passing by. In total there are 4 datasets with 35,550 samples. The statistical analysis of each dataset is presented in Table 1. Lastly, the integration formulas are applied to all acceleration amplitude datasets to obtain the velocity and displacement amplitudes, which are used as additional features.
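A minimal sketch of how the four labelled amplitude datasets could be assembled is given below; the placeholder arrays only mirror the sample counts and labels stated above and do not reproduce the recorded data.

```python
import numpy as np
import pandas as pd

# label -> sample count, mirroring the four datasets described in the text:
# 0 = non-earthquake, 1 = earthquake, 2 = made-up earthquake, 3 = heavy vehicle.
sizes = {0: 11600, 1: 11600, 2: 11600, 3: 750}

parts = [pd.DataFrame({"acceleration": np.random.randn(n), "label": label})
         for label, n in sizes.items()]
dataset = pd.concat(parts, ignore_index=True)   # 35,550 rows in total
```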

Table 1 Statistical analysis of datasets.

Model selection

The supervised machine learning algorithms used in this study are the Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), and Artificial Neural Network (ANN). SVM is a supervised learning algorithm for finding patterns in complex datasets. It is a powerful and versatile machine learning model, capable of linear and non-linear classification, regression, and outlier detection. When SVM theory was introduced by Vapnik and Cortes in 1995, it was designed for two-group (binary) classification; the idea had previously been implemented for the restricted case where the training data can be separated without error. In practice, SVM has been applied to pattern and digit recognition, where experiments show it can compete with other classification methods such as decision trees and neural networks20. The binary SVM approach can, however, be extended to multiclass scenarios by decomposing the multiclass problem into a series of binary problems, following either the one-against-one or one-against-all strategy21.

In SVM,

  • input: \({x}_{i}\in {\mathbb{R}}^{D}\), with D = feature dimension,

  • output: \(w\) (weights), one for each feature, whose linear combination with the input produces \(y\), the decision value of the SVM model.

    $$y = w^{T} x_{i} + b$$
    (4)

where \(b\) is the bias.

To maximize the margin, the norm of the weight vector \(w\) must be minimized, which maximizes the distance from the closest data points to the hyperplane. When the hyperplane cannot separate the two classes perfectly, a slack variable (\(\xi_{i}\)) and a hyperparameter C must be added. The hyperparameter regulates the use of the slack variables: if C is too small the model may underfit, and if C is too large the model may overfit.

$$\min_{w,b} \frac{1}{2}\left\| w \right\|^{2} + C\sum_{i=1}^{m} \xi_{i}$$
(5)

When the input data cannot be separated linearly, the data must be mapped to a higher-dimensional space. If the new dimension is very large, computing this mapping explicitly takes a long time. The kernel trick solves this by implicitly adding features without computing the mapping. In this research, the Gaussian RBF (Radial Basis Function) kernel is used.

$$K(x, l) = \exp\left( -\gamma \left\| x - l \right\|^{2} \right)$$
(6)

where \(x\) is the feature vector, \(l\) is the landmark, and

$$\gamma = \frac{1}{2\sigma^{2}}$$
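A minimal sketch of an RBF-kernel SVM corresponding to Eqs. (4)–(6), using scikit-learn's one-vs-one multiclass decomposition, is shown below. The synthetic data and the C and gamma values are illustrative assumptions rather than the hyperparameters tuned in this study.

```python
from sklearn.datasets import make_classification
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the 3-feature, 4-class vibration dataset.
X, y = make_classification(n_samples=1000, n_features=3, n_informative=3,
                           n_redundant=0, n_classes=4, n_clusters_per_class=1,
                           random_state=0)

# RBF-kernel SVM (Eq. 6); SVC handles the multiclass case via one-vs-one binary SVMs.
svm = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0, gamma="scale"))
svm.fit(X, y)
```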

The DT algorithm can be used for classification and regression, and also for data with multiple outputs. It classifies data by forming a tree that is traversed from the root node to a leaf node. Each node stores the feature condition used to decide the direction of data flow, the Gini impurity, the number of samples reaching the node, the class prediction value, and the class of the data at that node.

Determining a branch in the DT requires the Gini impurity of the data. Gini impurity is a score between 0 and 1, where 0 means all observations at the node belong to one class and values approaching 1 indicate that the elements are randomly distributed across the classes. The split with the lowest impurity is selected for the next branch22; the lower the Gini impurity, the better the split and the lower the likelihood of misclassification. Equation 7 gives the Gini impurity, where \(p_{i,k}\) is the proportion of class k among the samples at node i, and n is the number of classes.

$$\text{Gini impurity}\ (G_{i}) = 1 - \sum_{k=1}^{n} p_{i,k}^{2}$$
(7)
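A small worked example of Eq. (7) is given below: the Gini impurity of a node computed from the class proportions of the samples that reach it.

```python
import numpy as np

def gini_impurity(labels):
    """Eq. (7): G_i = 1 - sum_k p_{i,k}^2 over the samples reaching one node."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

print(gini_impurity([0, 0, 0, 0]))   # 0.0  -> pure node (single class)
print(gini_impurity([0, 1, 2, 3]))   # 0.75 -> evenly mixed four-class node
```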

RF is an ensemble learning technique that aggregates a large number T of decision trees and uses majority voting to determine the final classification from the prediction of each DT. RF uses row and column sampling of the data for each tree, so each tree is trained on different data. This reduces the variance without increasing the bias, and the accuracy of the model can be further improved by increasing the number of trees (ntree) in the CART ensemble23.
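A hedged sketch of an RF classifier along these lines is shown below; the number of trees and the synthetic data are illustrative assumptions, not the settings used in this study.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic stand-in for the 3-feature, 4-class vibration dataset.
X, y = make_classification(n_samples=1000, n_features=3, n_informative=3,
                           n_redundant=0, n_classes=4, n_clusters_per_class=1,
                           random_state=0)

# Each tree is grown on a bootstrap sample of rows with a random feature subset
# per split; the final class is decided by majority vote over the ensemble.
rf = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
rf.fit(X, y)
```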

An artificial neural network (ANN) is an information-processing system that shares certain performance characteristics with biological neural networks. ANNs are used as statistical models for predicting complex systems in engineering. Their massively parallel structure, with a large number of simple connected processing units called neurons, allows the ANN to be used for complex, linear as well as non-linear input-output mappings24,25,26.

The most common ANN training method is the backpropagation algorithm, which adjusts the weights between neurons to reduce the prediction error. This model is quite effective at identifying patterns; it can adapt quickly to new data values, but it may converge slowly and risks getting stuck in a local optimum. A significant challenge is choosing the number of layers, the number of neurons in the hidden layers, and how those neurons are connected, because the performance of the network depends greatly on these factors and any of them can significantly alter the outcome. Different ANN architectures will produce different solutions for different problems27.
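As a minimal sketch of a backpropagation-trained feed-forward network, the example below uses scikit-learn's MLPClassifier with a single 32-neuron hidden layer; the architecture and synthetic data are assumptions for illustration only.

```python
from sklearn.datasets import make_classification
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the 3-feature, 4-class vibration dataset.
X, y = make_classification(n_samples=1000, n_features=3, n_informative=3,
                           n_redundant=0, n_classes=4, n_clusters_per_class=1,
                           random_state=0)

# Feed-forward network trained with backpropagation; one hidden layer assumed.
ann = make_pipeline(StandardScaler(),
                    MLPClassifier(hidden_layer_sizes=(32,), max_iter=500,
                                  random_state=0))
ann.fit(X, y)
```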

All the models are used to classify earthquake, non-earthquake, and vandalism vibrations, with the data split into training and testing sets at a ratio of 70:30. The models' performance is then evaluated with the confusion matrix, one of the most common methods for assessing classification. Table 2 shows the structure of the confusion matrix, from which the following metrics can be derived28,29:

  • Accuracy:

    $${\text{Accuracy}} = \frac{TP + TN}{{TP + FP + TN + FN}}$$
    (8)
  • Precision:

    $${\text{Precision}} = \frac{TP}{{TP + FP}}$$
    (9)
  • Recall:

    $${\text{Recall}} = \frac{TP}{{TP + FN}}$$
    (10)
  • F1:

    $$F1 = \frac{TP}{{TP + \frac{1}{2} \left( {FP + FN} \right)}}$$
    (11)

    These performance values will be compared to determine whether adding velocity and displacement as additional features has any effect; a minimal evaluation sketch is given below.
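The sketch below illustrates the evaluation step: a 70:30 split followed by the confusion matrix and macro-averaged accuracy, precision, recall, and F1 (Eqs. 8–11). The synthetic data, the RF classifier, and the macro averaging are assumptions; the study may aggregate the per-class metrics differently.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import (accuracy_score, confusion_matrix, f1_score,
                             precision_score, recall_score)
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the 3-feature, 4-class vibration dataset.
X, y = make_classification(n_samples=1000, n_features=3, n_informative=3,
                           n_redundant=0, n_classes=4, n_clusters_per_class=1,
                           random_state=0)

# 70:30 train/test split as described in the text.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.30,
                                                    random_state=0)
model = RandomForestClassifier(random_state=0).fit(X_train, y_train)
y_pred = model.predict(X_test)

print(confusion_matrix(y_test, y_pred))                                 # Table 2 layout
print("accuracy :", accuracy_score(y_test, y_pred))                     # Eq. (8)
print("precision:", precision_score(y_test, y_pred, average="macro"))   # Eq. (9)
print("recall   :", recall_score(y_test, y_pred, average="macro"))      # Eq. (10)
print("F1       :", f1_score(y_test, y_pred, average="macro"))          # Eq. (11)
```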

Table 2 Confusion matrix.

Results and discussion

Analyzing datasets

A correlation matrix is a square, symmetric (K x K) matrix showing the correlation coefficient between columns i and j of the dataset30. Figure 6 shows the correlation matrix between acceleration, velocity, displacement, and the labels. It can be observed that acceleration, velocity, and displacement are fairly strongly correlated, especially velocity and displacement, whose coefficient is 0.94 (1 is a perfect linear relationship), whereas the labels have the weakest correlation with acceleration, velocity, and displacement.

Figure 6. Correlation matrix.
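A minimal sketch of how such a correlation matrix can be computed with pandas is shown below; the random placeholder data merely stand in for the processed acceleration, velocity, displacement, and label columns.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
# Placeholder columns standing in for the processed dataset.
df = pd.DataFrame({
    "acceleration": rng.normal(size=1000),
    "velocity":     rng.normal(size=1000),
    "displacement": rng.normal(size=1000),
    "label":        rng.integers(0, 4, size=1000),
})

print(df.corr())   # (K x K) symmetric matrix of pairwise Pearson correlations
```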

A data distribution is a function that specifies all possible values of a variable and quantifies their relative frequency (how often they occur); data distributions are widely used in statistics. Figure 7 shows the distributions of the dataset. The distribution of each feature looks good, with only a small portion of isolated data points, making the dataset ready to be fed to the machine learning algorithms.

Figure 7. Data distribution: (a) acceleration versus velocity, (b) acceleration versus displacement, (c) velocity versus displacement.

Machine learning algorithm test results

Model performance is analyzed by comparing the accuracy, precision, recall, and F1 values of each algorithm. Based on the experimental data in Table 3, accuracy ranges from 0.673230 to 0.965400, precision from 0.656123 to 0.974964, recall from 0.673230 to 0.974589, and F1 from 0.65009 to 0.97458. Across all combinations of input features, the two highest-scoring models are ANN followed by RF, but ANN shows better robustness to the various input features than RF. The experiments show that with acceleration alone as a feature all models perform well, whereas with velocity or displacement alone every model performs poorly; however, when velocity and displacement are used together, all models' performance increases significantly.

All combinations of the input features were tested. Combining acceleration with velocity, or acceleration with displacement, improves model performance, as shown by the experimental data in Table 3 and in Fig. 8. Furthermore, 3 out of 4 models improve when all three features are used together; only SVM shows a decrease in performance. As can be seen in Fig. 8, the best single-feature SVM performance was obtained with acceleration, followed by displacement and velocity. This is because adding features that are less sensitive to the desired output can introduce redundancy in the input data; the model may treat the additional information as spurious, making it harder to detect the real pattern in the data and lowering accuracy. It follows that the most important and most highly correlated feature is acceleration, which is confirmed whenever acceleration is used as one of two or three features. Overall, adding velocity and displacement as additional features can improve the models' performance, but to achieve a high level of accuracy, acceleration must be included as a feature of each model for better earthquake detection.

Table 3 Machine learning performances.
Figure 8. Models' accuracy chart.

Conclusion

This research has focused on the multi-classification of earthquake and non-earthquake vibrations through the application of machine learning. Four widely adopted algorithms were developed, namely the Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), and Artificial Neural Network (ANN). These models were used for vibration multi-classification into earthquake, non-earthquake, made-up earthquake, and heavy-vehicle classes. The models were evaluated against four performance criteria: accuracy, precision, recall, and F1 score. Comparing these criteria for SVM, RF, DT, and ANN, this study concludes that ANN outperforms the other machine learning algorithms in 6 out of 7 combinations of the input features and shows better robustness to varying input features. Hence, ANN is proposed as the best algorithm for multi-classification earthquake detection based on this experiment. Furthermore, acceleration, velocity, and displacement are well correlated, and combining them has been proven to improve the models' performance; doing so raises the accuracy of the RF and ANN models to 0.974589 and 0.965400, respectively. Future studies should focus on multi-classification earthquake detection using ANN specifically and implement it in hardware, to further prove its capability for real-time multi-classification earthquake detection. Despite the acceptable accuracy achieved for earthquake classification, flaws and limitations remain: the model needs to be restructured to achieve the best possible and optimal architecture so that it can distinguish vibrations more accurately, and the reliability of the proposed models should be validated in the future with more available data.