Abstract
This study aimed to carry out a comparative analysis of machine learning techniques via data mining in a company in the railway segment located in the state of Paraná, Brazil. To achieve the goal of reducing emissions of polluting gases that impact future climate changes, information was first collected regarding fuel consumption over the years from 2006 to 2020. Then, data mining techniques were applied to identify patterns, clean and prepare the collected base, then the machine learning techniques of K-Nearest Neighbors (KNN), Random Forest and Support Vector Machine (SVM) were applied. The selection of the best technique was based on the smallest error in the test set. As a result, it was possible to identify that the KNN was the technique with the lowest error compared to the others analyzed, being the one chosen to make future forecasts for the company under study.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. The MIT Press, Cambridge (2016)
ANTT. http://www.gov.br/antt. Accessed 04 Aug 2022
Chen, Y.J., Chen, Y.M.: Forecasting corporate credit ratings using big data from social media. Expert Syst. Appl. 207(1), 118042 (2022). https://doi.org/10.1016/j.eswa.2022.118042
Wang, B., et al.: A hybrid machine learning approach to determine the optimal processing window in femtosecond laser-induced periodic nanostructures. J. Mater. Process. Technol. 308, 117716 (2022). https://doi.org/10.1016/j.jmatprotec.2022.117716
Mitra, R., Goswami, A., Tiwari, M.K.: Financial supply chain analysis with borrower identification in smart lending platform. Expert Syst. Appl. 208, 118026 (2022). https://doi.org/10.1016/j.eswa.2022.118026
Kaymakcı, R., Görener, A., Toker, K.: The perceived over qualification’s effect on innovative work behaviour: do transformational leadership and turnover intention matter? Curr. Res. Behav. Sci. 3(1), 100068 (2022). https://doi.org/10.1016/j.crbeha.2022.100068
Alamdari, S., Basiri, M.H., Mousavi, A., Soofastaei, A.: Application of machine learning techniques to predict haul truck fuel consumption in open-pit mines. J. Min. Environ. 13(1), 69–85 (2022). https://doi.org/10.22044/JME.2022.11577.2145
Caraka, R.E., et al.: Albatross analytics a hands-on into practice: statistical and data science application. J. Big Data 9, 70 (2022). https://doi.org/10.1186/s40537-022-00626-y
Faccini, D., Maggioni, F., Potra, F.A.: Robust and distributionally robust optimization models for linear support vector machine. Comput. Oper. Res. 147, 105930 (2022). https://doi.org/10.1016/j.cor.2022.105930
Lee, L.H., et al.: Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants. Nat. Libr. Med. 65(1), e1 (2021). https://doi.org/10.1192/j.eurpsy.2021.2248
Kato, M., Yanai, T.: Pulled fly balls are harder to catch: a game analysis with a machine learning approach. Sports Eng. 25, 11 (2022). https://doi.org/10.1007/s12283-022-00373-6
Mert, S., et al.: Development of a SERS based cancer diagnosis approach employing cryosectioned thyroid tissue samples on PDMS. Nanomed. Nanotechnol. Biol. Med. 44, 102577 (2022). https://doi.org/10.1016/j.nano.2022.102577
Zhu, Q., Li, H., Ao, Z., et al.: Lipidomic identification of urinary extracellular vesicles for non-alcoholic steatohepatitis diagnosis. J. Nanobiotechnol. 20, 349 (2022). https://doi.org/10.1186/s12951-022-01540-4
Rastogi, A., et al.: Early diagnosis of lung cancer using magnetic nanoparticles-integrated systems. Nanotechnol. Rev. 11(1), 544–574 (2022). https://doi.org/10.1515/ntrev-2022-0032
Yang, C., et al.: Precise detection of cataracts with specific high-risk factors by layered binary co-ionizers assisted aqueous humor metabolic analysis. Adv. Sci. 9(21), 2105905 (2022). https://doi.org/10.1002/advs.202105905
Ferreño, D., et al.: Prediction of mechanical properties of rail pads under in-service conditions through machine learning algorithms. J. Infrastruct. Syst. 151, 102927 (2021). https://doi.org/10.1016/j.advengsoft.2020.102927
Lee, Z.S., Guo, H., Zhou, L.:Rail system anomaly detection via machine learning approaches. In: 2020 IEEE Region 10 Conference (TENCON), pp. 824–828 (2020). https://doi.org/10.1109/TENCON50793.2020.9293809
Zhou, T., Wang, Y., Wang, C.X., Salous, S., Liu, L., Tao, C.: Multi-feature fusion based recognition and relevance analysis of propagation scenes for high-speed railway channels. IEEE Trans. Veh. Technol. 69(8), 8107–8118 (2020). https://doi.org/10.1109/TVT.2020.2999313
Li, J., Peng, Q., Yang, Y.: Passenger flow prediction for Guangzhou-Zhuhai intercity railway based on SARIMA Model. J. Southwest Jiaotong Univ. 55(1), 41–51 (2020). https://doi.org/10.3969/j.issn.0258-2724.20180617
Zhang, L., Ni, Q., Zhang, G., Zhai, M., Moreno, J., Briso, C.: Random forests-enabled context detections for long-term evolution network for railway. IET Microwaves Antennas Propag. 13(8), 1080–1086 (2019). https://doi.org/10.1049/iet-map.2018.6025
Shi, L., Zhu, Y., Zhang, Y., Su, Z.: Fault diagnosis of signal equipment on the Lanzhou- Xinjiang high-speed railway using machine learning for natural language processing. Complexity 2021, 1–13 (2021). https://doi.org/10.1155/2021/9126745
Liu, Z., Ma, Q., Tang, H., Li, J., Wang, P., He, Q.: Forecasting estimated times of arrival of US freight trains. Transp. Plan. Technol. 45, 427–448 (2022). https://doi.org/10.1080/03081060.2022.2115044
Goncalves, M.C., Wollmann, R.R.G., de Sampaio, R.J.B.: Proposal of a numerical approximation theory to solve the robust convex problem of production planning (2023)https://doi.org/10.1504/IJOR.2022.10049618
Szejka, A.L., Aubry, A., Panetto, H., Júnior, O.C., Loures, E.R.: Towards a conceptual framework for requirements interoperability in complex systems engineering. In: Meersman, R., et al. (eds.) OTM 2014. LNCS, vol. 8842, pp. 229–240. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-45550-0_24
Canciglieri, O.J., Young, R.I.M.: Information mapping across injection moulding design and manufacture domains. Int. J. Prod. Res. 48(15), 4437–4462 (2010). https://doi.org/10.1080/00207540902824974
Giovannini, A., Aubry, A., Panetto, H., El Haouzi, H., Canciglieri, O., Pierrel, L.: Knowledge representation, retrieval and reuse for product family design: an anti-logicist approach. Comput. Ind. Eng. 101, 391–402 (2016). https://doi.org/10.1016/j.cie.2016.10.001
Mattioda, R.A., Fernandes, P.T., Detro, S.P., Casela, J.L., Junior, O.C.: Principle of triple bottom line in the integrated development of sustainable products. Chem. Eng. Trans. 35, 199–204 (2013). https://doi.org/10.3303/CET1335033
Nara, E.O.B., Schaefer, J.L., de Moraes, J., Tedesco, L.P.C., Furtado, J.C., Baierle, I.C.: Sourcing research papers on small- and medium-sized enterprises’ competitiveness: An approach based on authors’ networks. Revista Espanola de Documentacion Cientifica 42(2), 230 (2019). https://doi.org/10.3989/redc.2019.2.1602
Baierle, I.C., Schaefer, J.L., Sellitto, M.A., Fava, L.P., Furtado, J.C., Nara, E.O.B.: Moona software for survey classification and evaluation of criteria to support decision-making for properties portfolio. Int. J. Strateg. Prop. Manag. 24(2), 226–236 (2020). https://doi.org/10.3846/ijspm.2020.12338
Schaefer, J.L., Baierle, I.C., Sellitto, M.A., Furtado, J.C., Nara, E.O.B.: Competitiveness scale as a basis for Brazilian small and medium-sized enterprises. EMJ – Eng. Manag. J. 33(4), 255–271 (2021). https://doi.org/10.1080/10429247.2020.1800385
Storch, L.A., Nara, E.O.B., Kipper, L.M.: The use of process management based on a systemic approach. Int. J. Product. Perform. Manag. 62(7), 758–773 (2013). https://doi.org/10.1108/IJPPM-12-2012-0134
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Gonçalves, M.C., Nara, E.O.B., Santos, I.M.d., Mateus, I.B., do Amaral, L.M.B. (2023). Comparative Analysis of Machine Learning Techniques via Data Mining in a Railroad Company. In: Deschamps, F., Pinheiro de Lima, E., Gouvêa da Costa, S.E., G. Trentin, M. (eds) Proceedings of the 11th International Conference on Production Research – Americas. ICPR 2022. Springer, Cham. https://doi.org/10.1007/978-3-031-36121-0_83
Download citation
DOI: https://doi.org/10.1007/978-3-031-36121-0_83
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36120-3
Online ISBN: 978-3-031-36121-0
eBook Packages: EngineeringEngineering (R0)