Skip to main content

Comparative Analysis of Machine Learning Techniques via Data Mining in a Railroad Company

  • Conference paper
  • First Online:
Proceedings of the 11th International Conference on Production Research – Americas (ICPR 2022)

Abstract

This study aimed to carry out a comparative analysis of machine learning techniques via data mining in a company in the railway segment located in the state of Paraná, Brazil. To achieve the goal of reducing emissions of polluting gases that impact future climate changes, information was first collected regarding fuel consumption over the years from 2006 to 2020. Then, data mining techniques were applied to identify patterns, clean and prepare the collected base, then the machine learning techniques of K-Nearest Neighbors (KNN), Random Forest and Support Vector Machine (SVM) were applied. The selection of the best technique was based on the smallest error in the test set. As a result, it was possible to identify that the KNN was the technique with the lowest error compared to the others analyzed, being the one chosen to make future forecasts for the company under study.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 279.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. The MIT Press, Cambridge (2016)

    Google Scholar 

  2. ANTT. http://www.gov.br/antt. Accessed 04 Aug 2022

  3. Chen, Y.J., Chen, Y.M.: Forecasting corporate credit ratings using big data from social media. Expert Syst. Appl. 207(1), 118042 (2022). https://doi.org/10.1016/j.eswa.2022.118042

    Article  Google Scholar 

  4. Wang, B., et al.: A hybrid machine learning approach to determine the optimal processing window in femtosecond laser-induced periodic nanostructures. J. Mater. Process. Technol. 308, 117716 (2022). https://doi.org/10.1016/j.jmatprotec.2022.117716

    Article  Google Scholar 

  5. Mitra, R., Goswami, A., Tiwari, M.K.: Financial supply chain analysis with borrower identification in smart lending platform. Expert Syst. Appl. 208, 118026 (2022). https://doi.org/10.1016/j.eswa.2022.118026

    Article  Google Scholar 

  6. Kaymakcı, R., Görener, A., Toker, K.: The perceived over qualification’s effect on innovative work behaviour: do transformational leadership and turnover intention matter? Curr. Res. Behav. Sci. 3(1), 100068 (2022). https://doi.org/10.1016/j.crbeha.2022.100068

    Article  Google Scholar 

  7. Alamdari, S., Basiri, M.H., Mousavi, A., Soofastaei, A.: Application of machine learning techniques to predict haul truck fuel consumption in open-pit mines. J. Min. Environ. 13(1), 69–85 (2022). https://doi.org/10.22044/JME.2022.11577.2145

    Article  Google Scholar 

  8. Caraka, R.E., et al.: Albatross analytics a hands-on into practice: statistical and data science application. J. Big Data 9, 70 (2022). https://doi.org/10.1186/s40537-022-00626-y

    Article  Google Scholar 

  9. Faccini, D., Maggioni, F., Potra, F.A.: Robust and distributionally robust optimization models for linear support vector machine. Comput. Oper. Res. 147, 105930 (2022). https://doi.org/10.1016/j.cor.2022.105930

    Article  MathSciNet  Google Scholar 

  10. Lee, L.H., et al.: Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants. Nat. Libr. Med. 65(1), e1 (2021). https://doi.org/10.1192/j.eurpsy.2021.2248

    Article  MathSciNet  Google Scholar 

  11. Kato, M., Yanai, T.: Pulled fly balls are harder to catch: a game analysis with a machine learning approach. Sports Eng. 25, 11 (2022). https://doi.org/10.1007/s12283-022-00373-6

    Article  Google Scholar 

  12. Mert, S., et al.: Development of a SERS based cancer diagnosis approach employing cryosectioned thyroid tissue samples on PDMS. Nanomed. Nanotechnol. Biol. Med. 44, 102577 (2022). https://doi.org/10.1016/j.nano.2022.102577

    Article  MathSciNet  Google Scholar 

  13. Zhu, Q., Li, H., Ao, Z., et al.: Lipidomic identification of urinary extracellular vesicles for non-alcoholic steatohepatitis diagnosis. J. Nanobiotechnol. 20, 349 (2022). https://doi.org/10.1186/s12951-022-01540-4

    Article  Google Scholar 

  14. Rastogi, A., et al.: Early diagnosis of lung cancer using magnetic nanoparticles-integrated systems. Nanotechnol. Rev. 11(1), 544–574 (2022). https://doi.org/10.1515/ntrev-2022-0032

    Article  Google Scholar 

  15. Yang, C., et al.: Precise detection of cataracts with specific high-risk factors by layered binary co-ionizers assisted aqueous humor metabolic analysis. Adv. Sci. 9(21), 2105905 (2022). https://doi.org/10.1002/advs.202105905

    Article  MathSciNet  Google Scholar 

  16. Ferreño, D., et al.: Prediction of mechanical properties of rail pads under in-service conditions through machine learning algorithms. J. Infrastruct. Syst. 151, 102927 (2021). https://doi.org/10.1016/j.advengsoft.2020.102927

    Article  Google Scholar 

  17. Lee, Z.S., Guo, H., Zhou, L.:Rail system anomaly detection via machine learning approaches. In: 2020 IEEE Region 10 Conference (TENCON), pp. 824–828 (2020). https://doi.org/10.1109/TENCON50793.2020.9293809

  18. Zhou, T., Wang, Y., Wang, C.X., Salous, S., Liu, L., Tao, C.: Multi-feature fusion based recognition and relevance analysis of propagation scenes for high-speed railway channels. IEEE Trans. Veh. Technol. 69(8), 8107–8118 (2020). https://doi.org/10.1109/TVT.2020.2999313

    Article  Google Scholar 

  19. Li, J., Peng, Q., Yang, Y.: Passenger flow prediction for Guangzhou-Zhuhai intercity railway based on SARIMA Model. J. Southwest Jiaotong Univ. 55(1), 41–51 (2020). https://doi.org/10.3969/j.issn.0258-2724.20180617

    Article  Google Scholar 

  20. Zhang, L., Ni, Q., Zhang, G., Zhai, M., Moreno, J., Briso, C.: Random forests-enabled context detections for long-term evolution network for railway. IET Microwaves Antennas Propag. 13(8), 1080–1086 (2019). https://doi.org/10.1049/iet-map.2018.6025

    Article  Google Scholar 

  21. Shi, L., Zhu, Y., Zhang, Y., Su, Z.: Fault diagnosis of signal equipment on the Lanzhou- Xinjiang high-speed railway using machine learning for natural language processing. Complexity 2021, 1–13 (2021). https://doi.org/10.1155/2021/9126745

    Article  Google Scholar 

  22. Liu, Z., Ma, Q., Tang, H., Li, J., Wang, P., He, Q.: Forecasting estimated times of arrival of US freight trains. Transp. Plan. Technol. 45, 427–448 (2022). https://doi.org/10.1080/03081060.2022.2115044

    Article  Google Scholar 

  23. Goncalves, M.C., Wollmann, R.R.G., de Sampaio, R.J.B.: Proposal of a numerical approximation theory to solve the robust convex problem of production planning (2023)https://doi.org/10.1504/IJOR.2022.10049618

  24. Szejka, A.L., Aubry, A., Panetto, H., Júnior, O.C., Loures, E.R.: Towards a conceptual framework for requirements interoperability in complex systems engineering. In: Meersman, R., et al. (eds.) OTM 2014. LNCS, vol. 8842, pp. 229–240. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-45550-0_24

    Chapter  Google Scholar 

  25. Canciglieri, O.J., Young, R.I.M.: Information mapping across injection moulding design and manufacture domains. Int. J. Prod. Res. 48(15), 4437–4462 (2010). https://doi.org/10.1080/00207540902824974

    Article  Google Scholar 

  26. Giovannini, A., Aubry, A., Panetto, H., El Haouzi, H., Canciglieri, O., Pierrel, L.: Knowledge representation, retrieval and reuse for product family design: an anti-logicist approach. Comput. Ind. Eng. 101, 391–402 (2016). https://doi.org/10.1016/j.cie.2016.10.001

    Article  Google Scholar 

  27. Mattioda, R.A., Fernandes, P.T., Detro, S.P., Casela, J.L., Junior, O.C.: Principle of triple bottom line in the integrated development of sustainable products. Chem. Eng. Trans. 35, 199–204 (2013). https://doi.org/10.3303/CET1335033

    Article  Google Scholar 

  28. Nara, E.O.B., Schaefer, J.L., de Moraes, J., Tedesco, L.P.C., Furtado, J.C., Baierle, I.C.: Sourcing research papers on small- and medium-sized enterprises’ competitiveness: An approach based on authors’ networks. Revista Espanola de Documentacion Cientifica 42(2), 230 (2019). https://doi.org/10.3989/redc.2019.2.1602

    Article  Google Scholar 

  29. Baierle, I.C., Schaefer, J.L., Sellitto, M.A., Fava, L.P., Furtado, J.C., Nara, E.O.B.: Moona software for survey classification and evaluation of criteria to support decision-making for properties portfolio. Int. J. Strateg. Prop. Manag. 24(2), 226–236 (2020). https://doi.org/10.3846/ijspm.2020.12338

    Article  Google Scholar 

  30. Schaefer, J.L., Baierle, I.C., Sellitto, M.A., Furtado, J.C., Nara, E.O.B.: Competitiveness scale as a basis for Brazilian small and medium-sized enterprises. EMJ – Eng. Manag. J. 33(4), 255–271 (2021). https://doi.org/10.1080/10429247.2020.1800385

    Article  Google Scholar 

  31. Storch, L.A., Nara, E.O.B., Kipper, L.M.: The use of process management based on a systemic approach. Int. J. Product. Perform. Manag. 62(7), 758–773 (2013). https://doi.org/10.1108/IJPPM-12-2012-0134

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marcelo Carneiro Gonçalves .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gonçalves, M.C., Nara, E.O.B., Santos, I.M.d., Mateus, I.B., do Amaral, L.M.B. (2023). Comparative Analysis of Machine Learning Techniques via Data Mining in a Railroad Company. In: Deschamps, F., Pinheiro de Lima, E., Gouvêa da Costa, S.E., G. Trentin, M. (eds) Proceedings of the 11th International Conference on Production Research – Americas. ICPR 2022. Springer, Cham. https://doi.org/10.1007/978-3-031-36121-0_83

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-36121-0_83

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-36120-3

  • Online ISBN: 978-3-031-36121-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics