Skip to main content

Determining Air Pollution Level with Machine Learning Algorithms: The Case of India

  • Conference paper
  • First Online:
4th International Conference on Artificial Intelligence and Applied Mathematics in Engineering (ICAIAME 2022)

Abstract

Air pollution is one of the critical health problems affecting the quality of life, especially in city centers. The air quality index (AQI) is the primary parameter used to measure air pollution. This parameter is constantly measured in city centers with measuring devices that contain various sensors. Due to the high purchasing costs and the need for periodic calibration, there is a need to develop more economical technologies to calculate AQI values. This study aims to predict air pollution with minimum sensors based on machine learning. In the study, daily-based five-year air quality measurement data from India, taken from the Kaggle database, were used. Air quality data includes \(PM_{2.5}\), \(PM_{10}\), \(O_{3}\), \(NO_{2}\), \(SO_{2}\), \(CO_{}\) values obtained from sensors in the measuring devices, and AQI calculated with these values. A feature selection algorithm is used to reduce the sensor cost. Then, artificial intelligence-based AQI was calculated with minimum sensor data. According to the findings, artificial intelligence-based AQI calculation model performances r and RMSE were determined as 0.93 and 20.57, respectively. It has been evaluated that AQI data can be calculated based on artificial intelligence with a minimum of sensors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 379.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Free shipping worldwide - see info
Hardcover Book
USD 379.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Yolsal, H.: Estimation of the air quality trends in Istanbul. İktisadi İdari Bilimler Dergisi 38, 375 (2016)

    Article  Google Scholar 

  2. Khuda, K.-E.: Causes of air pollution in Bangladesh’s capital city and its impacts on public health. Nat. Environ. Pollut. Technol. 19, 1483–1490 (2020)

    Article  Google Scholar 

  3. Yilmaz, F.: Hava kirliliği, bileşenleri ve sağlık. FSM İlmî Araştırmalar İnsan ve Toplum Bilimleri Dergisi 231–250 (2021)

    Google Scholar 

  4. Pope, C.A., Dockery, D.W.: Health effects of fine particulate air pollution: lines that connect. J. Air Waste Manag. Assoc. 56, 709–742 (2006)

    Article  Google Scholar 

  5. Who air quality guidelines for particulate matter, ozone, nitrogen dioxide and sulfur dioxide (2006)

    Google Scholar 

  6. Maji, S., Ahmed, S., Ghosh, S., Garg, S.K.: Evaluation of air quality index for air quality data interpretation in Delhi, India (2020)

    Google Scholar 

  7. History of air pollution \(|\) US EPA

    Google Scholar 

  8. AQI basics \(|\) airnow.gov

    Google Scholar 

  9. Kuznetsova, I.N., Tkacheva, Yu.V., Shalygina, I.Yu., Lezina, E.A.: Calculation of air quality index and assessment of its informativeness for Russia based on monitoring data for Moscow. Russ. Meteorol. Hydrol. 46, 530–538 (2021)

    Google Scholar 

  10. Plaia, A., Ruggieri, M.: Air quality indices: a review. Rev. Environ. Sci. Biotechnol. 10, 165–179 (2011)

    Article  Google Scholar 

  11. How is the AQI calculated? \(|\) US EPA

    Google Scholar 

  12. Wood, D.A.: Local integrated air quality predictions from meteorology (2015 to 2020) with machine and deep learning assisted by data mining. Sustain. Anal. Model. 2, 100002 (2022)

    Google Scholar 

  13. Du, W., et al.: Deciphering urban traffic impacts on air quality by deep learning and emission inventory. J. Environ. Sci. 124, 745–757 (2023)

    Article  Google Scholar 

  14. Lightstone, S.D., Moshary, F., Gross, B.: Comparing CMAQ forecasts with a neural network forecast model for PM2.5 in New York. Atmosphere 8(161), 2017 (2017)

    Google Scholar 

  15. Cabaneros, S.M., Calautit, J.K., Hughes, B.R.: A review of artificial neural network models for ambient air pollution prediction. Environ. Modell. Softw. 119, 285–304 (2019)

    Article  Google Scholar 

  16. Liu, H., Li, Q., Yu, D., Gu, Y.: Air quality index and air pollutant concentration prediction based on machine learning algorithms. Appl. Sci. 9, 4069 (2019)

    Article  Google Scholar 

  17. Cabaneros, S.M.S., Calautit, J.K.S., Hughes, B.R.: Hybrid artificial neural network models for effective prediction and mitigation of urban roadside NO2 pollution. Energy Procedia 142, 3524–3530 (2017)

    Article  Google Scholar 

  18. Kekulanadara, K.M.O.V.K., Kumara, B.T.G.S., Kuhaneswaran, B.: Machine learning approach for predicting air quality index. In: 2021 International Conference on Decision Aid Sciences and Application, DASA 2021, pp. 622–626 (2021)

    Google Scholar 

  19. CPCB \(|\) central pollution control board

    Google Scholar 

  20. Mishra, S., Datta-Gupta, A.: Applied statistical modeling and data analytics: a practical guide for the petroleum geosciences (2018)

    Google Scholar 

  21. Bebis, G., Georgiopoulos, M.: Feed-forward neural networks. IEEE Potent. 13, 27–31 (1994)

    Article  Google Scholar 

  22. Vogt, M., Kecman, V.: Active-set methods for support vector machines. In: Wang, L. (ed.) Support Vector Machines: Theory and Applications. Studies in Fuzziness and Soft Computing, vol. 177, pp. 133–158. Springer, Heidelberg (2005). https://doi.org/10.1007/10984697_6

    Chapter  Google Scholar 

  23. Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and regression trees. Classif. Regression Trees 1–358 (2017)

    Google Scholar 

  24. Abiri, O., Twala, B.: Modelling the flow stress of alloy 316l using a multi-layered feed forward neural network with Bayesian regularization. In: IOP Conference Series: Materials Science and Engineering, vol. 225, p. 9 (2017)

    Google Scholar 

  25. Uçar, M.K., Uçar, Z., Köksal, F., Daldal, N.: Estimation of body fat percentage using hybrid machine learning algorithms. Meas.: J. Int. Meas. Confederation, 167 (2021)

    Google Scholar 

  26. Tanabe, K.: Pareto’s 80/20 rule and the gaussian distribution. Phys. A Stat. Mech. Appl. 510, 635–640 (2018)

    Article  MathSciNet  MATH  Google Scholar 

  27. Flake, G.W., Lawrence, S.: Efficient SVM regression training with SMO. Mach. Learn. 46, 271–290 (2002)

    Article  MATH  Google Scholar 

  28. Cherkassky, V., Ma, Y.: Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw. 17, 113–126 (2004)

    Article  MATH  Google Scholar 

  29. Polat, K., Koc, K.O.: Detection of skin diseases from dermoscopy image using the combination of convolutional neural network and one-versus-all (2020)

    Google Scholar 

  30. On the mean squared error of an estimator \(|\) introduction to probability \(|\) supplemental resources \(|\) mit opencourseware

    Google Scholar 

  31. Root mean square error (RMSE) \(|\) cros

    Google Scholar 

  32. Standard error \(|\) what it is, why it matters, and how to calculate

    Google Scholar 

  33. Rahimpour, A., Amanollahi, J., Tzanis, C.G.: Air quality data series estimation based on machine learning approaches for urban environments. Air Qual. Atmos. Health 14, 191–201 (2021)

    Article  Google Scholar 

  34. Ausati, S., Amanollahi, J.: Assessing the accuracy of ANFIS, EEMD-GRNN, PCR, and MLR models in predicting PM2.5. Atmos. Environ. 142, 465–474 (2016)

    Article  Google Scholar 

  35. Zhu, S., Lian, X., Liu, H., Hu, J., Wang, Y., Che, J.: Daily air quality index forecasting with hybrid models: a case in China. Environ. Pollut. 231, 1232–1244 (2017)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Furkan Abdurrahman Sari .

Editor information

Editors and Affiliations

Ethics declarations

Conflict of Interests

There is no conflict of interest between the authors.

Financial Support

No support was received for this research.

Ethics Committee Approval

Ethics committee approval is not required as the open source data set was used in the study.

Data Collection and Use Permission

.

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sari, F.A., Haşıloğlu, M.A., Uçar, M.K., Güler, H. (2023). Determining Air Pollution Level with Machine Learning Algorithms: The Case of India. In: Hemanth, D.J., Yigit, T., Kose, U., Guvenc, U. (eds) 4th International Conference on Artificial Intelligence and Applied Mathematics in Engineering. ICAIAME 2022. Engineering Cyber-Physical Systems and Critical Infrastructures, vol 7. Springer, Cham. https://doi.org/10.1007/978-3-031-31956-3_48

Download citation

Publish with us

Policies and ethics