Ethical and Socially-Aware Data Labels

  • Elena BerettaEmail author
  • Antonio Vetrò
  • Bruno Lepri
  • Juan Carlos De Martin
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 898)


Many software systems today make use of large amount of personal data to make recommendations or decisions that affect our daily lives. These software systems generally operate without guarantees of non-discriminatory practices, as instead often required to human decision-makers, and therefore are attracting increasing scrutiny. Our research is focused on the specific problem of biased software-based decisions caused from biased input data. In this regard, we propose a data labeling framework based on the identification of measurable data characteristics that could lead to downstream discriminating effects. We test the proposed framework on a real dataset, which allowed us to detect risks of discrimination for the case of population groups.


Data ethics Automated decisions Data quality 


  1. 1.
    Barocas, S., Selbst, A.D.: Big data’s disparate impact. Calif. Law Rev. 104(3), 671–732 (2016)Google Scholar
  2. 2.
    Corrales, D.C., Corrales, J.C., Ledezma, A.: How to address the data quality issues in regression models: a guided process for data cleaning. Symmetry 10(4), 99 (2018). Scholar
  3. 3.
    Doshi-Velez, F., et al.: Accountability of AI under the law: the role of explanation. Berkman Center Research Publication Forthcoming, Harvard Public Law Working Paper 18(07) (2017)Google Scholar
  4. 4.
    Dwork, C., Hardt, M., Pitassi, T., Reingold, O., Zemeln, R.: Fairness through awareness. In: Proceedings of the 3rd Innovations in Theoretical Computer Science Conference, pp. 214–226. ACM (2012)Google Scholar
  5. 5.
    Friedler, S.A., Scheidegger, C., Venkatasubramanian, S.: On the (im) possibility of fairness. arXiv preprint arXiv:1609.07236 (2016)
  6. 6.
    Gebru, T., et al.: Datasheets for datasets. arXiv:1803.09010 (2018)
  7. 7.
    Hardt, M., Price, E., Srebro, N.: Equality of opportunity in supervised learning. In: Advances in Neural Information Processing Systems (2016)Google Scholar
  8. 8.
    Hosni, H., Vulpiani, A.: Forecasting in light of big data. Philos. Technol. 13, 1–13 (2017)Google Scholar
  9. 9.
    ISO-IEC: ISO/IEC 25012:2008 Software engineering - Software product Quality Requirements and Evaluation (SQuaRE) - Data quality model. Standard, International Organization for Standardization, Geneva, CH, December 2008Google Scholar
  10. 10.
    ISO-IEC: ISO/IEC 25024:2015 - Systems and software engineering - Systems and software Quality Requirements and Evaluation (SQuaRE) - Measurement of data quality. Standard, International Organization for Standardization, Geneva, CH, October 2015Google Scholar
  11. 11.
    Karim, N.S.A., Ammar, F.A., Aziz, R.: Ethical software: integrating code of ethics into software development life cycle. In: 2017 International Conference on Computer and Applications (ICCA), pp. 290–298, September 2017.
  12. 12.
    Lepri, B., Staiano, J., Sangokoya, D., Letouzé, E., Oliver, N.: The tyranny of data? The bright and dark sides of data-driven decision-making for social good. In: Cerquitelli, T., Quercia, D., Pasquale, F. (eds.) Transparent Data Mining for Big and Small Data. SBD, vol. 11, pp. 3–24. Springer, Cham (2017). Scholar
  13. 13.
    O’Neil, C.: Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy. Crown Publishing Group, New York (2016)zbMATHGoogle Scholar
  14. 14.
    Torchiano, M., Vetrò, A., Iuliano, F.: Preserving the benefits of open government data by measuring and improving their quality: an empirical study. In: 2017 IEEE 41st Annual Computer Software and Applications Conference (COMPSAC), vol. 1, pp. 144–153, July 2017.
  15. 15.
    Vetrò, A., Canova, L., Torchiano, M., Minotas, C.O., Iemma, R., Morando, F.: Open data quality measurement framework: definition and application to open government data. Gov. Inf. Q. 33(2), 325–337 (2016). Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.Nexa Center for Internet and Society, DAUINPolitecnico di TorinoTurinItaly
  2. 2.Future Urban Legacy LabPolitecnico di TorinoTurinItaly
  3. 3.Fondazione Bruno KesslerTrentoItaly

Personalised recommendations