A Framework to Improve Data Collection and Promote Usability

  • Davide CarneiroEmail author
  • Albertino Vieira
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 806)


Many of nowadays organizations can be said to be knowledge-based. That is, they have relevant decision-making processes that are supported by data and data mining processes. These data may be created/collected by the organization or acquired from external sources (e.g. open data portals). In any case, the quality of the data will, ultimately, be one of the main drivers of decision quality. In this context, it is important that data-producing organizations also produce relevant meta-information characterizing the provenance of the data, its context or the representation standards used. This paper presents a framework to facilitate this process, promoting the inclusion of information concerning representation standards, provenance, trust and permissions at the data level. The main goal is to promote data usability and, consequently, its value for the organizations.


Data acquisition Provenance Data representation 



This work is co-funded by Fundos Europeus Estruturais e de Investimento (FEEI) through Programa Operacional Regional Norte, in the scopre of project NORTE-01-0145-FEDER-023577.


  1. 1.
    Gudivada, V.N., Baeza-Yates, R.A., Raghavan, V.V.: Big data: promises and problems. IEEE Comput. 48(3), 20–23 (2015)CrossRefGoogle Scholar
  2. 2.
    De Paz, J.F., Julián, V., Villarrubia, G., Marreiros, G., Novais, P.: Ambient intelligence–software and applications. In: 8th International Symposium on Ambient Intelligence (ISAmI 2017), vol. 615. Springer (2017)Google Scholar
  3. 3.
    Gonzaga, J., Meleiro, L.A.C., Kiang, C., Maciel Filho, R.: Ann-based soft-sensor for real-time process monitoring and control of an industrial polymerization process. Comput. Chem. Eng. 33(1), 43–49 (2009)CrossRefGoogle Scholar
  4. 4.
    Diallo, O., Rodrigues, J.J., Sene, M., Lloret, J.: Distributed database management techniques for wireless sensor networks. IEEE Trans. Parallel Distrib. Syst. 26(2), 604–620 (2015)CrossRefGoogle Scholar
  5. 5.
    Marz, N., Warren, J.: Big Data: Principles and Best Practices of Scalable Realtime Data Systems. Manning Publications Co. (2015)Google Scholar
  6. 6.
    Tassa, T.: Secure mining of association rules in horizontally distributed databases. IEEE Trans. Knowl. Data Eng. 26(4), 970–983 (2014)CrossRefGoogle Scholar
  7. 7.
    Zaharia, M., et al.: Apache spark: a unified engine for big data processing. Commun. ACM 59(11), 56–65 (2016)CrossRefGoogle Scholar
  8. 8.
    Fan, W.: Data quality: from theory to practice. ACM SIGMOD Record 44(3), 7–18 (2015)CrossRefGoogle Scholar
  9. 9.
    Merkel, D.: Docker: lightweight linux containers for consistent development and deployment. Linux J. 2014(239), 2 (2014)Google Scholar
  10. 10.
    Freitas, A., Curry, E.: Big data curation. In: New Horizons for a Data-Driven Economy, pp. 87–118. Springer (2016)Google Scholar
  11. 11.
    Sänger, J., Richthammer, C., Hassan, S., Pernul, G.: Trust and big data: a roadmap for research. In: 2014 25th International Workshop on Database and Expert Systems Applications (DEXA), pp. 278–282. IEEE (2014)Google Scholar
  12. 12.
    Moreau, L., et al.: The provenance of electronic data. Commun. ACM 51(4), 52–58 (2008)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.CIICESI, ESTGPolytechnic Institute of PortoFelgueirasPortugal
  2. 2.Algoritmi Centre/Department of InformaticsUniversidade do MinhoBragaPortugal

Personalised recommendations