Abstract
With the advent of big data today, agricultural research institutes have stored enough data in databases at every level. This data contains a lot of knowledge that is often not well known by researchers and on the other hand, it may contain hidden semantic relationships. Therefore, it is important to find an adequate storage system for these data in order to discover new useful information that can be shared between the different actors in the field to improve agricultural productivity in Burkina. The concept that best responds to this storage and analysis problem is the data lake. Indeed, data lakes offer the possibility of storing a large volume of data in any format and data structure. They also provide services for data access and analysis. Our work revolves around the integration and storage of agricultural data (structured, semi-structured and unstructured). With the plurality of agricultural storage systems, the heterogeneity of the actors in the field and the disparity of the tools leads us to think that the data lake is a solution.
The purpose of this essay is to investigate data lakes and their requirements while also putting out a broad framework for an agricultural data lake in Burkina Faso.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cartier, L.E., Bürge, M.: « Agriculture and artisanal gold mining in sierra leone: alternatives or complements? »J. Int. Dev. 23(8), 1080–1099 (2011). https://doi.org/10.1002/jid.1833
Dipama, J.-M. P.: « Changement climatique et agriculture durable au Burkina Faso : stratégies de résilence basées sur les savoirs locaux rapport d’étude », juin 2016, Consulté le: 31 juillet 2022. [En ligne]. Disponible sur: https://idl-bnc-idrc.dspacedirect.org/handle/10625/57568
Bellon-Maurel, V., Huyghe, C.:« L’innovation technologique dans l’agriculture »,Geoeconomie 80(3), 159–180 (2016)
« Data Lake ou Lac de Données : définition et utilisation », Formation Data Science | DataScientest.com, 22 février 2021. https://datascientest.com/data-lake-tout-savoir (consulté le 13 juillet 2022)
C. Madera, A. Laurent, T. Libourel Rouge, et A. Miralles, « How can the data lake concept influence information system design for agriculture? ». In: 11th European Conference Dedicated to the Future Use of ICT in the Agri-Food Sector, Bioresource and Biomass Sector (EFITA 2017), Montpellier, France, juill. 2017, p. 181‑182. Consulté le: 21 juillet 2022. [En ligne]. Disponible sur: https://hal.archives-ouvertes.fr/hal-01847697
Alekseev, A., et al.: Prototype of the Russian scientific data lake. EPJ Web Conf. 251, 02031 (2021). https://doi.org/10.1051/epjconf/202125102031
« Pentaho, Hadoop, and Data Lakes », James Dixon’s Blog, 14 octobre 2010. https://jamesdixon.wordpress.com/2010/10/14/pentaho-hadoop-and-data-lakes/ (consulté le 8 juillet 2022)
B. L, « Data Lake : définition, avantages et inconvénients pour l’entreprise »,LeBigData.fr, 10 juillet 2017. https://www.lebigdata.fr/data-lake-definition (consulté le 7 juillet 2022)
Stein, B., Morrison, A.: « The enterprise data lake: better integration and deeper analytics », p. 9
E.M.T. at E.S.S. Limited, « Introduction To The Concept Of Data Lake And Its Benefits – ESDS BLOG », 6 février 2015. https://www.esds.co.in/blog/introduction-to-the-concept-of-data-lake-and-its-benefits/ (consulté le 7 juillet 2022)
« Governing and Managing Big Data for Analytics and Decision Makers », p. 28
R. Hai, C. Quix, et M. Jarke, « Data lake concept and systems: a survey ». arXiv, 17 juin 2021. Consulté le: 8 juillet 2022. [En ligne]. Disponible sur: http://arxiv.org/abs/2106.09592
Sarramia, D., Claude, A., Ogereau, F., Mezhoud, J., Mailhot, G.: «CEBA: a data lake for data sharing and environmental monitoring », Sensors 22(7), 2733 (2022)
Hartmann, S., Küng, J., Chakravarthy, S., Anderst-Kotsis, G., Tjoa, A.M., Khalil, I. (eds.): DEXA 2019. LNCS, vol. 11706. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27615-7
Giebler, C., Gröger, C., Hoos, E., Schwarz, H., Mitschang, B.: Leveraging the data lake: current state and challenges. In: Ordonez, C., Song, I.-Y., Anderst-Kotsis, G., Tjoa, A.M., Khalil, I. (eds.) DaWaK 2019. LNCS, vol. 11708, pp. 179–188. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27520-4_13
Sawadogo, P., Darmont, J.: On data lake architectures and metadata management. J. Intell. Inf. Syst. 56(1), 97–120 (2020). https://doi.org/10.1007/s10844-020-00608-7
Inmon, B.: Data Lake Architecture: Designing the Data Lake and Avoiding the Garbage Dump. Technics Publications (2016)
[« Architecting Data Lakes [Book] ». https://www.oreilly.com/library/view/architecting-data-lakes/9781492042518/ (consulté le 26 juillet 2022)
Hai, R., Quix, C., Kensche, D.: Nested schema mappings for integrating JSON. In: Trujillo, J.C., et al. (eds.) ER 2018. LNCS, vol. 11157, pp. 397–405. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00847-5_28
Giebler, C., Gröger, C., Hoos, E., Schwarz, H., Mitschang, B.: Modeling data lakes with data vault: practical experiences, assessment, and lessons learned. In: Laender, A.H.F., Pernici, B., Lim, E.-P., de Oliveira, J.P.M. (eds.) ER 2019. LNCS, vol. 11788, pp. 63–77. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-33223-5_7
LaPlante, A.: Architecting Data Lakes: Data Management Architectures for Advanced Business Use Cases, 1st edn. O’Reilly Media, Sebastopol (2016). Consulté le: 14 juillet 2022. [En ligne]. Disponible sur: https://learning.oreilly.com/library/view/-/9781492042518/?ar
Advances in Databases and Information Systems. Consulté le: 14 juillet 2022. [En ligne]. Disponible sur: https://link.springer.com/book/https://doi.org/10.1007/978-3-030-28730-6
« The Data Lake Architecture Framework - Digitale Bibliothek - Gesellschaft für Informatik e.V. » https://dspace.gi.de/handle/20.500.12116/35802 (consulté le 14 juillet 2022)
Darmont, J., Favre, C., Loudcher, S., Noûs, C.: « Data lakes for digital humanities ». In” Proceedings of the 2nd International Conference on Digital Tools & Uses Congress, October 2020, pp. 1–4 (2020). https://doi.org/10.1145/3423603.3424004
Mendelevitch, O., Stella, C., Eadline, D.: Practical Data Science with Hadoop and Spark: Designing and Building Effective Analytics at Scale. Addison-Wesley, Boston (2017)
Madera, C., Laurent, A.:« The next information architecture evolution: the data lake wave ». In: Proceedings of the 8th International Conference on Management of Digital EcoSystems, New York, NY, USA, November 2016, p. 174‑180 (2016). https://doi.org/10.1145/3012071.3012077
Fang, H.: « Managing data lakes in big data era: What’s a data lake and why has it became popular in data management ecosystem ». In: 2015 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems (CYBER), June 2015, pp. 820–824. https://doi.org/10.1109/CYBER.2015.7288049
Khine, p.p., Wang, Z.S.:« Data lake: a new ideology in big data era », ITM Web Conf. 17, 03025 (2018). https://doi.org/10.1051/itmconf/201817030251
« Satellite data-driven multi-objective simulation-optimization modeling for water-environment-agriculture nexus in an arid endorheic lake basin ».J. Hydrol. 612, 128207 (2022). https://doi.org/10.1016/j.jhydrol.2022.128207
Ouafiq, E.M., Saadane, R., Chehri, A.: « Data management and integration of low power consumption embedded devices IoT for transforming smart agriculture into actionable knowledge ».Agriculture 123, Art. No. 3 (2022), https://doi.org/10.3390/agriculture12030329
« Base de données complètes sur les exploitations agricoles : manuel de référence ». https://www150.statcan.gc.ca/n1/pub/21f0005g/21f0005g2011000-fra.htm#archived (consulté le 31 juillet 2022)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Sore, S., Traore, Y., Bikienga, M., Ouedraogo, F.T. (2023). An Architecture of a Data Lake for the Sharing, Agricultural Knowledge in Burkina Faso. In: Saeed, R.A., Bakari, A.D., Sheikh, Y.H. (eds) Towards new e-Infrastructure and e-Services for Developing Countries. AFRICOMM 2022. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 499. Springer, Cham. https://doi.org/10.1007/978-3-031-34896-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-031-34896-9_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34895-2
Online ISBN: 978-3-031-34896-9
eBook Packages: Computer ScienceComputer Science (R0)