Abstract
Data governance is one of the strongest pillars in Data management program which goes hand in hand with data quality. In industrial Data Lake huge amount of unstructured data is getting ingested at high velocity from different source systems. Similarly, through multiple channels of data are getting queried and transformed from Data Lake. Based on 3Vs of big data it’s a real challenge to set up a rule based on traditional data governance system for an Enterprise. In today’s world governance on semi structured or unstructured data on Industrial Data Lake is a real issue to the Enterprise in terms of query, create, maintain and storage effectively and secured way. On the other hand different stakeholders i.e. Business, IT and Policy team want to visualize the same data in different view to analyze, imposes constraints, and to place effective workflow mechanism for approval to the policy makers. In this paper author proposed property graph based governance architecture and process model so that real time unstructured data can effectively govern, visualize, manage and queried from Industrial Data Lake.
Similar content being viewed by others
References
Brandes U, Eiglsperger M, Lerner J, Pich C (2010) Graph Markup Language (GraphML). Bibliothek der Universitat Konstanz
http://graphml.graphdrawing.org/specification.html-GraphML specification
Simplifying Data Governance and Accelerating Real-time Big Data Analysis in Financial Services with MarkLogic Server and Intel. White Paper 2014
Manuika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Hung Byers A (2011) Big data: the next frontier for innovation, competition, and productivity. McKinsey Global Institute (MGI), New York, United States
McAfee A, Brynjolfsson E (2012) Big data: the management revolution. Harvard Business Review 90(10):59–68
Davenport TH, Barth P, Bean R (2012) How ‘big data’ is different. Sloan Management Rev 54(1):43–46
Weber K, Otto B, Österle H (2009) One size does not fit all—a contingency approach to data governance. ACM J Data Inform Quality 1(1):4–27
Beath C et al (2012) Finding Value in the Information Explosion. Sloan Management Rev. 53(4):18–20
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Dutta, H. Graph based data governance model for real time data ingestion. CSIT 3, 119–125 (2015). https://doi.org/10.1007/s40012-016-0079-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40012-016-0079-y