Skip to main content

Multivariate Classification of the Crude Oil Petroleum Systems in Southeast Texas, USA, Using Conventional and Compositional Data Analysis of Biomarkers

  • Chapter
  • First Online:
Advances in Compositional Data Analysis

Abstract

Chemically, petroleum is an extraordinarily complex mixture of different types of hydrocarbons that are now possible to isolate and identify because of advances in geochemistry. Here, we use biomarkers and carbon isotopes to establish genetic differences and similarities among oil samples. Conventional approaches for evaluating biomarker and carbon isotope relative abundances include statistical techniques such as principal component and cluster analysis. Considering that proportions of the different hydrocarbon molecules are relative parts of a laboratory sample, the data are compositional in nature, thus requiring the use of log-ratio approaches for adequate mathematical modeling. We apply both traditional and compositional modeling approaches to crude oil samples from an onshore area of about 50,000 square miles in southeast Texas. The data comprise 177 crude oil samples from producing oil fields that include key biomarkers, elemental, and isotopic values commonly used in source rock correlation studies. Our results indicate that compositional modeling has higher discriminating power and lower uncertainty than the traditional approach, allowing the identification of up to 16 clusters. Each cluster represents one oil family from a source rock organofacies ranging from Carboniferous to Paleogene. The families provide new insights into important petroleum systems in the Texas onshore region of the Gulf of Mexico sedimentary basin.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • G. Brock, V. Pihur, S. Datta, S. Datta, clValid: an R package for cluster validation. J. Stat. Softw. 25(4), 1–22 (2008)

    Article  Google Scholar 

  • Z.F.M. Burton, J.M. Moldowan, L.B. Magoon, R. Sykes, S.A. Graham, Interpretation of source rock depositional environment and age from seep oil, east coast of New Zealand. Int. J. Earth Sci. 108(4), 1079–1091 (2019)

    Article  Google Scholar 

  • G.E. Claypool, E.A. Mancini, Geochemical relationships of petroleum in Mesozoic reservoirs to carbonate source rocks of Jurassic Smackover Formation, southwestern Alabama. AAPG Bull. 73(7), 904–924 (1989)

    Google Scholar 

  • P.A. Comet, J.K. Rafalska, J.M. Brooks, Sterane and triterpane patterns as diagnostic tools in the mapping of oils, condensates, and source rocks of the Gulf of Mexico region. Org. Geochem. 20(8), 1265–1296 (1993)

    Article  Google Scholar 

  • M.A. Engle, E.L. Rowan, Interpretation of Na-Cl-Br systematics in sedimentary basin brines: comparison concentration, element ratio, and isometric log-ratio approaches. Math. Geosci. 45(1), 87–101 (2013)

    Article  Google Scholar 

  • J. Fox, S. Weisberg, An R Companion to Applied Regression, 3rd ed. (Sage, Thousand Oaks, CA, 2019), 577 pp.

    Google Scholar 

  • W.E. Galloway, Depositional evolution of the Gulf of Mexico sedimentary basin, in The Sedimentary Basins of the United States and Canada, ed. by A. Miall. Sedimentary Basins of the World, vol. 5 (Elsevier, Amsterdam, The Netherlands, 2008), pp. 505–549

    Google Scholar 

  • GeoMark Research, Ltd., RFDBASE—Rock and fluid database (2019), https://geomarkresearch.com/database-products/

  • B.U. Haq, J. Hardenbol, P.R. Vail, Mesozoic and Cenozoic chronostratigraphy and cycles of sea-level change, in Sea-Level Changes–An Integrated Approach. SEPM Special Publication 42 (Society of Economic Paleontologists and Mineralogists, Tulsa, OK, 1988), pp. 71–108

    Google Scholar 

  • F.E. Harrell Jr., K.L. Lee, B.D. Mark, Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat. Med. 15(4), 361–387 (1996)

    Article  Google Scholar 

  • M. He, S. Graham, A.H. Scheirer, K.E. Peters, A basin modeling and organic geochemistry study in the Vallecitos syncline, San Joaquin Basin, California. Mar. Pet. Geol. 49, 15–34 (2014)

    Article  Google Scholar 

  • M. He, M. Moldowan, A. Nemchenko-Rovenskaya, K.E. Peters, Oil families and their inferred source rocks in the Barents Sea and northern Timan-Pechora Basin, Russia. AAPG Bull. 96(6), 1121–1146 (2012)

    Article  Google Scholar 

  • M. He, J.M. Moldowan, K.E. Peters, Biomarkers: petroleum, in Encyclopedia of Geochemistry. ed. by W.M. White (Springer, Cham, Switzerland, 2018), pp. 136–148

    Chapter  Google Scholar 

  • K.C. Hood, O.P. Gross, L.M. Wenger, S.C. Harrison, Hydrocarbon systems analysis of the northern Gulf of Mexico: delineation of hydrocarbon migration pathways using seeps and seismic imaging, in Surface Exploration Case Histories: Applications of Geochemistry, Magnetics, and Remote Sensing, ed. by D. Schumacher, L.A. LeSchack. AAPG Studies in Geology No. 48 and SEG Geophysical References Series No. 11 (2002), pp. 25–40

    Google Scholar 

  • I.T. Jolliffe, Principal Component Analysis, 2nd ed. (Springer, New York, 2002), 487 pp.

    Google Scholar 

  • M.C. Kennicutt II., T.J. McDonald, P.A. Comet, G.J. Denoux, J.M. Brooks, The origins of petroleum in the northern Gulf of Mexico. Geochim. Cosmochim. Acta 56, 1259–1280 (1992)

    Article  Google Scholar 

  • A.I. Levorsen, F.A.F. Berry, Geology of Petroleum, 2nd ed. (AAPG, 2001), 724 pp.

    Google Scholar 

  • K.R. Marra, R.R. Charpentier, C.J. Shenk, M.D. Lewan, H.M. Leathers-Miller, T.R. Klett, S.B. Gaswirth, P.A. Le, T.J. Mercier, J.K. Pitman, M.E. Tennyson, Assessment of undiscovered shale gas and shale oil resources in the Mississippian Barnett Shale. Bend Arch-Fort Worth basin, North-Central Texas. U.S. Geological Survey Fact Sheet 2015-3078 (2015)

    Google Scholar 

  • G.W. Milligan, M.C. Cooper, An examination of procedures for determining the number of clusters in a dataset. Psychometrika 50(2), 159–179 (1985)

    Article  Google Scholar 

  • B. Mirkin, Clustering: A Data Recovery Approach, 2nd ed. (CRC Press, 2013), 365 pp.

    Google Scholar 

  • R. Nehring, Oil and gas resources, in The Gulf of Mexico Basin, ed. by A. Salvador (Geological Society of America, Boulder, CO, 1991), pp. 445–494

    Google Scholar 

  • J. Palarea-Albaladejo, J.A. Martín-Fernández, zCompositions—R package for multivariate imputation of left-censored data under a compositional approach. Chemom. Intell. Lab. Syst. 143, 85–96 (2015)

    Article  Google Scholar 

  • J. Palarea-Albaladejo, J.A. Martín-Fernández, J.A. Soto, Dealing with distances and transformations for fuzzy C-means clustering of compositional data. J. Classif. 29(2), 144–169 (2012)

    Article  MathSciNet  Google Scholar 

  • V. Pawlowsky-Glahn, J.J. Egozcue, R. Tolosana-Delgado, Modeling and Analysis of Compositional Data (Wiley, Chichester, UK, 2015), p. 247

    Google Scholar 

  • J.H. Pedersen, D.A. Karlsen, K. Backer-Owe, J.E. Lie, H. Brunstad, The geochemistry of two unusual oils from the Norwegian North Sea: implications for new source rock and play scenario. Pet. Geosci. 12(1), 85–96 (2006)

    Article  Google Scholar 

  • K.E. Peters, D.J. Curry, M. Kacewicz, An overview of basin and petroleum system modelling, in Basin Modeling: New Horizons in Research and Applications, ed. by K.E. Peters, D. Curry, M. Kacewicz. AAPG Hedberg Series, vol. 4 (2012), pp. 1–16

    Google Scholar 

  • K.E. Peters, J.M. Moldowan, The Biomarker Guide: Interpreting Molecular Fossils in Petroleum and Ancient Sediments (Prentice Hall, Englewood Cliffs, NJ, 1993), p. 363

    Google Scholar 

  • K.E. Peters, L.S. Ramos, J.E. Zumberge, Z.C. Valin, C.R. Scotese, D.L. Gautier, Circum-Arctic petroleum systems identified using decision-tree chemometrics. AAPG Bull. 91(6), 877–913 (2007)

    Article  Google Scholar 

  • K.E. Peters, C.C. Walters, J.M. Moldowan, The Biomarker Guide, 2nd ed. (Cambridge University Press, Cambridge, UK, 2005), p. 1155

    Google Scholar 

  • K.E. Peters, T.L. Wright, L.S. Ramos, J.E. Zumberge, L.B. Magoon, Chemometric recognition of genetically distinct oil families in the Los Angeles basin, California. AAPG Bull. 100(1), 115–135 (2016)

    Article  Google Scholar 

  • R Core Team, R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2018), https://www.R-project.org

  • P.J. Rousseeuw, Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20(C), 53‒65 (1987)

    Google Scholar 

  • A. Salvador (ed.), The Gulf of Mexico Basin. The Geology of North America, vol. J (The Geological Society of America, Boulder, CO, 1991), 568 pp.

    Google Scholar 

  • A. Salvador, J.M.Q. Muñeton, compilers, Stratigraphic correlation chart: Gulf of Mexico Basin, in The Gulf of Mexico Basin, ed. by A. Salvador. The Geology of North America, vol. J (Geological Society of America, Boulder, plate 5, 1989)

    Google Scholar 

  • R. Sassen, Migration of crude oil from the Smackover source rock to Jurassic and Cretaceous reservoirs of the northern Gulf rim. Org. Geochem. 14(1), 51–60 (1989)

    Article  Google Scholar 

  • J.L. Shelton, M.A. Engle, A. Buccianti, M.S. Blondes, The isometric log-ratio (ilr)-ion plot: a proposed alternative to the Piper diagram. J. Geochem. Explor. 190, 130–141 (2018)

    Article  Google Scholar 

  • K. Siddiqui, Heuristics for sample size determination in multivariate statistical techniques. World Appl. Sci. J. 27(2), 285–287 (2013)

    Google Scholar 

  • Z. Sofer, Stable carbon isotope compositions of crude oils: application to source depositional environments and petroleum alteration. AAPG Bull. 68(1), 31–49 (1984)

    Google Scholar 

  • C.A. Sugar, G.M. James, Finding the number of clusters in a dataset: an information-theoretic approach. J. Am. Stat. Assoc. 98(463), 750–763 (2003)

    Article  MathSciNet  Google Scholar 

  • N. Tyler, T. Ewing, Major oil plays of south and south-central Texas, in Contributions to the Geology of South Texas (1986), pp. 24–52

    Google Scholar 

  • M. Visvanathan, A.B. Srinivas, G.H. Lushington, P. Smith, Cluster validation: an integrative method for cluster analysis, in Proceedings of IEEE International Conference on Bioinformatics and Biomedicine Workshops (2009), pp. 238–242

    Google Scholar 

  • L.M. Wenger, R. Sassen, D. Schumacher, Molecular characteristics of Smackover, Tuscaloosa and Wilcox-reservoired oils in the Eastern Gulf Coast, in Proceedings of the Ninth Annual Research Conference of the Gulf Coast Section SEPM Foundation (1990), pp. 37–57

    Google Scholar 

  • R.D. Woods, A. Salvador, A.E. Miles, Pre-Triassic, in The Gulf of Mexico Basin, ed. by A. Salvador. The Geology of North America, vol. J (Geological Society of America, Boulder, CO, 1991), pp. 109–129

    Google Scholar 

  • J. Zumberge, H. Illich, L. Waite, Petroleum Geochemistry of the Cenomanian-Turonian Eagle Ford Oils of South Texas, AAPG Memoir 110, American Association of Petroleum Geologists, Tulsa, OK (2016), pp. 135–165

    Google Scholar 

  • J. Zumberge, C.R. Scotese, Biomarkers from marine crude oils reflect modeled climatic/oceanographic conditions for the Late Cretaceous. Abstracts of reports from the 23rd International Meeting on Organic Chemistry, 9–14 September, Torquay, UK, 23, 555 (2007), https://www.researchgate.net/publication/264044691_Biomarkers_from_Marine_Crude_Oils_Reflect_Modeled_ClimaticOceanographic_Conditions_for_the_Late_Cretaceous

Download references

Acknowledgements

This manuscript completed a mandatory internal review following the U.S. Geological Survey (USGS) Fundamental Science Practices (https://pubs.usgs.gov/circ/1367/). We are grateful to Paul Lillis (USGS) for a thorough internal review that resulted in suggestions and comments that helped to improve the manuscript, and to Madalyn Blondes (USGS) for a critical reading of a late version of this chapter. Steven Cahan (USGS) added the geographical information to all maps and Eric Morrissey (USGS) drafted Fig. 14a. J. A. Martín-Fernández was supported by the Spanish Ministry of Science, Innovation and Universities under the project CODAMET (RTI2018-095518-B-C21, 2019-2021).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ricardo A. Olea .

Editor information

Editors and Affiliations

Ethics declarations

Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government.

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Olea, R.A., Martín-Fernández, J.A., Craddock, W.H. (2021). Multivariate Classification of the Crude Oil Petroleum Systems in Southeast Texas, USA, Using Conventional and Compositional Data Analysis of Biomarkers. In: Filzmoser, P., Hron, K., Martín-Fernández, J.A., Palarea-Albaladejo, J. (eds) Advances in Compositional Data Analysis. Springer, Cham. https://doi.org/10.1007/978-3-030-71175-7_16

Download citation

Publish with us

Policies and ethics