Understanding Catalytic Biomass Conversion Through Data Mining

Ras, Erik-Jan; McKay, Ben; Rothenberg, Gadi

doi:10.1007/s11244-010-9563-z

Original Paper
Open access
Published: 03 June 2010

Topics in Catalysis, Volume 53, pages 1202–1208 (2010)

Erik-Jan Ras¹,², Ben McKay¹ & Gadi Rothenberg²

Abstract

Catalytic conversion of biomass is a key challenge that we chemists face in the twenty-first century. Worldwide, research is conducted into obtaining bulk chemicals, polymers and fuels. Our project centres on glucose valorisation via furfural derivatives using catalytic hydrogenation. We present here new results for a set of 48 bimetallic catalysts supported on silica, and demonstrate the application of data mining tools to identify major trends in the data. These results are combined with a full factorial data set for the hydrogenation of 5-ethoxymethylfurfural over alumina-supported transition metal catalysts. All the catalysts in the combined datasets were synthesized and tested for performance under identical conditions. This, combined with the fact that no combinations of metals were left out, enables the use of advanced data mining tools. The paper describes the data and highlights the relevant trends from a chemist’s viewpoint.

1 Introduction

Valorising biomass to bulk chemicals and/or fuel components is a ‘hot topic’ of research worldwide. Typically, “brute force” reactions are first used to break down the raw biomass into smaller building blocks [1–4]. Once this is done, it is a job for catalysis chemists to turn these building blocks into useful chemicals. The challenge is that biomass differs significantly from crude oil. Crude oil is composed largely of unfunctionalised hydrocarbons. Biomass, on the other hand, is typically over-functionalised. It is a mixture of alcohols, ethers, esters, and carboxylic acids, so biomass refining presents new challenges. Previously, we have studied various catalytic options for valorising fatty acids [5], triglycerides [6], and glycerol derivatives [7]. Here, we extend this work to furfural derivatives, which can be obtained from cellulose, starch, hemicellulose and sugars [8].

Here we report new results on the selective hydrogenation of 5-ethoxymethylfurfural 1 over silica-supported bimetallic catalysts (Scheme 1). The results are discussed in terms of mechanistic implications and process conditions, using data mining techniques to highlight general trends and explain their chemical meaning. We recently reported the hydrogenation of 1 over alumina-supported catalysts [9]. The key results for the alumina-based catalysts are compared here with the silica-based results in terms of major trends.

Our main target is the selective carbonyl reduction product 5-ethoxymethylfurfuryl alcohol 2, although high-selectivity routes to other products are also interesting. The alcohol 2 can be used as a component in novel diesel fuels. Alternatively, it can be oxidized to 2,5-furandicarboxylic acid and used as a monomer in biobased polyester synthesis [8].

Combining the silica and the alumina results gives a large and highly consistent data set. Figure 1 outlines the parameter space explored in this data set. The consistency comes from using high-throughput methodology, applying a 16-in-parallel flow reactor (http://www.flowrence.com) and ensuring that all catalysts are synthesized and tested under identical conditions. To efficiently explore this data set we use Principal Component Analysis (PCA), a technique typically used in data mining for dimension reduction and exploratory data analysis [10–12].

2 Results and Discussion

All catalysts were synthesized using a wet impregnation procedure. The liquid volume was chosen such that all catalysts could be prepared using identical precursor mass concentrations. This gives a rather dilute impregnation slurry, where the volume is determined by the metal precursor with the lowest solubility. In all cases the impregnation time was 24 h. The catalysts were dried at 105 °C and then calcined in air at 320 °C. All experiments were performed in trickle flow in a commercial 16-reactor setup at 80–120 °C. In a typical experiment, the reactor system is loaded with 15 calcined catalysts at 50 mg each. The remaining reactor is left empty to obtain a reliable measurement of conversion and to probe for uncatalysed reactions. The catalysts are first reduced at 300 °C using a mixture of hydrogen and nitrogen. After cooling down to 80 °C, each reactor is supplied with a feed of substrate solution corresponding to a LHSV of 0.032 mol g⁻¹ h⁻¹. The samples are collected in the liquid phase and analyzed offline using GC (Varian #CP9205 column, FID detector). A detailed description of the equipment used and the catalyst synthesis methodology is published elsewhere [9].

2.1 Temperature Effects in Activity and Selectivity

At first glance, the number of catalysts with desirable performance is fairly low in the SiO₂-supported group when compared with our earlier results on Al₂O₃-supported catalysts. Whereas in the Al₂O₃ data set a substantial number of catalysts showed high conversions, in the SiO₂ data set only about 10% of the catalysts show conversions higher than 20%. Figure 2 shows the selectivity towards the unsaturated alcohol 2 as a function of the conversion of 1. The markers are shaded by test temperature. The observed temperature effect differs from that of the alumina-based catalysts. With alumina, 100% conversion was achieved in many cases already at 80 °C, and with increasing temperature the selectivity towards the unsaturated alcohol 2 dropped dramatically. With silica we see a more modest temperature effect.
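The quantities plotted in Fig. 2 can be illustrated with a minimal sketch. It assumes the conventional molar definitions of conversion, selectivity and yield; the text does not spell out the exact definitions used, and the function names and numbers below are hypothetical.

```python
def conversion(n_in, n_out):
    """Fraction of substrate 1 consumed (moles in vs. moles out)."""
    return (n_in - n_out) / n_in


def selectivity(n_product, n_in, n_out):
    """Fraction of the converted substrate that ended up as the product."""
    converted = n_in - n_out
    return n_product / converted if converted > 0 else float("nan")


def product_yield(n_product, n_in):
    """Moles of product per mole of substrate fed (conversion x selectivity)."""
    return n_product / n_in


# Hypothetical numbers, for illustration only (arbitrary molar units).
n_in, n_out, n_alcohol = 1.0, 0.4, 0.6
print("conversion :", conversion(n_in, n_out))
print("selectivity:", selectivity(n_alcohol, n_in, n_out))
print("yield      :", product_yield(n_alcohol, n_in))
```

With these definitions, a catalyst with full selectivity and moderate conversion gives a yield equal to its conversion.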

Observing Fig. 2 in more detail, one sees similarities between silica and alumina. Generally speaking, using diethyl carbonate as a solvent gives higher conversion compared with using 1,4-dioxane. This is caused by a lower barrier for adsorbing 1,4-dioxane compared with diethyl carbonate on the catalyst surface. Among the best results obtained in this set, two catalysts with full selectivity to the unsaturated alcohol 2 stand out. These are Pt and Rh catalysts with a conversion of 50–65%. Using 1,4-dioxane, this performance is achieved at 120 °C; using diethyl carbonate, it is reached at 80 °C or 100 °C. Maintaining high selectivity at high temperature is usually difficult, because over-hydrogenation reactions often take over at high temperature, ultimately resulting in products like dimethyl furan and dimethyl tetrahydrofuran. The results in terms of conversion of 1 and selectivity to 2 obtained for the silica catalysts containing Pd, Pt and Rh are given in Table 1. The data for all catalysts are given in the supporting information.

Table 1 Key hydrogenation results obtained for SiO₂-supported catalysts containing Pd, Pt or Rh

2.2 Effect of Main Metals and Promoters

To facilitate interpretation of the effects of main metals and promoters, all catalysts were prepared with a fixed main metal loading of 1 wt% and a fixed promoter loading of 10 mol% relative to the main metal. This 1:10 ratio should be sufficient to identify a promoter effect if present. We selected a diverse set of promoter metals, aiming to create large differences between promoters. The promoters include classical hydrogenation promoters (Cr, Fe, W), an adsorption promoter (Na), a poison (Bi) and a common promoter for the hydrogenation of aldehydes (Sn). The main metals are mostly typical noble-metal catalyst components (Au, Ir, Pd, Pt, Rh, Ru), but Ni is also included as a typical general-purpose hydrogenation catalyst. Cu is included since it is commonly used in the gas-phase hydrogenation of furfural to furfuryl alcohol [13, 14]. Most of the main metals used in our work were used before in the hydrogenation of (unsaturated) aldehydes with varying degrees of success. Platinum [15–20] and palladium [21] are the most common metals used.
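The fixed loadings translate into very small absolute promoter masses. The sketch below works through the arithmetic for a few metal pairs. It assumes that the 1 wt% refers to the main metal as a fraction of the total catalyst mass (the basis is not stated here), and the function name and parameters are hypothetical.

```python
# Standard atomic masses in g/mol (rounded); only a few metals are listed.
ATOMIC_MASS = {"Pt": 195.08, "Pd": 106.42, "Rh": 102.91, "Fe": 55.845, "Sn": 118.71}


def promoter_mass_mg(main, promoter, catalyst_mass_mg=1000.0,
                     main_wt_frac=0.01, promoter_mol_frac=0.10):
    """Promoter mass (mg) in a given catalyst mass.

    Assumption: main_wt_frac is relative to the total catalyst mass.
    """
    main_mg = catalyst_mass_mg * main_wt_frac          # mg of main metal
    main_mmol = main_mg / ATOMIC_MASS[main]            # mg / (g/mol) = mmol
    promoter_mmol = main_mmol * promoter_mol_frac      # 10 mol% of the main metal
    return promoter_mmol * ATOMIC_MASS[promoter]       # back to mg


for main, promoter in [("Pt", "Sn"), ("Rh", "Fe"), ("Pd", "Fe")]:
    mass = promoter_mass_mg(main, promoter)
    print(f"{main}/{promoter}: {mass:.3f} mg promoter per g catalyst")
```

Because the promoter is dosed on a molar basis, the promoter mass fraction differs from pair to pair even though the main metal loading is the same.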

The performance comparison in Fig. 3 clearly shows that Pd outperforms the other metals at all temperatures in terms of conversion. At 120 °C, only Pt comes close. Surprisingly, Ir only shows marginal activity when supported on SiO₂. Since ultimately selectivity is more important than conversion, both Pt and Rh should be considered for a closer investigation due to their high average selectivities.

2.3 Applying Data Mining Methodology

Principal Component Analysis (PCA) is often used to analyze large data sets in many disciplines. To accommodate the non-statistician reader, we include a short introduction below (adapted from [22] with permission of the author). Detailed descriptions of PCA and data mining in general are given elsewhere [23, 24]. The remainder of this section outlines the application of PCA to the combined data set reported here and in our previous work. We demonstrate the identification of major trends by analyzing the yields of identified components in the catalytic test and link these trends to the catalyst compositions and test conditions used. Please note that this type of modeling does not result in a predictive model. It will not predict what new combination of variables to use to obtain a better result. It merely aims to explain the existing data in a coherent manner.

2.4 A Simple Explanation of the Concept of PCA

Suppose that you have an experimental data matrix that contains the concentration profiles of 12,500 reactions, performed using 50 different catalysts tested with 50 different substrates under five different conditions, with each profile made of 10 points. This matrix contains 50 × 50 × 5 × 10 = 125,000 data points. It merits some serious thinking about data mining, because it is unlikely that you will see anything useful just by looking at 125,000 numbers. PCA can reduce this large matrix into two smaller matrices which are easier to examine and interpret. Using PCA, you can extract the key factors. These are the principal components, or PCs (sometimes also called the latent variables). Each PC is a linear combination of the original variables, but unlike the original variables, which may be correlated with each other, the PCs are orthogonal (i.e., uncorrelated with one another). Mathematically speaking, if X is an (I × J) matrix that contains J variables for I reactions, PCA divides this matrix into a systematic part TPᵀ (the PCA model) and a residuals part E, so that X = TPᵀ + E. T (I × R) and P (J × R) are two smaller matrices, the size of which depends on R, the number of significant PCs. T is the scores matrix. It represents the spread of the reactions within the model space. P is the loadings matrix. It describes the relationships between the variables. The geometrical representation of PCA is outlined in Fig. 4 for further clarification.
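The decomposition into scores, loadings and residuals can be sketched in a few lines. This is a minimal illustration using a singular value decomposition in NumPy, not the software used for the analysis reported here; the data are synthetic and the function name pca is hypothetical.

```python
import numpy as np


def pca(X, n_components):
    """Return scores T (I x R), loadings P (J x R) and residuals E (I x J)."""
    Xc = X - X.mean(axis=0)                        # mean-centre each variable
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    T = U[:, :n_components] * s[:n_components]     # scores: reactions in PC space
    P = Vt[:n_components].T                        # loadings: variables in PC space
    E = Xc - T @ P.T                               # what the R components leave out
    return T, P, E


# Synthetic data for illustration only: 100 "reactions" x 8 "variables",
# generated from two hidden factors plus a little noise.
rng = np.random.default_rng(0)
hidden = rng.normal(size=(100, 2))
X = hidden @ rng.normal(size=(2, 8)) + 0.1 * rng.normal(size=(100, 8))

T, P, E = pca(X, n_components=2)
print("shapes of T, P, E:", T.shape, P.shape, E.shape)
print("loadings orthonormal:", np.allclose(P.T @ P, np.eye(2)))
off_diagonal = T.T @ T - np.diag(np.diag(T.T @ T))
print("scores uncorrelated:", np.allclose(off_diagonal, 0.0))
```

The two checks at the end correspond to the orthogonality property described above: the loading vectors are mutually orthogonal and the score columns are uncorrelated.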

2.5 Description of the Data Set

The data set contains data for all binary combinations of metals (8 main metals and 6 promoter metals) supported on silica and alumina. This set of 96 catalysts is augmented with the unpromoted main metals, again supported on silica and alumina, resulting in a set of 112 catalysts in total. Each catalyst is tested in two solvents and at three different temperatures. All other conditions were kept constant. The yields of components 2–9 are used as variables for the PCA. Prior to model generation, the yields were scaled to zero mean and unit variance, to eliminate inherent differences in scale.
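The catalyst counts follow directly from the full factorial design, as the short enumeration below shows. The metal, solvent and temperature lists are taken from the text; the final count of observations is an inference that assumes every catalyst was indeed run under every solvent and temperature combination.

```python
from itertools import product

MAIN_METALS = ["Au", "Cu", "Ir", "Ni", "Pd", "Pt", "Rh", "Ru"]
PROMOTERS = ["Bi", "Cr", "Fe", "Na", "Sn", "W"]
SUPPORTS = ["SiO2", "Al2O3"]
SOLVENTS = ["1,4-dioxane", "diethyl carbonate"]
TEMPERATURES_C = [80, 100, 120]

# None stands for the unpromoted main metal.
catalysts = list(product(SUPPORTS, MAIN_METALS, PROMOTERS + [None]))
promoted = [c for c in catalysts if c[2] is not None]

# One observation per catalyst, solvent and temperature (assumed complete).
observations = list(product(catalysts, SOLVENTS, TEMPERATURES_C))

print(len(promoted))       # → 96
print(len(catalysts))      # → 112
print(len(observations))   # → 672
```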

2.6 Modelling Results

PCA analysis reduces the yields in the original parameter space to two PCs together explaining 68% of the overall variance in the original data. Practically this means we can describe almost 70% of what is going on in our experiments by looking at two factors rather than at the original eight. The remainder of this section outlines the key findings from interpretation of the model. A complete set of plots and the full scores and loadings tables are provided in the supporting information.
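To make the "explained variance" figure concrete, the sketch below autoscales a yield table and computes the fraction of variance carried by each PC. The data are synthetic placeholders, so the printed percentages are not the values reported here; the function names are hypothetical and the 672 × 8 shape is an assumption based on the design described above.

```python
import numpy as np


def autoscale(Y):
    """Scale every column to zero mean and unit variance."""
    return (Y - Y.mean(axis=0)) / Y.std(axis=0, ddof=1)


def explained_variance_ratio(Z):
    """Fraction of the total variance carried by each principal component."""
    s = np.linalg.svd(Z - Z.mean(axis=0), compute_uv=False)
    variances = s ** 2
    return variances / variances.sum()


# Synthetic stand-in for a table of 8 yields (observations x variables),
# driven by an "activity" factor and a "selectivity" factor plus noise.
rng = np.random.default_rng(1)
activity = rng.gamma(2.0, 1.0, size=(672, 1))
selectivity = rng.normal(size=(672, 1))
Y = (activity * rng.uniform(0.5, 1.5, size=(1, 8))
     + selectivity * rng.normal(size=(1, 8))
     + 0.5 * rng.normal(size=(672, 8)))

Z = autoscale(Y)
ratio = explained_variance_ratio(Z)
print("variance explained per PC:", np.round(ratio, 3))
print("first two PCs together   :", round(float(ratio[:2].sum()), 3))
```

Autoscaling matters here because yields of minor products would otherwise contribute almost nothing to the total variance and be ignored by the model.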

Looking at Fig. 6a, we see that the loadings for all yields are positive in the first PC. This simply means that this first PC is directly correlated with catalyst activity. Every observation with a high score on the first PC will have a high conversion and every observation with a low score on the first PC will have a low conversion. Translating this observation to the scores plot (Fig. 5) implies that generally conversion increases from left to right. This means we can effectively explain a large portion of the differences in conversion using the first PC. Looking at the loadings on the second PC (Fig. 6b), we see large differences in effect size (magnitude of the bar). Moreover, the effects have different directions. This gives information about the relationship between the variables. For example, the yield of the saturated aldehyde 3 is anti-correlated with the yield of the etherification product 5. This means that if a high yield of 3 is found, a low yield of 5 is expected and vice versa. From a chemistry point of view this makes sense, because if the furan ring has been hydrogenated to produce 3, the formation of the unsaturated component 5 is no longer possible. More generally, the second PC gives information about the selectivity of each observation.

Combining these two PCs allows us to quickly find the maximum yield of a given component in the scores plot. For example, a high yield of the saturated aldehyde 3 is expected at a moderately positive score on the first PC based on the loading for y(3) in Fig. 6a and a strongly positive score on the second PC based on the loading for y(3) in Fig. 6b. Following the same reasoning, we can expect a high yield of the etherification product 5 at a strongly positive score on the first PC and a strongly negative score on the second PC. Subtle trends can also be identified. An example of this is the saturated alcohol 4, which only occurs in small amounts (a maximum of 5% yield and an average of only 0.1% yield across all data points). This component may easily be overlooked by conventional visual data analysis. In our case, the PCA model shows in Fig. 6b that a “high” yield of 4 always coincides with a high yield of etherification product 5. This is explained by the fact that high amounts of 5 can only be formed if substantial quantities of its direct precursor 2 are formed first. Instead of undergoing etherification to 5, a small portion of the unsaturated alcohol 2 could be further hydrogenated to the saturated alcohol 4. Only a minor amount of the unsaturated alcohol follows this path because it requires a change in adsorption mode (ring vs. oxygen).

Turning our attention to the effect of the metals used, the first thing that should be noted is that the two most successful silica-supported catalysts in the data set are not explicitly identified by the model at hand. This is not an error, but simply a confirmation of the most important attribute of PCA models: only trends valid across the majority of the data are identified. Less systematic trends, like two odd catalysts outperforming the other catalysts, will not be identified. Clearly, this implies that PCA should only be used together with conventional data analysis and not as its replacement. One of the major trends that can easily be identified is the high selectivity of Pd catalysts towards the saturated aldehyde 3. From Fig. 6 we learn that a high yield y(3) coincides with a high score on both the first and second PC. This translates to the top-right quadrant in the scores plot (Fig. 5), which is almost exclusively populated with Pd catalysts.

3 Conclusions

We found that Pt and Rh supported on silica are efficient catalysts for the selective hydrogenation of 1 to the unsaturated alcohol 2, obtaining 100% selectivity for conversions up to 66%. We also found substantial promoter effects for Pd, Pt and Rh. Especially in the case of Rh, the unpromoted SiO₂/Rh catalyst shows negligible conversion at all test temperatures, while the SiO₂–Rh/Fe catalyst shows a conversion of 55% at 120 °C. Using PCA, we showed that the major trends in a large data set can be easily identified using standard data mining methodology. Effects of support, temperature and solvent are readily found and effects of metals can also be identified.

Comparing the results of the silica- and alumina-supported catalysts, one sees some major differences. The most obvious is the much lower activity typically found for the silica catalysts. This can be explained by the much lower background activity of the unmodified support. Whereas bare alumina already gives 50% conversion at 80 °C, silica gives no conversion. From a process point of view, the Ir catalysts on alumina are promising due to their near full conversion and selectivity at low temperature. However, from an economic perspective the Pt catalysts on silica are more attractive on the basis of metal price and availability. Full selectivity in combination with moderate conversion provides a basis for a recycle process.

4 Supporting Information Available

The full table of results for both alumina and silica data sets is provided in the supporting information as well as the scores and loadings of the PCA model.

References

1. Nowakowski DJ, Jones JM (2008) J Anal Appl Pyrolysis 83:12
2. Gallezot P (2007) Catal Today 121:76
3. Mascal M, Nikitin B (2009) ChemSusChem 2:859
4. Lee S, Eom M, Yoo K, Kim N, Jeon J, Park Y, Song B, Lee S (2008) J Anal Appl Pyrolysis 83:110
5. Kiss AA, Dimian AC, Rothenberg G (2006) Adv Synth Catal 348:75
6. Kiss AA, Dimian AC, Rothenberg G (2008) Energy Fuels 22:598
7. Gordillo A, Durán Pachón L, de Jesus E, Rothenberg G (2009) Adv Synth Catal 351:325
8. Moreau C, Belgacem MN, Gandini A (2004) Top Catal 27:11
9. Ras EJ, Maisuls S, Haesakkers P, Gruter GJ, Rothenberg G (2009) Adv Synth Catal 351:3175
10. Berglund A, DeRosa MC, Wold S (1997) J Comput Aided Mol Des 11:601
11. Janné K, Pettersen J, Lindberg NO, Lundstedt T (2001) J Chemom 15:203
12. Kettaneh N, Berglund S, Wold S (2005) Comput Stat Data Anal 48:69
13. Hubaut R, Daage M, Bonnelle JP (1986) Appl Catal 22:231
14. Rao R, Dandekar A, Baker RTK, Vannice MA (1997) J Catal 171:406
15. Serrano-Ruiz JC, Lopez-Cudero A, Solla-Gullon J, Sepulveda-Escribano A, Aldaz A, Rodriguez-Reinoso F (2008) J Catal 253:159
16. Wu J, Shen Y, Liu C, Wang H, Geng C, Zhang Z (2005) Catal Commun 633
17. Mahata N, Goncalves F, Pereira MFR, Figueiredo JL (2008) Appl Catal A 339:159
18. Silvestre-Albero J, Serrano-Ruiz JC, Sepúlveda-Escribano A, Rodríguez-Reinoso F (2008) Appl Catal A 351:16
19. Shirai M, Tanaka T, Arai M (2001) J Mol Catal A: Chem 168:99
20. Zhao F, Ikushima Y, Shirai M, Ebina T, Arai M (2002) J Mol Catal A: Chem 180:259
21. Tanaka S, Mizukami F, Niwa S, Toba M, Tasi G, Kunimori K (2002) Appl Catal A 229:175
22. Rothenberg G (2008) Catalysis: concepts and green applications. Wiley-VCH, Weinheim. ISBN 978-3-527-31824-7
23. Jackson JE (1981) J Qual Tech 13:341
24. Rothenberg G (2008) Catal Today 137:2

Acknowledgements

We thank the Netherlands Organization for Scientific Research (NWO) for funding received under the Casimir programme.

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Author information

Authors and Affiliations

Avantium Technologies B.V., Zekeringstraat 29, 1014BV, Amsterdam, The Netherlands
Erik-Jan Ras & Ben McKay
Van ‘t Hoff Institute of Molecular Sciences, University of Amsterdam, Nieuwe Achtergracht 166, 1018 WV, Amsterdam, The Netherlands
Erik-Jan Ras & Gadi Rothenberg

Corresponding author

Correspondence to Gadi Rothenberg.

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

About this article

Cite this article

Ras, EJ., McKay, B. & Rothenberg, G. Understanding Catalytic Biomass Conversion Through Data Mining. Top Catal 53, 1202–1208 (2010). https://doi.org/10.1007/s11244-010-9563-z

Issue Date: September 2010
