Introduction: A Suggestive “new” Diagram Style from 1905

Many students of geometric morphometrics (GMM) as applied in evolutionary biology will already have encountered some version of Fig. 1a, the standard display of the Procrustes shape coordinates for the neuroanatomical octagon of midsagittal landmarks from the Vilmann data set of laboratory rodents radiographed longitudinally at eight ages (7 days, 14, 21, 30, 40, 60, 90, and 150 days). The raw data, initially published as Appendix A.4.5 of (Bookstein 1991), are currently among the archives of several on-line GMM data resources. My analyses in this paper are restricted to the 18 animals (out of an original 21) with complete data for all eight landmarks at all eight ages. By calling these coordinates “Procrustes shape coordinates” one means that they have been standardized specimen by specimen in four different aspects: horizontal average coordinate, vertical average coordinate, orientation (zero net rotation per specimen around the average, see Appendix A.2), and squared Centroid Size (CS), which is the summed squared distances of all the landmarks from their centroid case by case.

This analysis, which first appeared in Bookstein (1991), has been reproduced in many later publications, including several of the standard textbooks (Dryden and Mardia 1998, 2016; Bookstein 2014, 2018), and the data set it visualizes has served as exemplar for diverse advanced GMM analyses, such as integration (Bookstein 2015), factor analysis (Bookstein 2017), and a variant of descriptive finite-element analysis (Bookstein 2018: Sect. 5.2) borrowed from biomechanics.Footnote 1 For instance, Fig. 1b shows the first relative warp (RW, shape principal component) of these 144 configurations in the usual iconographic, a thin-plate spline. This particular grid represents the deformation from the sample average shape to its modification by a suitable multiple of this relative warp (here, four standard deviations of the corresponding principal component score) in the direction of greater age. This first RW is indeed strongly correlated with age. This is already clear in the display of Fig. 1c, the average shapes at the two extremes of age: clearly cranial base length changes with age by a greater factor than height above it does among the features that leap to the eye.

Fig. 1
figure 1

Aspects of the standard Procrustes analysis for the Vilmann data set. a Scatter of all eight Procrustes shape coordinate pairs for 18 animals radiographed at eight ages each. b Thin-plate spline (deformation) visualization of a convenient multiple of the first relative warp of these data with respect to the overall average shape. c Superposition, still in Procrustes pose, of the average shapes of the animals at age 7 (solid lines) and age 150 (dashed lines). Illustration at top: anatomy of the eight landmarks in situ, in anatomical orientation instead of the Procrustes orientation of the other panels. Clockwise from left: Opisthion, Interparietal point, Lambda, Bregma, Sphenoethmoid synchondrosis, Intersphenoidal synchondrosis, Sphenoöccipital synchondrosis, Basion

Since RW1 is correlated with age and age in turn is correlated with size, it might be useful to display the actual effect of age on form, not just on shape. To do so, it is most straightforward to “put size back into” the analysis using the same size measure CS that was used to standardize the size of all the specimens during the Procrustes normalization. This is arranged not by augmenting the shape coordinates by one additional scalar for log Centroid Size (the prescription for rising from Procrustes shape space to form space, see Mitteroecker et al. 2004; Bookstein 2018) but instead by multiplying each configuration of Procrustes shape coordinates by the same factor of CS that was divided out in the course of producing those Procrustes shape coordinates in the first place. In this paper these remultiplied configurations will be called Boas coordinates, as in Bookstein (2018), for reasons to be explained in the next section. To restore the functional for size that was divided out is almost equivalent to omitting one of the four standardizations in the standard generalized Procrustes analysis (GPA), namely, the scaling of each form to unit Centroid Size. (Its omission alters the average form, but only very, very slightly: see Appendix A.2.) The horizontal and vertical centering and the rotation to zero net rotation specimen by specimen around the common centroid remain.

After this modification (a truncation, really) of the standard Procrustes algorithm, the scatter of the 144 octagonal configurations appears as the remarkably different-looking display in Fig. 2a. Figure 2b shows the thin-plate spline for the first principal component of the coordinates in Fig. 2a using the same convention as in Fig. 1b, deformation from the mean to a suitable multiple of this component, while Fig. 2c compares the average over the 18 age-7 configurations to their average over the 18 age-150 configurations, for ease in appreciating spatial aspects of the growth rates of various interlandmark distances (see Bookstein 2018, Fig. 5.21).

Fig. 2
figure 2

Analogue of Fig. 1 for the Boas coordinates that back out the scaling step of the Procrustes algorithm. Orientation remains the Procrustes orientation (principal axes horizontal and vertical) already used in Fig. 1. a Centered, reoriented coordinates to the original scale as digitized by Melvin Moss around 1980. b First principal component of these 16 Cartesian coordinates, visualized by a thin-plate spline of the average form. c Averages of the octagons for the age-7 animals (solid lines) and age-150 animals (dashed lines). While frames b are virtually identical between this figure and the preceding figure, frames a and c are entirely different as the scaling step of the Procrustes algorithm is accepted (Fig. 1) or backed out (Fig. 2)

Figures 1b and 2b are almost identical. In other words, the statistical analysis of the growth of this rodent neuroanatomical octagon proceeds just as well using the Boas coordinates that canceled the Procrustes scaling step as it does using the standard Procrustes procedure with scaling. Better, in fact: from Fig. 2c one can extract the actual growth rates along diverse transects of these octagons, calibrations that are inaccessible from the Procrustes analogue in Fig. 1c.

But other aspects of the multivariate analysis have changed drastically. First, its language: the algebra of Fig. 1 is least-squares in “Procrustes distance,” which has no physical units, whereas that for Fig. 2 is least-squares in ordinary Euclidean distance on the digitizing tablet or the printed page, so that the variance explained by this and subsequent decompositions of the Boas coordinates comes in units of ordinary squared Euclidean distance, perhaps \(\hbox {mm}^2\). Partly as a consequence of this transformation of metric structure in the context of the actual process, growth allometry, that this data set is illustrating, the scatter of the first two relative warp scores in Boas space, Fig. 3b, is enormously different from that of the corresponding pair for the usual Procrustes coordinates, Fig. 3a. Principal component 2 (PC2) of the Boas coordinates is far weaker than relative warp 2 (RW2) of the Procrustes shape coordinates, for instance, while the separation of the age-7 forms from the others (lower left corner) is much more pronounced in the Boas setting, as is the excessive size of the single largest form in the data set (panel B, solid dot at PC1 = 122, PC2 = 1). The formulas for the horizontal coordinate in these two schemes have been graphed as thin-plate splines in Figs. 1b and 2b; the vertical coordinates will be diagrammed presently.

The most important contrast between the analyses is the discrepancy between the two graphical versions of the formula for the first principal component that, as we have seen, represents growth by equivalent diagrams of gradients in both of the analyses. Figure 4a shows RW1 as a displacement of each average shape coordinate by a vector corresponding to that RW’s loadings at length 0.5 (in units of sums of squared Procrustes distances accounted for); Fig. 4b, the same for the loadings of Boas PC1 as displacements from each average Boas coordinate, in units of squared Euclidean distance accounted for. The Boas display is much more suggestively structured than the Procrustes display. Figure 4b summarizes the Vilmann growth pattern as a centric allometry, displacement of all the landmarks nearly radially outward from the centroid, albeit at rates that vary from landmark to landmark. The Procrustes display, in spite of being a representation of the same growth process, permits no such simple verbal summary—some landmarks appear displaced outward along a central radius, others inward, and still others are displaced oblique to their radii at up to \(90^\circ \).Footnote 2

Fig. 3
figure 3

Scatters of the first two principal component (PC) scores for these 144 landmark configurations. These dimensions are also called the principal coordinates of the squared Procrustes distances and Boas distances, respectively (summaries of the sums of squared coordinate differences from specimen to specimen in Fig. 1a or 2a). In both frames, the youngest animals are at the left. a For the Procrustes shape coordinates. Horizontal, relative warp 1 (RW1); vertical, RW2. b For the Boas coordinates, PC1 likewise horizontal. In both panels the ordinate is uncorrelated with age, but it represents a far lower proportion of variance in panel b, the Boas analysis. Also note the clarity of the separation between the youngest 18 configurations in panel b vis-á-vis that in a. Symbols: \(\times \), age-7 animals; •, age-150 animals; \(+\), animals at the other six ages (14 days, 21, 30, 40, 60, and 90 days)

But notice that the shape of the configuration of \(\times \) symbols is the same between the panels of  Fig. 4—it is more or less the shape of the age-150 average configuration. In effect, all that has changed between the panels is the scaling of that imputed mean configuration with respect to the final average form. Yet only in Fig. 4b, the Boas representation of the data in their original Cartesian units, does the graphic rendition of PC1 align with the correct biological explanation (allometric growth) of this difference between the octagons. Insofar as rotation is not a component of the allometric style of explanation, it is Fig. 4b, not Fig. 4a, that authorizes the investigator to omit any remark beyond the apparent irrelevance of such a consideration.

Fig. 4
figure 4

Two representations of centric allometry, the diagrammatic innovation put forward in this essay. a As applied to the Procrustes mean shape, scaled to the mean at the oldest age, Fig. 1c. b As applied to the Boas mean form, again scaled to to the mean at the oldest age, Fig. 2c. The text argues that the representation in panel b is superior by far as a representation of growth allometry, the real biological process that this data set was explicitly designed to capture. BPC1: First principal component of these Boas coordinates

The caption of Fig. 2 claims that Figs. 1b and 2b are “virtually identical,” and indeed they do look quite similar. Yet the two panels of Fig. 3 do not much resemble each other, nor the two panels of Fig. 4. A claim that the methods are numerically equivalent—that the deformations of RW1 and BPC1 are almost exactly the same when interpreted as transformation grids—must be based on more than a subjective judgment. Figure 5 presents the necessary argument in explicit graphical form. The two panels here correspond not to the two types of analysis (that antinomy is coded in the plotting symbols instead) but to the two different spaces, Procrustes space or Boas space, in which the demand for a quantitative confrontation might be couched. In panel A the pair of linear composite scores are plotted by their coefficients as they would be diagrammed in Procrustes space. The open circles are the coefficients of the first conventional relative warp of these Procrustes shape coordinate octagons, plotted in correct proportion as vectors out of the Procrustes average shape. The solid circles are a different quantification, a sort of hybrid whereby the displacements of Boas coordinates in their Cartesian space are Procrustes-fitted to the original mean form by an adjustment of their size (an adjustment that is, of course, nonsense in the Boas context per se). When the two composites, RW1 and BPC1, are plotted to the same net Procrustes magnitude, the octets of vectors would have plotted directly atop each other had I not exaggerated the contrast between them by a factor of three before plotting. The crossproduct of these two 16-vectors is 0.99956, corresponding to an angle in their space of about \(1.7^\circ \) (the angle that the minute hand of a clock rotates in about 17 s).

In panel B the same hybrid display is presented in Boas space instead of Procrustes space. The solid circles are now just the displacements corresponding to Boas PC1, whereas the open circles correspond to landmark-by-landmark displacements of Procrustes RW1 after a rescaling (an adjustment that is, of course, nonsense in the Procrustes context) in order to have the same ratio of net Euclidean magnitudes (Centroid Sizes). Here the contrast is marginally visible without any need for the threefold magnification. From either panel one draws two obvious conclusions: first, the deformations we are discussing—the deformations of allometric growth of these rat neurocranial octagons—agree as regards their gradients everywhere across the form; but, second, the interpretation of those gradients in terms of Cartesian coordinates is entirely different between the two spaces.

In view of this identity of meaning between the two panels of Fig. 4 (up to a scaling step that can easily be shown to be biologically meaningless, see Appendix A.1), the contrast between the two panels of Fig. 3 becomes even more salient, inasmuch as the horizontal coordinates of these two plots are so obviously not the same between the analyses. That is, of course, because Centroid Size, a specific correlate of the allometric growth under study, has been divided out of the analysis in panel A but remains implicit in panel B. The inescapable conclusion is that the score on Procrustes RW1 is irrelevant to the understanding of allometric growth, precisely because it does not capture the appropriate size-dependence explicitly modeled in BPC1.

Fig. 5
figure 5

Demonstration of the equivalence of Procrustes RW1 and Boas PC1 as deformations. a Contrast of the two octets of displacement vectors, exaggerated threefold, after BPC1 is projected down into Procrustes shape space. b The same after RW1 is scaled up into Boas space, this time without exaggeration of their contrast. Open circles: shifts corresponding to RW1. Solid circles: shifts corresponding to BPC1. Asterisks: landmark-by-landmark means in the appropriate space, weighted by the inverse of Centroid Size in a and unweighted in b

The advantage of the Boas analysis in a study of growth allometry is particularly evident in one specific aspect of Fig. 4b, the near-collinearity between each solid segment and its matching dashed segment. Such an alignment permits a description in which each landmark grows away from their common center without much additional rotation, so that the distances from the center at these fixed angles become a sufficient summary of the growth pattern. Moreover, the ratio of the two distances at each landmark is a good representation of the specific growth rate assignable to that landmark: maximal at the two ends of the cranial base, minimal at the landmarks in-between and along the top of the calva. The Procrustes diagram, Fig. 4a, proffers no such description in millimeters, as every verbalization of the pattern there would have to include the word “relative”—the relative rate of expansion of height with respect to increase of cranial base length, etc. Whenever the approximate collinearity so apparent in Fig. 4b obtains, the first Boas PC, by remaining in units of length, is better serving the function that Centroid Size was designed to serve in the Procrustes approach: a “best possible” size measure. In other words, when BPC1 manifests this collinear pattern, the diagram assures us that those aspects of form being ignored by the centroid-to-landmark distance description—the rotations of the individual landmarks around the pooled version that kept the whole configuration rigid—are negligible with respect to the growth changes at \(90^\circ \) to them, the growth rates of these centric distances themselves. Hence the name I have suggested for this technique, centric allometry.

A comparison of Fig. 4b with Figs. 1b or 2b reinforces this “irrotational” property of Boas PC1 analysis. In either thin-plate spline figure there is a clear contribution to RW1 that is a relative repositioning of the landmark Opi with respect to its pair of neighbors Bas and IPP: the distance Opi–Bas grows much more slowly than the distance Opi–IPP. This effect, however, is azimuthal, not radial: it pertains to distances along the perimeter of this octagon, not outwards from its center. By design, the contrasting style of diagramming a Boas analysis in Fig. 4b attenuates this feature—when what is shown is a much smaller multiple of this same first PC, there is only a small hint of an azimuthal effect at Bas. Analyses of this growth data set published earlier have identified this phenomenon in the vicinity of Opi either as an expression of the feature of smallest scale in this octagon, partial warp 5 of the standard toolkit (Bookstein 2018), or as a similarly scale-inappropriate factor that emerges from the alternative factor-analysis machinery of Bookstein (2017). It is indeed part of a description of growth, but it does not participate in the large-scale summary that is our concern in this paper. Put another way, the information that is de-emphasized in Fig. 4b about this strong local effect is still present in the display of Fig. 2b, which is a different graphical representation of the same principal component BPC1. Evidently a competent analysis of this data set requires inspection of both.

The question immediately arises of why Fig. 1 was ever constructed—why a study of growth allometry should have been situated in Procrustes shape space at all. Why should this author and every later investigator into the multivariate structure of this classic data set ever have bothered to divide out Centroid Size in investigations where the ultimate organismal interpretation will require that division to be reversed?Footnote 3 For studies along these lines, this paper has introduced a new analysis of landmark configurations that does, indeed, cancel out the effects of this specific biologically inappropriate step in the standard algorithm. Yes, there is a loss of mathematical elegance, several aspects of which will be touched on in the next section; but the gain in empirical explanatory power for the original scientific purpose, that of understanding patterns of organismal growth, more than compensates. Moreover, the standard reduction of form-variation to size, affine shape, and partial warps (see Bookstein 2018, Chapter 5) proceeds in parallel except that size is now folded into the affine component, where it could have been encoded all along.

The outline of the rest of this paper is as follows. Section 2 briefly reviews the reason that Boas coordinates have that name, compiles some early references to the comparative biometrics of organisms in terms of “size” versus “shape,” and surveys the small biometric literature that adumbrated this notion of centric allometry a few decades ago from a morphometrically less sophisticated point of view. Section 3 extends the analysis to a maneuver inaccessible from the standard Procrustes toolkit: the joint subsetting of landmark lists along with sample subgroups in order to enhance the power of the hypothesis of growth allometry, as explicated via a simplistic phylogenetic example from hominoid evolution. A closing discussion (Sect. 4) explains why D’Arcy Thompson ought not to be credited with any of these ideas as either evolutionary biologists or developmental biologists now apply them; then considers the competing versions of the role of explicit experimental data in studies of growth as promulgated by Hans Przibram in the 1920’s, versus Peter Sneath in the 1960’s; and, finally, explores the comparative costs and benefits between scaled and unscaled approaches (analysis of Procrustes coordinates versus Boas coordinates) in the design of biometric tools for analysis of organismal growth. Following these remarks is an Appendix reviewing three aspects of the algebra of the proposed method—Centroid Size, specimen-by-specimen rotation, and the advantages of the Boas principal component method for representing centric allometry—in greater detail.

GMM and Allometry

A hundred fifteen years ago the pioneering American anthropologist Franz Boas published a suggestion so simple that it occupied only two columns in the issue of Science in which it appeared (Boas 1905): that superposition of multiple forms in the service of anthropological investigations should not be by any two-point registration (a preferred point and a preferred interlandmark segment out of it, both fixed in position on the page) but instead by a simple criterion of least squares.Footnote 4 Boas’s note is entitled “The horizontal plane of the skull and the general problem of the comparison of variable forms,” but its rhetoric treats those two phrases as contrasting, not complementary: his closing comment is that if his new least-squares method is embraced, the “arbitrary element in composite drawings or photographs,” namely, the choice of a reference segment between some pair of landmarks to serve as the operationalization of horizontal, “may be eliminated.” (This technical issue in craniofacial biology endured for decades: see, for instance, the critique in Moyers and Bookstein (1979).)

To state that the criterion of fit is “least-squares” is in fact to solve the problem in an instant. Boas’s algorithm for this fit has two steps: superposition of the centroids of all the forms at a common point, which may as well be the origin of coordinates, and then reorientation around that common centroid to least-squared differences by precisely the same formula that all of our Procrustes toolkits find themselves using here in 2020—it sets to zero the landmark-weighted reorientation of each form around their average as referred to this common centroid. This is itself an elementary theorem: the least-squares fits between two configurations involve the same rotation whether or not they are rescaled. It follows directly, for instance, from the regression formula (5.7a) on p. 411 of Bookstein (2018), or equivalently from the singular-value notation in Rohlf and Slice (1990). (See Appendix A.2 of this article for details.) Boas’s own student Eleanor Phelps, for example, avoided any discussion of shape when she adapted his 1905 method of unscaled superposition to the problem of quantifying variability of different midsagittal landmark configurations (Phelps 1932). All of her diagrams are explicitly declared to be in centimeters; the technology is unmistakably the same appeal to least-squares as in Boas’s original note of 27 years earlier.

Some more recent representations of the variability of cranial form, though not emulating Boas’s exploitation of least-squares methodology, have likewise presented unrescaled superpositions in this way: see, for instance, Delattre and Fenart (1960) or Moss et al. 1983. Robert Corruccini’s method of Cartesian coordinate analysis (Corruccini 1981) reviews a range of late 20th-century approaches to direct analysis of coordinates, some “following the data normalization procedure advocated by Sneath” (in effect, the Centroid Size scaling of classic Procrustes) and some not. Corruccini’s own preference (p. 34) is for the version that we would now call the explicit scaling to Centroid Size unity—the presentation in Fig. 1.

Boas’s idea of a superposition that optimizes summed squared specimen-to-specimen landmark distances over position and orientation implicitly assumes that neither position nor orientation is actually a meaningful biological quantity. Both of these propositions are arguable in particular settings. Position surely matters for discussions of functions like locomotion for which lever arms over a landscape are relevant. And orientation matters in at least two different contexts: when function has to do with the local horizontal, or when one aspect of the form is an axis or plane of symmetry. (Boas himself already mentions this latter concern, as in the presence of bilateral symmetry the generalized three-dimensional rotation of a solid skull reduces to a two-dimensional problem.) Speculations on the relevance of criteria like these should be a part of any investigation of landmark coordinate data prior to an invocation of either Boas’s algebra or Procrustes algebra in the course of their preparation for some multivariate statistical analysis.

But the discussion must become a great deal more sophisticated when the issue is whether or not to carry out a size standardization. Size per se is a quantification of enormous biological significance for both growth and function, serving as it does variously for biomass, for age, and for aspects of organismal scale that often need to be accommodated in the pursuit of subtler explanations. This multiplicity of roles for size in biomathematics aligns with a distinction of purposes between taxonomy and growth analysis together with the historical accident of an early but widely cited and appreciated application of grids in numerical taxonomy. As Sokal and Sneath (1963, p. 80) put it, “allometry is a problem related to the effect of environment on characters ...and to the problem of redundancy and empirical correlation (that is, the crude measures may depend on a small number of underlying causes).” That redundancy, which the taxonomist evidently treats as a confound, is instead the actual focus of studies under the rubric of growth allometry that are this paper’s concern. Burnaby (1966) makes the claim even more explicitly: when variation due to growth is “extraneous to the desired comparisons,” it serves solely as a “nuisance factor in morphometric work” that needs to be “eliminated” from diverse other multivariate methods for population comparisons. The method of “trend-surface analysis of transformation grids” that Sneath (1967) would introduce four years later turns explicitly to this issue of the “small number of underlying causes” for data sets involving multiple landmarks without, however, reopening this question of why we should be dividing out any summary second-moment quantity in the first place. We will shortly return to this concern for a better methodology of residualization in respect of size by altering the list of dimensions undergoing standardization to include an additional term from the lore of GMM, the uniform component.

In addition to this consistently overlooked intellectual aspect of the Procrustes approach, several anomalies in the approach to allometric analysis ascribed to “standard GMM” in the figures of the previous section are worth noting that this Boas method, the cancellation of scaling, neatly circumvents. The most fundamental of these anomalies arises from the conventional notion that any analysis that is to be called “allometry” must involve the separation of the data signal into two channels, “size” and “shape” in some paired sense. For instance, Sneath’s great paper of 1967, which was one of two principal stimuli for my own dissertation a decade later (the other was Gould 1966), states at the outset, as an axiom requiring no defense, that once “corresponding points” have been marked on two or more images of organisms, “the diagrams are scaled and fitted to give the best possible fit; this gives measures of size and shape difference.” Why “scaled and,” and why “size and shape”? Corruccini’s approach just reviewed likewise simply glides over this crucial decision, insofar as his interest is not in the growth of his specimens (the two species of hylobatids) but their distinction as species, given that he already knows Symphalangus is larger.

Writing at about the same time from a quite different disciplinary perspective, Gould (1966) embraces this same anomaly when, at the very outset of his argument, he says that he uses the term ‘allometry’

in its broadest sense, to designate the differences in proportions correlated with changes in absolute magnitude of the total organism or of the specific parts under consideration.

But this is certainly not the “broadest sense” of this term, which had already been formalized by Huxley (1932) in his notation of power laws relating multiple size measures on the same organism. Gould may have been an accidental victim here of the syntax of the English language—the title of his article, “Allometry and size in ontogeny and phylogeny,” suggests that “size” needs to be contrasted with something, whereas a title like “size and size in ontogeny and phylogeny” would be more consistent with the actual data analysis problem at hand; but a phrase like “size and size” seems very peculiar in English (although the same phrasal structure with a comparative adjective, “bigger and bigger,” would not be). This is not a new point: see, for example, Bookstein (1989). In summary, a rhetoric of “size versus shape” is inappropriate for studies of growth per se.

Most of Gould’s reflections on how to discuss issues of “allometry and size” concerned either qualitative descriptors of shape classes or else “proportions” construed as explicit decimal ratios between prespecified pairs of single quantitative variates. Only at the end of his article, Sect. VIII, did he broaden his scope to consider “total form alteration.” In this context he recognized three categories of method: transformed coordinates sensu D’Arcy Thompson, “sets of allometric equations” such as growth gradients, and “multivariate analysis,” here meaning the computation of principal components and their interpretation as factors in keeping with later expositions such as Blackith and Reyment (1971), Reyment et al. (1984), or Reyment and Jöreskog (1993). (The Boas method of the present essay would be categorized as a composite of the second and third classes here, growth gradients and multivariate analysis.) In any of these extensions of what was originally a bivariate rhetoric, the notion of a “proportion” (a single discrete ratio of particular interest) has vanished. Once the morphological complexity of a data set has risen toward our contemporary standard, no dichotomy remains between a “Gould–Mosimann school” and a “Huxley–Jolicoeur school” of biological interpretations (contra Klingenberg 2016)—the whole idea of “proportion” is an atavism of its origins in the language of Galileo’s time, when notions of biometrics were indeed a novelty. For half a century now we have been well beyond that stage’s focus on bivariate scatterplots and the corresponding simple linear regressions or residuals.

An even broader sense of this concept of allometry, hinted at by Gould’s refusal to define what he meant by “absolute magnitude,” involves the interaction between notions of the comparison of size measures vis-á-vis those of shape. When applied in studies comparing the sizes of anatomical substructures, the notion of “size” embraces not only quantities of root-mean-square type, such as the Centroid Size that the present article will compare to the least-squares-optimal score on Boas principal component 1, but also such familiar measures as perimeter and area, which, although appearing symmetric in their treatment of individual coordinates, nevertheless show enormous sensitivity to the actual shapes of the anatomical components they are intended to quantify. For example, a pair of four-landmark forms of which one is a square of side 1 and the other a rectangle of sides 0.129 and 3.871 have areas in the ratio of 2:1 but perimeters in the ratio of 1:2—if the space of alternative size measures is not delimited in some theoretically sensible fashion, the ratio description of allometry is uncertain even as to polarity. The ratio of Centroid Size measures for this pair of forms is even more extreme, more than 2.7 to 1. This shape change is not intended to be biologically realistic, of course, but only serves to illustrate the algebraic pathology here. Jim Rohlf (pers. comm.) is fond of pointing out a similar problem that afflicts numerical taxonomy: classifications, clusterings, or cladograms based on different choices of a distance measure between taxa can yield entirely different estimated phylogenies. And the Procrustes world does not have access to any equivalent of the molecular-clock models attempting to ground such a choice in biologically meaningful quantities measurable by other technologies than imaging. Procrustes distance, in other words, should not be considered a biologically meaningful quantity, but only a human artifact along the lines of the sums of squares and crossproducts that appear in many other biometrical contexts.

Thus the words “size” and “shape” are not terms of art in morphometrics—they cannot be assigned operational meanings a-priori, but must be defined explicitly in the context of each study separately. In particular, for landmark-based growth studies, the division of the explanatory realm into two classes of reportage, “size” and “shape” without any algebraic notation, the way Gould does it, is not a scientifically productive strategy, separating as it does a multivariable signal homogeneous in units into two incommensurate channels (one in centimeters, one that is technically unit-free) whose relationship is too confounded a function of the unstated algebraic details of their measurement. Both the experimental programmes of Hans Przibram’s Vienna Vivarium (sketched below in the Discussion) and the power-law models of Julian Huxley (1932) instead dealt with a multiplicity of length measures by explicit tabulations. The description of a data set of Boas coordinates by the centric allometry language of this paper can be considered a special type of these (see Appendix A.3). The biometrics of growth per se, as distinct from the biometrics of phylogeny, can go forward perfectly well as the biometrics of some laboratory-derived series of growing numbers—there is nothing intrinsically important about the space of their ratios, which is to say, geometric shape (see Bookstein 1986).Footnote 5

An insistence on separating “size” from “shape” may have served the purposes of numerical taxonomy, which (forgive the oversimplification) aims more centrally at discovering groups than at classifying specimens into groups or at understanding functional morphology or morphogenesis within those groups. In Sect. 5.3.7 of Sokal and Sneath (1963), for example, the coefficients of an allometric relation are described as characters in their own right for studies in which the units are taxa rather than individuals and where size range per se is a less useful quantification. It is possible that this segregation of multiple centimeter-scaled mensurands into two channels, “size” remaining in centimeters while “shape” has become unit-free, is serving a social, rather than a biological, purpose. The sociologist Bruno Latour argues in his Latour (1990) that ever since the invention of printed diagrams back in the sixteenth century one of their main advantages is the ease with which they can be combined with written text—turned into components of a book or a scientific article. Phelps (1932) may have been the first publication to do this explicitly when she experimented with diverse polygons of cranial landmarks (see her Figs. 2, 3, 4, 5, 6). Then another way of justifying the rescaling step of an ordinary Procrustes analysis is the purpose of printing images of all specimens to the same scale on the page, as in the diagrams of Medawar’s famous extrapolation of Thompson’s method (Medawar 1945) to embrace specific comparisons of growth stages.

A different attempt to avoid rescaling, Todd and Mark’s (1981a) method of “cardioidal strain,” explicitly defended its purpose as psychological. Its authors, they say, “have recently demonstrated that radial transformations of human facial profiles [such as their “cardioidal strain” model produces] are generally perceived as growth by naïve observers ...whereas other classes of transformation ...are almost never perceived as growth.” Ironically, I criticized the Todd-Mark article in the year of its publication for precisely that reason (Bookstein 1981)—the strain model there used only four coordinates of morphometric data out of the myriads afforded by the real human cranium; their “cardioidal” scheme was too simplistic to be of any analytic use. A reply by Todd and Mark, appearing in the same journal issue (Todd and Mark 1981b), retorted, in effect: let’s see you do better. The present essay might indeed be seen as a belated constructive response to the mainly destructive critique presented in Bookstein (1981).

Assume that if “size” is to predict anything, the reification chosen for that construct should be the one that predicts the data as well as possible in a least-squares sense. But that is, according to the singular-value decomposition (an elementary theorem), a characterization of the first principal component of the data, the very formulation being diagrammed here as superimposed over the average form in Fig. 4b and later analogues. Then the step in the Procrustes method that divides by Centroid Size is incompatible with the actual reporting of an allometric dependency except when the allometry that is the intended subject of discussion is in fact totally absent. Likewise, for authors who, like Sneath or Corruccini, are recommending a least-squares method such as trend-surface analysis, why is the role of size taken as a divisor rather than a regressor? After all, in the principal-component approach, the second principal component (I will have more to say about this anon) is an analysis of residuals of the measured data after the effect of the first principal component is accounted for by ordinary linear least-squares; but a division by size is inconsistent with any least-squares residualization as examined at any later stage by principal components, analysis of variance, or anything else. Bookstein (2014; 2018) has shown how to approximate division by size using an approximate linearization, regression-out of the differential of the explicit formula for Centroid Size (see Appendix A.2). But this does not matter for a methodology in which whatever “residual size-shape correlation” remains (the phrase is from Corruccini 1981) is adjusted out by linear regression of the shape coordinates on Centroid Size after that same size correlation has already been divided out in the course of the overall standardization. Evidently, in this approach size has been corrected for twice, once by division and again by regression. Such a procedure makes no sense.

Fig. 6
figure 6

Net assessment of the implications of the size standardization omitted by the Boas algorithm for the explanation of patterns of the resulting registered configurations. Top row: residuals from the first principal component of the Procrustes shape coordinates (left) versus those omitting the standardization by Centroid Size. Second row: the same after the uniform component has been partialled out from both systems, hence the descriptions “nonaffine nonallometric” of their panel labels (a truncation of “nonaffine nonallometric shape”). These two plots are indistinguishable

How can we assess the net import of this double division? Consider the pair of comparisons in Fig. 6. In the top row, the scatterplot on the left is of residuals from Procrustes RW1—of course, Centroid Size has already been divided out from these coordinates—whereas the scatterplot on the right is of the residuals from the Boas coordinates’ PC1, without any division. Thus the righthand scatter has been residualized from only one regression (removal of PC1), whereas that on the left is the result of two separate standardizations, of which the first was, without any justification, carried out by division instead of by regression. In the lower row are the same scatters after a further standardization, this one removing the uniform component of variation (combination of shear and dilation, the transformations that leave all parallel lines parallel). The coordinates on the left are those of the standard Procrustes toolkit. Those on the right have been standardized case by case instead, by explicit fit of the Boas coordinate configuration on the coordinates of the average form separately in \(x-\) and \(y-\) directions, hence, a fit that incorporates a standardization of the uniform component (including its aspect of scale) for each of the Cartesian coordinates separately, but by least squares, not by division.Footnote 6 But these plots are almost exactly the same (except for scale). In other words, a three-step procedure—dividing by Centroid Size, then regressing out Procrustes RW1, then regressing out the Procrustes uniform term—is equivalent to the two-step procedure of regressing out Boas PC1 and then the Boas uniform term. Thus the original division by Centroid Size is irrelevant to the analysis of growth allometry for these data—it adds no insights beyond those that would have been available already by analysis of the Boas coordinates, which were only centered and oriented, not also rescaled. I return to this topic, the trade-off between geometrical notation and biometrical insight, in Sect. 4.

The approach to growth analysis recommended in this paper, already demonstrated in Fig. 4b, is least-squares in the original Boas sense, namely, with respect to the centered, reoriented coordinates of the original Cartesian data in their original metric units. It is important to keep in mind that this common center at (0, 0) is purely an algebraic artifact—I am not aware of any convincing method for validating this locus or any other “biological” allometric center by purely geometrical considerations. The only serious article I am aware of on this theme is Moss et al. (1983), which I reprogrammed for Fig. 5.55 of Bookstein (2018). Moss et al. discovered that the problem of finding a clear “center” of allometric growth of the human skull, meaning, a point around which variation of displacements was maximally radial rather than angular, had no good computational solution, and Bookstein (2018) concurred. (The next year Moss et al. (1984) replaced the centering scheme with a less constrained protocol, the “allometric network,” that fell much closer to the original Huxleyan scheme of multiple interlaced power laws formulated in terms of interlandmark distances.) In another irony, Bookstein (1983) had already demonstrated that for any quadrangle of landmark locations in two forms there is an entire curve of exact candidates for the allometric center, a curve whose equation is cubic in the Cartesian coordinates, and for any two pentagons there is still, in general, a discrete set of exact solutions. The interpretation of diagrams such as that in Fig. 4b should thus be limited to aspects that can be put into words without explicit reference to that center.

These descriptions include interlandmark segments that, drawn as linear loci, pass close to this fictional center, and also segments between adjacent landmarks of a boundary, for which divergence of the vectors in Fig. 4b can be interpreted circumferentially (as will be exemplified in Sect. 3). Interpretations like these are consistent with biological explanations in terms of interstitial growth, directional growth within the tissues in-between the landmarks. They have less a-priori cogency in applications to phylogeny, where such explanations have no corresponding explanandum—after all, nothing “grows” from Pan to Homo, whereas Vilmann’s rodent neurocranial octagons did indeed grow from age 7 days to age 150 days. Hence the analysis in Fig. 4b should not be interpreted as if it mapped growth over the interior of the head—there is nothing biological about that center point; it is simply the superimposed centroid of all the given landmark configurations. The next section returns to this theme via experiments with truncation of landmark lists in the pursuit of more and more convincing Boas growth analyses.

Figure 7 helps us explore the implications of the new method for this specific data set of rodent neuroanatomical octagons. Figure 7a compares the two different estimates of “size” arising from our two different GMM protocols, Centroid Size (horizontal) versus BPC1 score (vertical), as applied to Vilmann’s rat octagons. These two measures correlate nearly 0.999, and both are in units of length, but only the Boas version incorporates (via its loadings) the actual effect profile of size as an allometric factor, the presumptive deliverable of a growth allometry analysis. Although the two size variables correlate nearly perfectly, they have radically different formulas. One of them, Centroid Size, has a formula that pays no attention to actual sample variation—it is merely a multiple of the root-mean-square of interlandmark distances case by case—whereas the other candidate, PC1 of the Boas coordinates (BPC1), is calibrated to the actual covariance structure of the Boas coordinates, thereby capturing much of the effect of growth allometry that was the focus of Vilmann’s actual experimental study. These coefficients are displayed explicitly in the dashed lines of Fig. 4b, each one an (xy) pair for the corresponding pair of loadings in the actual formula for this PC1. (The drop in this size-size correlation from this figure to some of those in the next section will be part of the argument that the data there need subsetting prior to any study of growth allometry per se.)

Continuing, Fig. 7b, c show the vertical components of the two panels in Fig. 3 as thin-plate splines of their own. The grids agree regarding the posterior shear of the roof of the calva with respect to the cranial base, but only the Boas version also shows the nonlinear change in proportions along the cranial base and the contrast of SOS–ISS and Brg–Lam growth rates. Still, because division by Centroid Size is so nearly equivalent to a projection, the scatter at upper left in Fig. 6, Procrustes shape residualized on RW1, is very nearly the same as the scatter below it, which is further residualized on the uniform term. Note (Fig. 1) that this is not merely because growth is “nearly uniform”—in both analyses there is clearly a strong nonuniform (nonaffine) component that combines a growth-gradient, a multiple of partial warp 1, with a focal effect between Opi and IPP (see e.g. Bookstein 2018, Figs. 5.84, 5.98). Analyses in which PC1 is to be interpreted as “general size” clearly need to be of the Boas coordinates, not the Procrustes coordinates from which size (in the sense of Centroid Size) has already been divided out.

Fig. 7
figure 7

Additional aspects of the Vilmann analysis. a Scatter of the alternative size measures, Centroid Size (divisor in the Procrustes method) versus principal component 1 of the Boas coordinates. b, c Thin-plate splines of the second components in the scatters of Fig. 3 of which the first components were already displayed in Figs. 1 and 2

The more cogent defense of the appearance of Centroid Size in the geometric morphometric context pertains to its formula as a score, not its use as a divisor. As Bookstein (1986) showed heuristically and then Dryden and Mardia (1998, 2016) showed analytically, one particularly symmetric family of statistical models for the Cartesian data (the “isotropic Mardia–Dryden,” independent circular normal variation at every landmark around some fixed template), which is spherically Gaussian in its space, yields an almost exactly spherical distribution of the Procrustes (but not the Boas) coordinates in their space. On this model, furthermore, the covariance of Centroid Size with every shape measure (that is, with every direction in this shape or form space), in the limit of indefinitely large samples with infinitesimally small shape variance, is exactly zero. This means that on the assumption of a wholly isotropic distribution of landmark coordinates around a shared template, one can examine an allometric relationship for “statistical significance”—for a correlation of landmark proportions with size—by calculating the dependence of Centroid Size on the list of all the shape coordinates (in practice, its multiple correlation).

The extension to using this score as a divisor may arise from simple graphical convenience (the desirability of being able to represent configurations being compared in panels of approximately the same size on the printed page, regardless of details of their shape) or from the fact that the formal linearization of the Procrustes procedure, the J-matrix notation of Appendix A.2, shows that for size to be used to augment a list of shape coordinates in a way preserving the spherical symmetry of the composite, it is necessary to replace Centroid Size by its logarithm, the differential of which is scaled as the reciprocal of size per se around its own sample mean. With this transformation, in the limit of small variation, the variability of Centroid Size is commensurate with that of any other explicit dimension of shape variability if formulated in a linear notation at unit Procrustes length under the same Mardia–Dryden model. But the symmetry of this model makes it essentially valueless for biometrical applications, where in virtually all contexts the distribution of landmark Cartesian coordinates is not symmetrical in that pervasively spherical way (Bookstein 2016). Then the justification in terms of division does not apply to the data likely to arise in any biological application; in any event, the formulation of the Procrustes algorithm, with its culminating division by size, came many years before the Mardia–Dryden formalism.

Let me summarize the argument to this point. The standard Procrustes procedure (for which see the standard GMM textbooks) incorporates a step, the normalization of geometric scale using the formula for Centroid Size, that appears to be at once (i) biologically meaningless, (ii) geometrically unnecessary, and (iii) incompatible with analyses of growth allometry. I have reviewed a variety of published methodologies of GMM that omit this step, and others that include it; but none of these earlier resources have discussed this crucial decision, even to the extent of a single rhetorical clause. For instance, Boas didn’t divide, but Sneath (1967) did, without saying why; and Corruccini copies Boas’s method but “normalizes by Sneath’s method,” again without ever hinting at a reason for doing it this way. My suggestion in this article, the replacement of the standard Procrustes approach to growth allometry by one using the first principal component of Boas coordinates, cannot be ruled out of order simply on the basis of anything in the existing literature (more than a century!) of morphometrics. In short, it deserves a try.

The Vilmann data set may not be a representative example of allometric studies—after all, it represents the closest that that competent mammalian experimentalist could come to a repeated measurement of the selfsame growth process. A design nearly as effective is the “longitudinal growth study” that is traditional for the best human craniofacial work (cf. Riolo et al. 1974). But most studies of allometry, following Gould (1966), instead involve “cross-sectional” samples, observations of multiple organisms at one single time. Paleobiological studies, in particular, are necessarily of this design. For the resulting inferences to have any validity it is appropriate to ask to what extent the analysis of such a design, in which every organism’s ontogeny is calibrated to its own environment, matches the analysis of any representative individual had it been the sole specimen under study.

By virtue of its perfectly balanced longitudinal design the Vilmann data set permits us to examine this question explicitly for the subsample of 18 animals with complete data that this paper has relied on. Figure 8 replicates the analysis of Fig. 4b separately for each of the 18 rodents in the Vilmann study. For ease of viewing at this reduced scale, the lines that were dashed in Fig. 4b here are thickened instead. Animal-to-animal variability here, while detectable by careful scrutiny, is evidently insubstantial. The apparent invariance of growth trajectories in Fig. 8 is across the full age span of the animals imaged. (For a finer-grained representation, age 7 days to age 14 days only, see Fig. 7.20 of Bookstein 2014.) In other words, the centric allometry we are seeing is not an artifact of averaging over diverse individuals, nor is it an artifact of the strength of the overall correlation of size with shape; instead it is a characteristic of the growth trajectories separately. (In cladistic terms, it would be called a character in its own right.) The invariance in this figure thus suggests a mastery of the experimental setting completely in keeping with the idealizations of one philosophy of biology, the founding of morphological explanations in actual morphogenetic data under the tightest possible experimental control (Vilmann’s animals were “close-bred”). An analogous study in human children would involve careful serial imaging of multiple sets of monozygotic twins (replacing experimental control of embryonic conditions by explicit identity of genotypes). But I am aware of no such data set in the craniofacial literature.

Fig. 8
figure 8

Replication of the centric allometry analysis, Fig. 4b, for each of the 18 experimental animals individually. The thick lines replicate, for each animal separately, the same vectors that were represented by dashed lines in Fig. 4b. The separate panels are indistinguishable except by tiny details of these growth trajectories

An Example from Anthropoid Phylogeny

The degree of experimental control of animal material demonstrated by this last analysis of the Vilmann rodents is far from typical for the sort of data sets one encounters in metazoan evolutionary biology. It is much more usual to be dealing with data that is heterogeneous as regards ontogeny, phylogeny, or both. This section offers a provisional guide through a centric allometric analysis for contexts like that. The key principle will turn out to be subsetting both of the landmark data facets at once: not only the specimen list but also the landmark list. The ultimate purpose of such a workflow is analogous to the purpose of random-walk modeling in paleobiology (see, e.g., Bookstein 2012): to extract a particularly simple model the residuals from which have a seemingly unpromising structure not worthy of further investigation in any detail, and then to speculate on why the living form has aligned with this particular model. In our context of geometric morphometrics, such a desideratum, arrival at a representation of residuals that justify ceasing to analyze further, is typified by the lower row of diagrams in Fig. 6. Such diagrams resemble the corresponding diagrams of the (overly symmetric) isotropic Mardia–Dryden distribution involving independent, identically distributed circular noise at each landmark of a common template (see Bookstein 2018, Fig. 5.37 or Dryden and Mardia 2016, Sects. 11.1.1–11.1.2). Whether there is a relatively weaker residual factor structure in such scatters considered multivariately is not as relevant as their greatly restricted variance compared to the variances by landmark which which they began. That is, the point will be the replacement of the variation in Fig. 2a by the variation in the lower row of Fig. 6, further investigation of which in the data set at hand would not be a fruitful use of the biologist’s time.

An exemplary “realistic” demonstration begins, then, with the customary announcement of a template for the anatomical characterizations of the landmarks that compose it. We already saw one of these in the diagram at the top of Fig. 1. The present Fig. 9, modified from Bookstein (2018), serves this function here: a combination of caricatures of the landmark points in situ with their Latin names and terse operational definitions. I cannot reconstruct any explanation of how this particular sample of 29 forms came to be delineated back in its original version of 2003—it is not a representative sample of any population of museum-accessible samples, available hominids, anthropoid taxa, or any other convenient rubric. Nevertheless it ends up serving a very useful pedagogic purpose as workbench for a demonstration of how to simplify anatomically complex extended landmark configurations by a combination of specimen sequestration and landmark sequestration.

Fig. 9
figure 9

Template for the data sets of up to 20 landmarks analyzed in Fig. 10 through Fig. 15. The data are sketched here as they might trace a typical form of the sample (in the original study, Bookstein et al. 2003, these were synthetic midsagittal images derived from CT scans). Landmark names and operational definitions: Alv, alveolare, inferior tip of the bony septum between the two maxillary central incisors; ANS, anterior nasal spine, tip of the spina nasalis anterior; Bas, basion, midsagittal point on the anterior margin of the foramen magnum; BrE, BrI, external and internal bregma, outermost and innermost intersections of sagittal and lambdoidal sutures; CaO, canalis opticus intersection, intersection point of a chord connecting the two canalis opticus landmarks with the midsagittal plane; CrG, crista galli, point at the posterior base of the crista galli; FCe, foramen caecum, anterior margin of foramen caecum in the midsagittal plane; FoI, fossa incisiva, midsagittal point on the posterior margin of the fossa incisiva; Gla, glabella, most anterior point of the frontal in the midsagittal; InE, InI, external and internal inion, most prominent projections of the occipital bone in the midsagittal; LaE, LaI, external and internal lambda, outermost and innermost intersections of sagittal and lambdoidal sutures; Nas, nasion, highest point on the nasal bones in the midsagittal plane; Opi, opisthion, midsagittal point on the posterior margin of the foramen magnum; PNS, posterior nasal spine, most posterior point of the spina nasalis; Rhi, rhinion, lowest point of the internasal suture in the midsagittal plane; Sel, sella turcica, top of dorsum sellae; Vmr, vomer, sphenobasilar suture in the midsagittal plane. Dotted curves sketch the alignment of these anatomical boundaries between the landmarks; once standardized, the dots turned into semilandmarks in a later version of this same data set explained in Sect. 5.5.4 of Bookstein (2018)

From the most up-to-date prior analysis of these 29 20-landmark configurations, represented in my two textbooks (Bookstein 2014 and Bookstein 2018), it is helpful to extract the summary panels in Fig. 10. As this figure will be a model for the five that follow it, it is worth taking some space to explain its panels separately. The twenty landmarks are those of our template, Fig. 9. The specimens comprise 17 adult modern H. sapiens, 4 juvenile sapiens aged 2, 4, 4, and 10 years, one archaic specimen (the one conventionally known as “Mladeč”), four H. neanderthalis (Atapuerca, Kabwe, Guattari, and Petralona), the single australopithecus specimen STS5 (“Mrs. Ples,” although its sex is now considered uncertain), and one each of each sex of Pan troglodytes. The corresponding Procrustes scatter, Fig. 10a, shows considerable heterogeneity (though not to an extent beyond those of other paleoanthropological data sets in my experience), especially at Brg and Alv.

The first summary graphic in the conventional workflow for such a data analysis, Fig. 10b, is a projection of the first two relative warp scores of these 29 specimens (their principal coordinates with respect to Procrustes distance, square root of the summed squared landmark shifts in Fig. 10a). It is evident that this sample will not submit to any holistic multivariate investigation if this is how it looks in a preliminary exploratory analysis. In this RW1–RW2 plane the adult Homo sapiens form one cluster falling in-between a sparser cluster for the neanderthals and a separate string of three loci for the juveniles. Oblique to this collineation and more extended than the contrast between juvenile sapiens and neanderthals is a jet of points in order of their phenetic distance from Homo: first, the australopithecus; then the female chimpanzee; and finally the male chimpanzee, apparently a victim of his own hypermorphosis. As the Bookstein textbook narratives have already argued, such a plot indicates that the pooled analysis per se will not have any chance of making sense: the assumptions permitting an inspection of the RW’s as coherent descriptive composites are, without exception, contradicted.

This inference is extended by the panels in the center of this figure. Panel (c), upper center, is the scatter of Boas coordinates this paper is recommending replace the Procrustes coordinates. In this Boas scatter, the larger forms are at greater distance from (0, 0) than the smaller forms, leading to an elongation of the distributions for some of the peripheral landmarks. The effect of this nonscaling is much clearer in panel (d), lower center: the replacement of the apparently directional RW1 of panel (b) by the pair of components here, nearly equal in explained variance, and for which the implicit dimension of growth (contrast of the juvenile H. sapiens with the adults) lies in a direction nearly along the second principal component rather than the first, where it would be expected to fall were this a legitimate growth study at this stage. In keeping with the panel to its left, the neanderthals still bracket the adult sapiens at one extreme, with the juveniles bracketing it at the other. But, in contrast to panel (b), the two chimpanzees now agree in their separation from the sapiens (while the male still deviates from the female in a direction that would again generally be taken to connote hypermorphosis). The grid for BPC1, panel (f), is shown here only because it will be in this position in each of the next five figures. But for this particular data set, 29 forms by 20 landmarks, BPC1 is irrelevant to any explication of growth allometry. One sees that pretty clearly as well in panel (e), the diagram that would show centric allometry if this data set behaved anything like the Vilmann data in Fig. 4b. Unfortunately, the net impression here is of something entirely different, a sort of spiderlike squashing whereby as many of the extreme landmarks shift obliquely to their radii as along it. The centric model, in short, is worthless for this sample. If we needed a numerical diagnosis of the problem, it is sufficient to compute the correlation between Centroid Size (the Procrustes scaling factor) and the first Boas principal component score, which is, as we explained above, the optimal explanation of all the landmark positions in a least-squares sense. This correlation is 0.849, versus 0.9986 for the analysis of the Vilmann data in Figs. 1 and 2.

Fig. 10
figure 10

Key frames of a dataflow of specimens digitized using the Fig. 9 template and then analyzed by the method of centric allometry described in this article. This analysis was of all 29 specimens using all 20 landmarks. Panels, here and in the subsequent five figures: a scatter of Procrustes shape coordinates; b scatter of Procrustes RW1 against RW2 scores (the first two principal coordinates of the summed squared differences of the coordinates in a); c scatter of Boas coordinates; d scatter of Boas PC1 against PC2 scores (first two principal coordinates of c); e centric allometry diagram for Boas PC1; f thin-plate spline for a visually suitable multiple of Boas PC1. For the relation between principal components and principal coordinates, see Bookstein (2018)

Fig. 11
figure 11

Key frames of an analysis of 26 specimens, 20 landmarks as in Fig. 10

The display in Fig. 10b mandates that we discard the three outlying specimens—the three non-Homo specimens—from the sample, leaving the 18 adult sapiens (including Mladeč), the four juveniles, and the four neanderthals. The corresponding six-panel display, Fig. 11, now looks entirely different, not so much regarding its coordinate scatterplots (panels a and c) as in its ordination plots by interspecimen distance (panels b and d). Both panels continue to show the alignment of the two four-specimen subsamples (juveniles at one end, neanderthals at the other) to either side of the central cluster of 18 H. sapiens, but now, with the other genera omitted, this axis determines the first principal component of the data quite precisely. And furthermore its precision is much greater in the Boas analysis than the Procrustes analysis, inasmuch as a strong correlate of this growth scaling, namely, Centroid Size, had been divided out in the course of the Procrustes analysis, greatly weakening its influence on the variance of the first principal component. The correlation of CS and BPC1 score has now jumped to 0.981. (But it will crawl even higher before we are done.)

Panel (f), the first Boas principal component as a grid, is familiar from decades of thin-plate-spline-rendered anthropometry. The neural skull shows negative allometry with respect to the facial skull, and while that neural skull deformation is close to isotropic, growth in the maxilla is highly directional (the trait usually called prognathism). The centric allometry diagram for this data subset, panel (e), no longer looks like a squashed spider with legs splayed every which way, but instead shows radial dilation throughout most of the maxillary landmarks. The alignment of the PC at Bregma, though, is azimuthal, not radial, and the discrepancy between the implied displacements at inner and outer inions is likewise incompatible with the underlying centric intuition. Finally, something looks perplexing at FCe.

So now it is reasonable to subset the landmark list, not the specimen list, by deleting the two Bregmas, the two Inions, and, because they are on the suture joining the bones on which this pair of structures lies, the paired Lambdas. (It is possible that none of these are actually valid landmarks. Bregma, in particular, is known to sometimes arise as a wormian bone sliding along the dura over the course of early development. An extension of the centric method that would allow such points to slide along outline curves to a position of maximum consistency with the behavior of the remaining landmarks is imaginable but falls outside the scope of this paper, as a special case of a deeper question: whether the hominid neurocranium can be persuasively analyzed by landmark-based methods at all.)

Fig. 12
figure 12

Analysis of 26 specimens, 14 landmarks as in Fig. 10. Note that with the deletion of six landmarks the orientation of the mean form in a, c, e, and f has rotated by about \(30^\circ \) on the page

The resulting analysis, Fig. 12, shows considerable progress. Limited to maxillary structures and the cranial base, the Boas ordination (panel d) is now even more consistent with a single dominant factor for allometry, while the landmarks of the centric allometry diagram, panel (e), now conform except at FCe and CrG. The corresponding thin-plate spline, panel (f), looks just like the homologous sector of the more extended grid in Fig. 11. The correlation between CS and the BPC1 score is now 0.994.

Neanderthals were a different species from us, and, furthermore, the present sample includes no neanderthal juveniles. (Research out of the Max Planck Leipzig may have obviated this problem, someday permitting a centric-allometric analysis of a mixed sample of the two species of Homo in a balanced design. See Gunz et al. 2010 or Neubauer et al. 2018.) So it is reasonable to delete the neanderthals from this experimental data set at this stage. There results the analysis in Fig. 13, a sample now reduced to 22 specimens on the same 14 landmarks.

Fig. 13
figure 13

Analysis of 22 specimens, 14 landmarks as in Fig. 10

The scatters in panels (a) and (c) are much tamer now than they were in Fig. 12—the neanderthals are, after all, a different species—but this improvement in the taxonomy has had the unfortunate effect of completely destroying the interpretability of Procrustes RW1, panel (b). (This problem is typical, of course, of situations in which one applies a principal component analysis to a mixture of multiple groups: see Bookstein (2018), Fig. 4.3. The juveniles now comprise one corner of a plot on two axes of nearly equal variance, with the remaining 18 sapiens distributed over both of the opposite edges.) The corresponding Boas ordination, however, has no such difficulty: the three youngest juveniles fall at one end of an obvious principal component whose other end combines the Mladeč specimen with two of the largest modern specimens. The centric plot, panel (e), remains fully radial except at FCe and CrG, and the grid diagram, panel (f), is likewise unchanged in showing hypertrophy of the maxilla in a primarily anterior direction (meaning, anatomical anterior, which is rotated a bit downward of the printed horizontal in this Procrustes pose). And the correlation between Centroid Size and the BPC1 score has risen just a bit, to 0.995.

The last two centric descriptions have both included the phrase “except at FCe and CrG.” It is time to delete these two landmarks as well, thereby reducing our analysis to the maxilla along with the lower border of the cranial base (only). Anatomically, we have left the brain case; the analysis is now wholly limited to a combination of the cranial base and the face.

Fig. 14
figure 14

Analysis of 22 specimens, 12 landmarks as in Fig. 10

This adjustment (Fig. 14) clarifies the nature of the centric model without substantially altering any aspect of the actual arithmetic. The Procrustes ordination, panel (b), is still unusable, while the Boas scatter is barely changed from that of the previous analysis on 14 landmarks instead of 12. The centric plot, panel (e), is now wholly radial, with a corresponding surge in that CS–BPC1 correlation to the satisfyingly high value of 0.997. The thin-plate depiction of the growth factor, panel (f), is likewise unchanged between the panels except in the immediate vicinity of where those two deleted landmarks had been.

This would normally constitute the end of the data subsetting cascade. But for pedagogic purposes it is helpful to reverse the penultimate decision, the deletion of those four token neanderthals, to see how we are doing, at least in respect of the form of this species as adults. There results one final tableau of panels, Fig. 15.

Fig. 15
figure 15

Concluding analysis of 26 specimens, 12 landmarks as in Fig. 10

With the neanderthals restored to the sample, the Procrustes scatter in panel (b) is not quite as unpleasant as it was in Figs. 13 and 14. The extra weight of those four additional specimens has the effect of rotating RW1 back onto the orientation it had in Fig. 12, with the sapiens juveniles and the neanderthals bracketing a big cluster of adult sapiens along the abscissa. But now the corresponding Boas plot, frame (d), is exactly what we would wish to see: the same description, but with the dominant axis of variation now very precisely specified with respect to the best alternative (the second principal component). The thin-plate diagram, panel (f), continues to clearly show the contrast between cranial base size and a maxillary size enhanced in the anatomically anterior direction, while the centric allometry plot, panel (e), remains reassuringly radial almost everywhere, consistent with this panel in the preceding figure. The correlation between Centroid Size and the first Boas principal component score is almost unaffected, at 0.996, by this extension to an additional species (however poorly sampled).

For purposes of summarizing the power of the explanatory growth factor at which we have arrived it is worth examining one final display (Fig. 16) that shows all the variation remaining after we have regressed out BPC1 of Fig. 15 along with the uniform component of these forms (as explained in Sect. 2, Fig. 6 and footnote 6). Over the remaining twelve landmarks, these residual scatters look roughly circular and of comparable variance landmark by landmark. They are not independently distributed, but any noncircularity they might display is much less than that of the original Boas coordinates, Fig. 10c, nearly 80% of the variance of which they have exhausted by this pair of algorithmically rigorous least-squares maneuvers.

Fig. 16
figure 16

Residual variation after removing both BPC1 allometry and the affine terms for the 26 12-gons in Fig. 15. The meaning of “nonaffine nonallometric” [shape] is as in Fig. 6

The example here should not be considered as dealing with any sort of “modularity.” A landmark subconfiguration whose growth changes can be summarized by one single centric allometry does not thereby attain the status of any sort of developmental “module”—the explanation of this association could instead be relevance to a common function, biochemical pathway, or environmental constraint. Vilmann’s purpose, like that of Melvin Moss (who first introduced the world of craniometrics to these data when he digitized them from Vilmann’s original cephalograms), was to found an analogous methodology for the assessment of interventions on growth, interventions that might be clinical (as in orthodontics) or instead genetic (as in research into craniofacial growth of knockout animal models by Hallgrimsson and others; for an example in this connection where size per se is part of the experimental signal, see Schwarze et al. 2020). In any of these contexts, to demonstrate a centric allometry like that in Fig. 4b or Fig. 15e is to reify only one single explanatory factor, physical growth (expansion of tissues), rather than any more complex sort of explanation dealing with embryonic structure, biochemical pathways, functional constraint, genomics, or natural selection.

The dramaturgy here wields the centric model to find a single simple-component subset of a data set, instead of taking the data set as a whole and attempting to model it in toto. I will trace one origin of this research theme, the centrality of environmental determinants of animal development within the single taxon, in the next section. The analysis here mentions but does not investigate the obliqueness of the two allometries in Fig. 10d, for instance, that for the growth of the H. sapiens specimens (which turns out to be essentially the same as the static allometry between the two species of Homo) versus that for the sexual dimorphism of Pan. As Fig. 15 shows, that taxon need not be limited to one single species; but the explanations the method affords do indeed have to deal with one single factor, namely, growth allometry. In the title of Gould’s great 1966 article, “Allometry and size in ontogeny and phylogeny,” it appears that there is a much deeper connection between allometry and ontogeny than between allometry and phylogeny, at least when allometry is treated at this level of morphometric detail.

Discussion

I do not grant any priority as regards landmark-based allometric studies to the most celebrated of the early 20th-century approaches, even though it is customarily taken to have launched the method of grids exploited in several of our figures. I mean, of course, the method of “Cartesian transformations” originally published in 1917 by D’Arcy Thompson. That is because I believe Thompson’s method to have ultimately proved a dead end rather than a stimulus for the actual development of GMM or any other extant branch of biometrics. (I elaborated on this point in Sect. 5.1 of Bookstein 2018.) In terms of any intellectual history relevant to today’s preferred techniques, the next contribution after Boas’s of 1905 would be the publications of the Vienna Vivarium, the experimental biology laboratory located at the edge of the great Prater park at some distance from the actual university. Its underappreciated effort to quantify developmental biology was recalled to our attention in the same year as Thompson’s centenary via a retrospective symposium and then the proceedings volume edited by its chairman (Müller 2017). The Vivarium not only carried out strikingly original, technically masterful research on the environmental determinants of developmental variation and regeneration (Coen 2006) but also produced many publications in domains we would now classify under the rubric of theoretical biology as that term was interpreted at the time (today’s equivalent is probably better taken as “philosophy of biology”).

A particularly clarion statement of their programme can be found in Chapter 2, “Organometrie,” of Hans Przibram’s short pamphlet Aufbau mathematischer Biologie (“The Structure of Mathematical Biology”) of 1923. Przibram argues that organismal form must be regarded as the summation of a morphogenesis that is continuous in both space and time, as observed over varying experimental conditions. But because growth is interstitial, we must measure likewise interstitially (that is, inside the organism’s boundary) prior to any attempt to summarize via external measurements such as relations among landmarks. The argument continues (in my modern rephrasing) to emphasize that investigations must begin with studies of growth trajectories before they pass on to examine ecophenotypy. And comparisons of species must likewise be considered dynamically, so as to accommodate differences in their developmental conditions, “although these data do not yet afford of a mathematical treatment.” In short, organismal form is an epiphenomenon of developmental physiology—a more succinct opposite of the Sokal–Sneath focus on taxonomic concerns cannot be imagined.

For any proponent of a methodology such as Przibram’s, Thompson’s method of Cartesian transformations cannot be viewed as helpful. In Przibram’s critique of 1923, Thompson’s method, which he claims (correctly, in my opinion) essentially reduces to comparisons of positions, orientations, and proportions among organs, is purely inductive, and does not conduce to explanations. Indeed, he notes, Thompson gives us no insight into how an organism made up of diverse organs could submit to analysis via a holistic assessment in the first place, let alone change into another species while remaining an integrated, functioning entity. Przibram concludes this frustratingly brief chapter with an insight astonishingly ahead of its time:

Thompson’s holistic deformations can be made comprehensible if we can visualize a space lattice upon the living form, so as to assess how each little piece changes its shape under conditions that vary by species. Here lies open a rich, nearly undeveloped field that invites a mathematization, one whose erection we hope will begin very soon. At the conclusion, measurement of the physical space grid will have been unified with stereochemical organometry. (Przibram 1923:14, my translation)

Until the day when observation of gridded tissue all over the developing organism has become a technically feasible mode of primary data collection, Przibram is arguing, Thompson’s error is fundamental: those little grid square distortions cannot share any biological reality with the empirically identifiable homologous points at which the data, such as they were, made contact with geometry. Sneath, writing almost half a century later, was careful not to make this error. To him, the grids are for purely descriptive taxonomic purposes, in effect, tools of visual psychology rather than biometry—none of his examples deal with organismal growth. Whereas, Przibram’s life work (until it was terminated prematurely in Theresienstadt, in 1944) was the erection of an early version of what is now called “evo-devo” in which those little grid cells were indeed the explicit subject of experimental data collection. In my current view the American anthropologist Franz Boas is more appropriately considered the ultimate progenitor of geometric morphometrics than the British natural philosopher and classicist D’Arcy Thompson; and had Przibram lived into the age of the technology he was imagining, founders’ credit might well have been shared with him.

What is meant by the “data” in this context needs to be consistent, at least conceptually, with the Przibram experimental setting: the matching of locations between successive images of the same organism, or the implication of such a match via the theoretically fundamental notation of biological homology. I have discussed this in other publications (e.g. Bookstein 2019) that need not be excerpted here. The concept of matching is baked into the formalism of landmark points, the information modality for which this method of centric allometry (along with other applications of both the Boas and the Procrustes methodologies here) was designed. As I explained long ago, in the earliest of treatises on geometric morphometric and the associated thin-plate splines,

landmarks are quite a bit more than instructions for measurement; they are the places at which one’s explanations of biological processes ought to be grounded. (Bookstein 1991, p. 17)

This single requirement—that correspondences across time or across specimens be interpretable as statements encoding a-priori biological theory—distinguishes most of today’s GMM praxis from the more industrialized approaches optimized for processing verbose or complex image data sets, particularly in three or four dimensions, that lack a-priori textbook-referenced annotation. For instance, the deformatica computer package (Durrleman et al. 2014) from the INRIA Aramis project team parametrizes its deformations not by landmarks but by “the most variable parts” of their templates. Such “control points” do not logically substitute for landmarks—their locations do not convey prior biological knowledge, but instead are put forward as an a-posteriori computational substitute for such knowledge. Hence, I would argue, their variability is not necessarily tied to meaningfulness in any developmental or evolutionary context, nor would it conduce to descriptions in terms of growth-gradients or the further exploration of the spatial or genomic spectrum of variation in these patterns. The goals of deformatica and similar packages, then, must be considered rather more clinical-medical than biological. They align better with group discrimination or classification—with the purposes of machine vision, in other words—than with the goal of understanding how normal development works globally via the coordination or compensation of processes at different scales in different regions of the form (cf. Mitteroecker et al. 2020). It is hard to imagine how their (admittedly elegant) discrete differential geometry could be matched to the languages whereby patterns of organismal growth come to make biological sense.Footnote 7

To explain one particular aspect of the Boas maneuver we must turn to the semiotics of actual statistical notation. For landmark data that are close to their “template” (in effect, the sample mean) in both size and shape, the Procrustes maneuver is equivalent to treating the landmark configurations as one single data matrix of cases by coordinates and then projecting out all four of the following composite variables: the mean x-coordinate, the mean y-coordinate, size (taken as Centroid Size), and orientation. Here size and orientation are treated only via their linearizations, which display an unexpected symmetry when plotted as little vectors of displacement at the means of the landmark coordinates: they fall perpendicularly upon the form. The standard Procrustes approach projects out both of these, thereby preserving a certain formal algebraic symmetry of the resulting residual distributions but at a considerable cost, namely, disqualification of the language of least-squares in centimeters. The Boas standardization preserves that least-squares language at the cost of losing access to this mathematical symmetry. In the presence of size allometry, the linearization that I notated in my “J-matrix approach” (Bookstein 2014, 2018 and Appendix A.2 here) will not extend with any confidence over more than a fraction of an order of magnitude as regards geometric scale. But it is difficult to claim statistical validity for GMM studies of broader taxonomic regimes anyway. (In my view any validly linearized “morphometrics of macroevolution” is highly unlikely.)

The trade-off sketched here is thus in practice an empirical issue, not an a-priori one. Does the prior literature on the natural history of the taxon under examination suggest that the dimension of scale is dominant enough in its importance that it must not be manipulated, but instead left for explanation? For topics like growth allometry, the answer apparently should be “Yes.” The treatment of size and orientation as somehow equipotent on purely geometrical grounds is unnatural for the class of natural-science applications with which analyses of the evolution of development are concerned. Advanced mathematizations of spaces of very high dimension but very weak theory seem now to making great strides in connection with the emergence of large-scale DNA sequencing resources and other megavariate contexts; but that is not the context of applications at which the present methodology is aimed. Even if images can be followed over megavoxels in individual instances of growth, it is extremely difficult to persuasively match more than a few dozen parameters across individual organisms (cf. Przibram 1922, tables). As the embryologist Paul Weiss (a Vienna-born student of Przibram’s who successfully fled the Nazis) noted in 1956, “Identical twins are much more similar than any microscopic sections from corresponding sites you can lay through either of them” (Gerard 1958, p. 140). Stated more formally, an appropriately controlled experiment requires, in particular, appropriate control of the level of detail at which data are captured.

In closing I would highlight a contrast between the purpose of centric allometry analyses like these two exemplars and the more commonly encountered task of “modeling a covariance structure,” in this instance the covariances of the shape coordinates in Figs. 2a or 15a. The purpose of the analyses here is the visualization of the hypothesis of growth allometry per se, augmented perhaps by the standardization for uniform change that does not alter descriptions of local features of change (cf. Bookstein 2015). The point is not “goodness of fit” but explanation, and the explanation of growth in Fig. 2, in terms of original Cartesian coordinates and least-squares in millimeters, is far more persuasive than that of Fig. 1, in “Procrustes units.” The issue of an overall eigenstructure, as in methods of factor analysis (Bookstein 2017), is oblique to that specific task. Put another way, the manipulations here do not change the significance level of the claim of unidimensionality for views like those in Figs. 4b or 15. One can check for sphericity of residuals from the PC1 model, for instance, and the asymptotic chisquare statistics are virtually identical (and highly significant, too, in view of the curvilinearity obvious in either Figs. 1a or 2a).

Hence I am not claiming that there is no further structure to be had in this data set, only that neither of our two exemplary samples (18 rodents, or 26 hominids) would be a promising testbed upon which to pursue it. Of the total variance of the Boas coordinates in Fig. 15c, 79% is explained by the variables controlled in Fig. 16. One of these, the affine term, is a completely a-priori function of the mean, a standardization not for size alone but also for any nonlocal aspect of shape change; while the other, the first principal component of the Boas coordinates, clearly dominates already according to Fig. 15d. The residual, 21%, is not worth trying to explain in a sample so arbitrary and unbalanced as this one. By comparison, in the corresponding summary (Fig. 6) for Vilmann’s well-designed experimental data set, the fraction of total Boas variance explained by BPC1 along with those affine terms is 97.6%, leaving only 2.4% to be attributed to the remaining (uncontrolled) individual variation. The next experiment might well be along Przibram’s principles, perhaps investigating how diverse contrasts of experimental conditions affect the growth allometries here. Although these residual fractions of variation—2.4% versus 21%—are enormously different, the algebra of the centric allometry model as fitted is the same. This suggests perhaps a wider domain of application than just the mammalian craniofacial complex: for instance, analysis of the growth of analogous hard-tissue complexes bordering soft-tissue contents under constraint by functional stresses and strains. It might also apply to strongly allometric series of forms observed over phylogeny rather than ontogeny, such as the series of vestibular superpositions recommended by Delattre and Fenart more than half a century ago.

Klingenberg’s (2016) review of “concepts of allometry in geometric morphometrics” eschews algebraic notation along with any of the associated equations. In spite of these circumlocutions he concludes that the poles of the fundamental bifurcation with which the present paper is concerned, between the methods that treat size and shape as logically separate properties versus the methods where this distinction is dropped, are “logically compatible and should provide results that are comparable in empirical studies” (p. 132). The two examples in this essay clearly contradict that optimistic generalization. Yes, “logically” the Procrustes approach and the Boas approach exploit the same data base of information (the same original Cartesian coordinate arrays) and are interconvertible by purely algebraic tactics, as demonstrated, for instance, in Fig. 5. Yet our community’s papers rely on Procrustes-based morphometrics not for its logic (which I have critiqued strenuously in several earlier venues, e.g. Bookstein 2016, 2018) but for its arithmetic, specifically, the geometry of shape coordinates along with the optimal low-dimensional algebraic summaries of their covariance structures. Although Centroid Size and Boas PC1 correlate 0.999 for the rat growth data (Fig. 6a), the decision to divide by Centroid Size resulted in radically different sets of vectors landmark by landmark across the form (Fig. 4) and substantially different ordinations by their scores (Fig. 3). And as Figs. 10 and 11 show, Boas analyses can make sense of principal component scatters that wholly fail to make allometric sense in a Procrustes context. In both the rat growth example and the hominid static allometry example here, the two styles of analysis diverge to a startling extent in their implications for understanding organismal form and its development. (For more on the logic of growth gradients and their pattern analysis see Bookstein (2019).) Note that neither formula, RW1 or BPC1, is aligned with Centroid Size (see Appendix A.3).

More generally, I would argue, the optimism of Klingenberg’s 2016 paper is not supported by any explicit formulas for the conversion of either type of linear multivariate finding from one class of methods to the other. Instead his argument may serve mainly ironically: a reason for intentionally elevating explicit algebraic details of an allometric analysis method to a position of priority over verbal summaries in the actual text of empirical reports. It should not be up to the researcher to “prefer one way of thinking or the other,” the way Klingenberg phrases the matter on pp. 132–133. No, the choice is not merely a subjective choice of cognitive style. Rather, it demands a match between the formalisms of an algebraic/stochastic model and the rhetorical structure of whatever morphogenetic theory is ostensibly supported (or refuted) by the analysis at hand. And for analysis of allometry, that must be the match between the formula for size and the explanatory role of size, as demonstrated in this essay. Analyses of growth allometry need to rely on measures of size attuned to the allometric explanation ultimately arrived at, and Centroid Size does not align with any such explanation beyond the obvious formulaic null.

Hence the Procrustes construction for landmark data has unreasonably displaced the boundary between concepts of “size” and concepts of “shape” from where it should have been were these notions to apply cogently in a growth study. As the captions of Figs. 6 and 16 indicate, the term “shape” continues to be useful as part of the exegesis for diagrams that follow a prior sequestration of information appropriately referred to as “size.” But the requirement that such a “shape” descriptor be produced by some equation relying on the division symbol \(\div \) must be replaced by a far more sophisticated language aligned with the underlying purpose of biological explanation. By enforcing an arithmetical division by Centroid Size prior to any further statistical analysis, today’s Procrustes-based geometric morphometrics has drifted far from a productive articulation with the biology, biomechanics, and histology of organismal growth and form. To resuscitate that alignment may mandate reversing that size standardization step in spite of its premature ubiquity as part of the GMM toolkit. Boas coordinates are an elementary alternative to that standard GMM, no more difficult than Procrustes shape coordinates to produce and manipulate and compatible with thin-plate splines and other useful representational tools of the toolkit. The Procrustes method construes “shape” as a set of theoretically incoherent proportions—a set of quotients of centered, rotated Cartesian coordinates by a biologically inappropriate measure of “size.” This biotheoretical blunder has hobbled geometric morphometrics for more than 25 years. To circumvent it by a maneuver as simple as foregoing that division would help restore the centrality of allometric developmental models to the morphometric analyses that would, in turn, reinvigorate the evolutionary biology of organismal form.