Super Resolution Digital Image Correlation (SR-DIC): an Alternative to Image Stitching at High Magnifications

Hansen, R. S.; Waldram, D. W.; Thai, T. Q.; Berke, R. B.

doi:10.1007/s11340-021-00729-2

Super Resolution Digital Image Correlation (SR-DIC): an Alternative to Image Stitching at High Magnifications

Research paper
Open access
Published: 13 May 2021

Volume 61, pages 1351–1368, (2021)
Cite this article

Download PDF

You have full access to this open access article

Experimental Mechanics Aims and scope Submit manuscript

Super Resolution Digital Image Correlation (SR-DIC): an Alternative to Image Stitching at High Magnifications

Download PDF

R. S. Hansen¹,
D. W. Waldram¹,
T. Q. Thai^1,2 &
…
R. B. Berke ORCID: orcid.org/0000-0003-0612-8665¹

4708 Accesses
8 Citations
Explore all metrics

Abstract

Background

High-resolution Digital Image Correlation (DIC) measurements have previously been produced by stitching of neighboring images, which often requires short working distances. Separately, the image processing community has developed super resolution (SR) imaging techniques, which improve resolution by combining multiple overlapping images.

Objective

This work investigates the novel pairing of super resolution with digital image correlation, as an alternative method to produce high-resolution full-field strain measurements.

Methods

First, an image reconstruction test is performed, comparing the ability of three previously published SR algorithms to replicate a high-resolution image. Second, an applied translation is compared against DIC measurement using both low- and super-resolution images. Third, a ring sample is mechanically deformed and DIC strain measurements from low- and super-resolution images are compared.

Results

SR measurements show improvements compared to low-resolution images, although they do not perfectly replicate the high-resolution image. SR-DIC demonstrates reduced error and improved confidence in measuring rigid body translation when compared to low resolution alternatives, and it also shows improvement in spatial resolution for strain measurements of ring deformation.

Conclusions

Super resolution imaging can be effectively paired with Digital Image Correlation, offering improved spatial resolution, reduced error, and increased measurement confidence.

Effects of Various Shape Functions and Subset Size in Local Deformation Measurements Using DIC

Article 30 June 2015

Augmented Lagrangian Digital Image Correlation

Article 06 December 2018

DIC Challenge 2.0: Developing Images and Guidelines for Evaluating Accuracy and Resolution of 2D Analyses

Article 04 January 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Digital Image Correlation (DIC) is a non-contacting technique used to examine localized strain across a material’s surface [1]. By comparing images of a sample before, during, and after load application, DIC can calculate surface deformation and strain at any point which is visible to cameras. This method allows for measurements to be taken at a variety of length scales without a loss of quality in the measurement for different physical scales [2].

Although many physical length scales can be used, the resolution of the images used to compute deformation remains a limiting factor for DIC measurements. DIC can detect sub-pixel magnitudes of displacement [3], but the measurements are computed using subsets of pixels, thus averaging each ‘pointwise’ displacement measurement over the area of each subset. In addition, strains are computed from multiple subset displacements, averaging the strain measurement over an even larger area [4] and causing undesirable spatial ‘smoothing’ of measurements [5]. Due to the number of pixels necessary for a single strain data point, the physical size of the subsets used has a direct impact on the spatial smoothing of the measurement. To maximize the benefits of the full-field measurements from DIC compared to single averaged measurements from strain gauges, higher resolution images allow for smaller physical subset sizes which produce each “pointwise” measurement. This is especially true when it is necessary to perform DIC over small regions of interest.

To address the need for improved spatial resolution, some researchers have improved image resolution by removing a sample from an experiment’s controlled environment to take DIC images under an optical microscope before and after deformation [6]. Similarly, even higher-resolution DIC images have been captured with a scanning electron microscope (SEM) [7, 8]. Such microscopy techniques enable measurements at the nanometer length scale, improving the spatial resolution. However, this added resolution in a single image often comes with a reduced field of view. To overcome the issue of small fields of view, images with adjacent fields of view can be stitched together to create one single image. At the optical scale, researchers have stitched together multiple images from optical microscopes in order to compare DIC results with grain microstructure [9, 10]. Stitching has also been demonstrated at the SEM scale, [11, 12], allowing even higher resolutions. The result is higher spatial resolution, with DIC subset covering smaller physical areas, while capturing large fields of view.

However, such high-resolution, large field-of-view methods face two key limitations. First, microscopy-based experiments must either be (a) conducted under operating conditions that are survivable by the imaging equipment or (b) performed ex-situ, removing the sample from the experiment’s environment. At room temperature, specialized SEM setups have been developed that allow in-situ images [13,14,15], whereas high-temperature applications have traditionally utilized ex-situ measurements [16]. The second limitation stems from the working distance. Previous experiments have been performed using microscopes that have very short working distances, making stitching ill-suited for environmental chambers and long-range optics. High-magnification imaging with low-resolution cameras in specially-fitted has facilitated in-situ DIC of small fields of view at elevated temperatures, but requires a specially fitted optical microscope heating stage [17]. Similarly, long-range optics have allowed for high-resolution DIC measurements in environmental chambers, but without accommodation for large fields of view [18]. However, the challenge remains that high-magnification in-situ measurements with full fields of view are limited by working distance of the optics.

As an alternate approach, Super Resolution (SR) imaging techniques may produce images of sufficiently high resolution while maintaining larger fields of view and long working distances. Super resolution is a post-processing technique to combine multiple overlapping low resolution (LR) images to produce a high resolution (HR) image in a region common to all images [19]. Image stitching and SR are compared schematically in Fig. 1. While both processes are shown to yield similarly increased resolution, image stitching is often done ex situ (or occasionally under an environmental microscope), whereas SR could theoretically improve the process by allowing the LR images to be taken in situ and at longer working distances (although the HR images still require post-processing).

This super resolution post processing of images has been a growing field over the last several decades. The basic principles were initially studied as early as 1974 as a mathematical method to improve images beyond resolutions otherwise limited by the diffraction limit of light, and reduce effects of blurring in images [20]. In 1984, Tsai and Huang first applied some of these principles for creating higher-resolution images from multiple frames [21]. Early applications ranged from improving resolution of emission spectra images in biochemistry [22] to overcoming the quality-reducing effect of atmospheric turbulence in telescopes [23]. As potential applications for super resolution imaging surfaced, computing ability also improved, leading to several advances in SR taking place in recent years. Emerging applications in other fields include retinal imaging [24], other telepathology [25], and improved smartphone cameras [26]. In the case of smartphones, SR techniques have allowed camera resolution to approach that of traditional cameras such as DSLRs by overcoming current sensor, pixel, aperture, and other hardware limitations. Some variations of super-resolution processes use the techniques to improve a single low-resolution or blurry image [27], compared to traditional applications which use multiple images with overlapping fields of view to improve accuracy [28]. Among the newest advances in super-resolution are the utilization of machine learning, comparing known low- and high-resolution image pairs to train super-resolution algorithms [29], demonstrating the expanding set of potential applications.

While principles of super resolution have been solving a variety of problems for years, SR has yet to be applied in the field of experimental mechanics. This paper demonstrates the potential application of super resolution imaging to improve high-magnification DIC measurements using open-access SR software. The three techniques featured in this software and examined in this research are Robust Super Resolution [30], the Papoulis Gerchberg method [19, 31], and Structure Adaptive Normalized Convolution [32], the merits of which are discussed in the theory section below. The algorithms are evaluated for their effectiveness to perform DIC: first qualitatively, by reconstructing a sample image and comparing the quality of each visually; then quantitatively, by rigid body displacement and deformation measurements. The quantitative SR tests are then compared with unprocessed LR results to demonstrate the improvements in high-magnification DIC due to the SR resolution.

Theory

There are two common processes which are essential to all super resolution (SR) algorithms: 1) identifying position of each low resolution (LR) image with respect to one common high resolution (HR) reference grid and 2) projecting LR pixels onto the grid [19]. Figure 2(a) shows schematically a set of 4 LR pixels on a small section of the HR grid. The LR pixels are numbered 1–4, while the HR pixels are lettered A-D. Because each LR image has some displacement Δx and Δy with respect to the HR grid, the SR algorithm must determine which LR pixels will influence a given HR pixel. In Fig. 2(a), an HR pixel is shown to be influenced by up to four LR pixels from just a single image. For example, HR pixel A is entirely contained within LR pixel 1; HR pixel B lies partially within LR pixels 1–2; while HR pixel D lies partially within all four LR pixels 1–4. Because there are often several LR images input into a SR algorithm, many LR pixels are used to create a single HR pixel. Figure 2(b) shows schematically the same 4 h pixels projected onto a second LR grid with pixels numbered 5–8. For example, pixel A is entirely contained within pixels 1 and 5 and would thus weight both pixels evenly; while pixel B has larger proportions within pixels 2 and 5 than within pixels 1 and 6, and would thus weight pixels 2 and 5 more heavily. The specific processes of these two steps, as well as a description of the algorithms used to accomplish these steps, are described below.

Step 1: Position Registration Between LR and HR

The first challenge in super-resolution computing is placing multiple LR images onto a common grid. To take advantage of information from multiple images, the images must necessarily be unique from each other. This is often due to some small translational displacement between the camera position in the capture of the image [33], but it can also be caused by lens distortion or other deformations caused by the camera and lens system [34]. To reconcile all images, these camera displacements between a chosen first image and subsequent images must be estimated with sub-pixel accuracy. Those relative displacements then allow the position of each image to be registered on the common grid or framework for the SR image.

There are several algorithms to accomplish this step of position registration. Early efforts used a transformation to the frequency domain, where translations in the horizontal and vertical direction can be estimated by frequency phase shifts [21, 35]. Such methods assume global motion occurs for the entire field of view, which is a potential disadvantage for images whose subjects undergo non-uniform deformation [36]. Several additional techniques have been implemented to improve their performance of such methods. These include extracting rotation information from the phase shifts [35], as well as avoiding aliasing by using low-frequency parts of the image [37]. A more recent algorithm, proposed by Vandewalle et al. [38] uses this Fourier transform on the image to identify translation on subsequent images when compared with an initial image. This algorithm combines the robustness against aliasing from frequency filtering with the ability to capture rotations as well as linear displacements from the phase shifts and amplitudes.

Other methods remain in the image spatial domain, rather than the frequency domain [39]. One of the foundational spatial domain algorithms was developed by Keren [40]. It utilizes Taylor expansions to estimate planar motion between images, based on the parameters of rotation and vertical and horizontal shifts. The algorithm then seeks to minimize the error of the approximation, solving a set of linear equations to find the shift and rotation parameters. This is an iterative method, adding the parameter solutions to the system of linear equations and resolving until it converges sufficiently. In order to cut down on computation time, the algorithm uses a ‘Gaussian pyramid’ scheme, which focuses first on a coarse down-sampled image, and then a progressively finer down-sampled image until the full image is used. Other spatial domain methods have been developed which can account for other motion models such as segmented and temporal motion [41]. Algorithms have also been developed which estimate the rotation first, then correct the rotation before estimating spatial shifts [42].

Both the spatial and frequency-based registration algorithms return rotation and shift parameters. These shifts, with sub-pixel accuracy, allow all LR images to be placed on a common reference grid. Once these shifts have been estimated, the pixel information from the LR images can then be used to construct the SR image [38]. Of the methods discussed, Keren’s is used through the rest of this paper.

Step 2: Combining Multiple LR Images into a Single SR Image

After positioning all LR images on common coordinates, the overlapping information from the LR pixel sets must be processed and combined into a single SR pixel set. Several algorithms have been developed to accomplish this reconstruction. They have some common features, yet they also vary in complexity. A comparison of the features of four algorithms is included in Table 1. They are fairly representative of SR capabilities and provide a framework with which to analyze the application of SR imaging for DIC measurements.

Table 1 Comparison of SR reconstruction algorithm features

Full size table

One of the earliest SR techniques to create a high-resolution grid from projected low-resolution pixels is Iterated Back Projection (IBP). The goal of IBP is to construct a SR image that, when deconstructed into LR images, best reproduces the original LR set [43]. The SR image is obtained iteratively from an initial guess featuring a grid of SR pixels with the same resolution and placement as the desired SR outcome. After each iteration, the SR image is deconstructed by averaging groups of SR pixels together based on (1) the size of SR pixels with respect to a LR pixel, (2) the diffraction pattern of a single point, transmitted to the image plane on the sensor (the point spread function)[44], and (3) the distance between each SR pixel and the LR pixel being influenced. A simple example, without consideration of the point spread function, is demonstrated in Fig. 2(b): HR pixel A lies entirely within LR pixel 1 and thus is weighted entirely in the average; whereas HR pixel D is only ¼ within LR pixel 1 and is thus weighted by ¼. Once deconstruction of a HR estimate is complete, the original and deconstructed LR images are compared to update the HR result as informed by considerations (1)-(3). This process is iterated until the simulated LR images converge with the original LR images within an acceptable error.

One of the shortcomings of the IBP method is oversensitivity to noise. In the algorithm, when the normalized average of the LR pixels is taken, there is no significant mechanism to address issues of noise. To respond to this, the Robust Super-Resolution (RSR) algorithm uses a median estimator, rather than an average [30]. This makes the algorithm more robust against noise outliers. The result is an algorithm that builds on IBP by addressing the significant drawback of high sensitivity to motion blur or high noise. Because RSR is itself a direct improvement upon IBP, only RSR is considered through the rest of this paper.

Similar to IBP and RSR, the Papoulis-Gerchberg (PG) algorithm works through iteration [19]. For its initial guess, any SR pixel which lies entirely within one LR pixel is given the same value as the LR pixel. Any SR pixels which span multiple LR pixels are initially set equal to zero [31]. After known values are assigned, extrapolation between known pixel values is performed using signal processing techniques developed by Papoulis and Gerchberg [45]. This extrapolation is an iterative process of alternate projections and begins by transforming the image signal from the spatial to the frequency domain. The spectral signal goes through a low-pass filter, and the signal is then transformed back to the spatial domain. This new, extrapolated signal is then added to the original known signal, and the transformation and filtering is iterated. Each iteration reduces the mean square error of the extrapolation, and eventually the iterations will converge. The result is a noise-reduced SR image.

Finally, Structure Adaptive Normalized Convolution (SANC) is a response to the need to pick up underlying directional textures in the image, such as lines and curves. SANC works by assuming that LR images are blurred by a Gaussian convolution [32], meaning that a pixel is assumed to be influenced by pixels which lie close to it. Many SR algorithms (including IBP) assume Gaussian blur, which they refer to as a point spread function [4]. SANC is unique, however, because it considers image structure when assuming a Gaussian blur and it accounts for signal certainty. Image structure is considered for every pixel in the use of a gradient structure tensor. The gradient structure tensor determines if a pixel lies along a line in the image. Normalized averaging similar to IBP is performed, but it is improved by using information from the gradient structure tensor and accounting for signal certainty in a similar way to RSR. SANC uses both methods to limit the effect of noise and accurately predict shape structure when performing SR.

Methods

To assess the usefulness of SR computing in improving spatial resolution in DIC, three of the algorithms are used in several separate tests: RSR, PG, and SANC. In the first test, a qualitative comparison is performed in which an image is downsampled and then reconstructed using the SR algorithms, to demonstrate the advantages and disadvantages of each algorithm. The second test evaluates the pairing of SR and DIC in comparing applied and DIC-measured rigid body translation of a patterned specimen. The third test consists of a mechanical test which produces a non-uniform strain field, to compare LR and HR images and their effectiveness in DIC strain measurements.

SR Algorithm Initial Comparison

An initial test determined the accuracy with which each SR algorithm could recreate an existing HR image. A test image, shown in Fig. 3, was chosen which contains 3 important features: a) familiar shapes, b) straight edges, and c) areas of repetitive texture. This highlights the ability of each SR algorithm to reproduce those features. These abilities help to inform the practicability of using these algorithms for DIC-based strain measurements.

Using super resolution imaging software developed and made publicly available by Vandewalle et al. [38], the HR test image is deconstructed into nine overlapping LR images. First, nine copies of the HR image are created, shifting all but the first by random x and y displacements, ranging from -4 to 4 h pixels in 0.125 h sub-pixel increments. Next, each of the 9 h images are converted to LR images, downsampling by averaging each 2 × 2 group of HR pixels into a single larger LR pixel. In pixels which shift beyond the edge of the initial image, pixels outside the edge of the shifted region of interest retain their original values. Since each LR pixel summarizes data from 4 h pixels, the overall size of the LR image is reduced to a quarter of the HR image. The 9 LR images are then run through each of the chosen SR algorithms, using the software from Vandewalle, with an interpolation factor of 2, meaning that each dimension of the image is increased by a factor of 2. Thus, each LR pixel covers the same physical area as a 2 × 2 set of SR pixels. The result of the test is one SR image for each algorithm which could be compared to the HR image of the same resolution and field of view. This process is depicted in Fig. 4. Visual inspection rather than numerical interpretation is used for comparison, as is widely done in comparing SR algorithms [46].

Rigid Body Translation Test

As a first introduction of SR imaging in DIC measurements, SR images were used as input images to measure a known translation. First, a micrometer-driven translation stage was positioned vertically as shown in Fig. 5, holding a small T-316 stainless steel ring sample, with outer diameter of 12.7 mm and wall thickness of 1.2 mm. A speckle pattern was applied to the ring with black paint on a white background. A Basler 15 MP camera was attached to a second vertically positioned translation stage, allowing controlled offsets of the image to produce overlapping LR fields of view. These stages were separated to allow a 290 mm working distance between the end of a 25 mm lens and the ring sample, representative of distance requirements for viewing through an environmental chamber. The specimen was illuminated by Cole-Parmer fiber optic lights. The camera’s field of view, including the speckled ring, is shown in the right of the figure.

A series of images were then taken of the specimen as summarized in Table 2. After focusing the lens on the ring sample, 9 reference images were taken at differing camera positions, followed by 9 noise images. The camera position varied from ± 0.0254 mm in both the vertical and horizontal directions and was centered about zero. The ring sample was then translated 0.127 mm in the vertical direction, and 9 images were taken in the same manner. This process was repeated up to a final ring sample translation of 0.762 mm. Prior to super resolution post-processing, each LR image was cropped to the same size (1500 × 1644 pixels) to still capture the ring along with applied translation, while reducing computation time. The result was a set of images for each algorithm which included a single image at every translation of the ring.

Table 2 Image capture scheme for Rigid Body Translation test, showing LR images taken and SR images processed at each displacement

Full size table

For each set of 9 LR images, the same SR software used in the initial comparison test was used to produce 3 SR images. For all 3 SR images, the step 1 image registration was again performed using the Keren registration algorithm. Step 2 was then performed using the Robust Super Resolution (RSR), Papoulis-Gerchberg (PG), and Structure-Adapted Normalized Convolution (SANC) algorithms, respectively, with an interpolation value of 2. The SR images produced by each of the RSR, PG, and SANC algorithms were then imported into VIC-2D [47], a commercial DIC algorithm which is widely used in the experimental mechanics community. Correlation was performed using a subset size of 49 pixels and a step size of 5 pixels to obtain full field displacements. For comparison, a set of images consisting of one LR image from each displacement was also imported into VIC-2D. A subset size of 25 pixels and step size of 3 pixels was used for the LR measurement, such that a comparable physical area in mm would be represented by each LR and SR subset. The displacements were then plotted against the known applied displacements to assess how closely each of the SR methods can reproduce a known translation, and to compare SR-DIC results to traditional LR-DIC measurements.

As a comparison tool, two additional images sets were produced: One LR image set made by averaging the nine LR images to combat noise (referred to as LR Average), and one HR image set which expands the single LR Average image by a factor of 2 through cubic interpolation (referred to as HR Interpolation). To produce the LR Average image set, the LR images were first shifted by the x and y offsets found with the Step 1 Keren algorithm in order to register on a common grid, then averaged together. Expanding each LR Average image by a factor of 2 through bicubic interpolation, then, provides a benchmark to compare the RSR, PG, and SANC algorithms against. Both the LR Average and HR Interpolation image sets were prepared and imported into VIC-2D for DIC measurement.

After preparing displacement data, further analysis on the accuracy of the measurement was performed, investigating the effect of subset size on the spatial standard deviation of the displacement measurement, as subset size has a significant impact on the correlation accuracy [48]. For each algorithm, the analysis was first performed at the subset sizes described in the methods Sect. (49 for SR, 25 for LR). Smaller subset sizes were investigated, moving down in increments of 4 pixels for SR (45, 41…) and of 2 pixels for LR (23, 21…) to maintain similar physical subset sizes. This reduction in subset size continued until the images no longer correlated, thus exploring the lower limit of subset size for each algorithm. Similarly, subset sizes larger than the sizes of 49 and 25 were investigated in increments of 8 pixels for SR (57, 65…) and 4 pixels for LR (29, 33…) up to a size of 97 or 49 pixels. For every subset size, the same step size was maintained (5 for SR, 3 for LR) to preserve a similar number of total subsets.

Mechanical Deformation Test

To study the full implementation or SR imaging into DIC strain measurements, SR techniques were then used in a mechanical deformation test. The same ring specimen from the rigid body translation test was placed in a Gleeble 1500D load frame with an environmental chamber. The same camera and lens as before were aimed at the specimen through the chamber viewing window from a working distance of 330 mm. In addition, a Qioptic Optem Fusion zoom lens was used with a second Basler 15 MP camera and focused on a portion of the ring. The variable magnification of the zoom lens was adjusted to produce a field of view roughly 15 times smaller than the LR lens in order to provide a more accurate ‘high resolution’ image with which to compare the super resolution images. Custom grips were designed to apply a tensile load on the inner surface of the ring, as shown in Fig. 6.

The grips were slowly moved apart until a small increase of force was registered by the load cell, indicating that both grips had come into contact with the inside face of the ring. From this zero-displacement location, a set of 9 images was then captured in succession, to be combined later to produce a single SR reference image. Several seconds passed between each of the 9 images to allow for small random offsets of the field of view caused by vibration of the load frame. Another set of 9 images was then captured to provide a noise measurement. The grips were then moved apart under displacement control in increments of 0.1 mm, causing a non-uniform strain distribution in the ring. At each displacement increment, another set of 9 images was captured. This process of grip displacement followed by image capture continued until a final net grip displacement of 1.4 mm was achieved.

Upon completion of the mechanical deformation, each set of 9 LR images was combined using the three SR algorithms. Then, the sets of SR images were imported in VIC-2D and correlated with a subset size of 49 and step size of 5. Similarly, one of the 9 LR images at each displacement was imported and correlated with a subset size of 25 and a step size of 3, allowing similar physical areas to be represented by each subset. The zoom lens images were also imported and correlated with a subset size of 151 and step size of 5. Strain maps were generated and compared for each of the SR methods and for the LR image set. A subset size-match confidence analysis was performed for the mechanical deformation test, following the same process used for the rigid body translation test.

Results

The comparison of the three chosen SR algorithms to LR imaging techniques is supported by the results of the three tests: The SR algorithm initial comparison, the rigid body translation test, and the mechanical deformation test. These results are summarized below.