Design and Parameter Optimization of Zero Position Code Considering Diffraction Based on Deep Learning Generative Adversarial Networks

Wang, Shengtong; Luo, Linbin; Li, Xinghui

doi:10.1007/s41871-023-00221-6

Original Article
Open access
Published: 05 February 2024

Volume 7, article number 2, (2024)

Nanomanufacturing and Metrology

Abstract

Absolute measurement has consistently been the primary focus in the development of precision linear and angular displacement measurements. The scheme design of binary zero position codes is an important factor for absolute measurement. Designing and optimizing high-bit zero position codes with over 100 bits poses considerable challenges. In addition, the working parameters of zero position codes [unit code width (b), distance (d), and yaw angle (α)] markedly affect their post-installation performance, particularly in absolute positioning and limit code applications in multi-degree-of-freedom measurement schemes. This study addresses these challenges by proposing a design method for zero position codes that considers diffraction and is based on generative adversarial networks, with the aim of designing and optimizing high-bit zero position codes with increased efficiency and accuracy. Additionally, the tolerance range of zero positioning performance for each working parameter is examined. By leveraging the adversarial network structure, this study generates optimized 150-bit codes and tests the zero position code against simulation results. The following working parameter ranges for code design are recommended on the basis of theoretical and experimental results: b greater than 10 μm, d within 1000 μm, α within 3490 μrad, and avoidance of intervals with sharp changes in the full width at half maximum. The proposed code design and parameter optimization lay a solid foundation for research and engineering applications in the absolute measurement field and have considerable potential for generalization and wide applicability.

Highlights

1. This article is the first to use deep learning adversarial networks to generate absolute positioning binary codes.
2. This article analyzes the influence of installation and unit code width parameters on positioning performance based on the principle of diffraction propagation and the positioning performance labels.
3. This article establishes a comprehensive model (ZD-GAN) for generating absolute positioning codes through deep learning. The model can select the best absolute positioning code and provide guidance for the optimal installation and usage parameter range.

1 Introduction

Precision measurement is a perennially important research direction in high-end manufacturing. It mainly uses two schemes, namely electrical and optical. First, the electrical scheme mainly includes eddy current [1,2,3,4], inductance [5], and capacitance [6] sensors. However, this approach is limited by nonlinearity and electromagnetic interference, particularly for large measurement ranges [7], and is therefore commonly used in applications with small measurement ranges.

Second, the optical scheme mainly consists of laser interferometers and optical encoders, each with its own advantages and disadvantages. Laser interferometers, which rely on the wavelength of light as their measurement reference [8], are susceptible to environmental influences and achieve high precision mainly in well-controlled laboratory environments [9], whereas industrial applications need a stable measurement approach. Optical encoders, which use the physical structure of gratings as their measurement reference, are only slightly affected by environmental factors, and their upper limit of precision can rival that of laser interferometers [10, 11]. Optical encoders are easy to integrate [12] and can be extended to multiple-degree-of-freedom (MDOF) measurements [13,14,15]. Thus, they have found extensive applications in high-precision machine tools and other manufacturing equipment [16]. The lithography machine manufacturer ASML has also adopted these devices in related applications, demonstrating the growing potential of optical encoders in precision manufacturing that surpasses even that of laser interferometers [17].

Absolute measurement has always been a main development direction for optical encoder systems. Related research mainly involves placing an absolute code (zero position mask and reference code) along the linear displacement direction, utilizing zero position pulses for absolute positioning [18, 19], and allowing for the straightforward measurement of absolute positions with a simple optical setup [20]. This binary code can first be applied in one dimension (X-axis) and reused along the Y-axis to enable two-dimensional (2D) positioning. This kind of code is more convenient to fabricate than other codes: it requires only short-line drawing rather than point-by-point processing, which greatly reduces its processing cost at the micron level. Yang et al. [21] and Li et al. [22] initially proposed algorithms based on traditional theories to efficiently reduce computation and design absolute codes. Later, Sáez-Landete et al. [23] utilized artificial intelligence (AI) algorithms using global optimization to design 2D binary codes with 10 × 10 pixels that are primarily applied for alignment in lithography [24]. A genetic algorithm was also used to achieve nanometer-level alignment in the form of a chessboard pattern [25]. Sáez-Landete et al. further designed one-dimensional (1D) codes based on the global optimization algorithm called dividing rectangles and optimized designs with 100 elements [26, 27]. However, these studies did not consider diffraction effects, which become significant for small unit code widths and large distances. To address this issue, Sáez-Landete et al. used a genetic algorithm to design 1D [28] and chessboard [29] absolute codes while considering diffraction. Chen and Wen proposed a cross-entropy (CE)-based method that further reduced the second maximum value by approximately 15% for chessboard patterns but did not consider diffraction [30]. Subsequently, Su et al. introduced a new algorithm called parametric minimum CE that reduced the second maximum value by 8.33%–22.22% [31]. Moreover, they treated the minimum diffraction effect and maximum amplitude as a combined optimization problem; their algorithm achieved a 16.12%–20.90% increase in the sum of the slope of the central peak and the effective signal amplitude of a 2D optical zero reference signal compared with a recently proposed CE method [32].

Increasing the number of bits in a binary zero positioning code can enhance throughput and pulse positioning accuracy. However, the complexity of code design grows exponentially with code length. For example, for a 150-bit binary code with 17 transparent bits, an exhaustive search would have to evaluate approximately 1.1 × 10²² possible arrangements. Although exhausting all the schemes is unrealistic, researchers have used AI to learn regularities in precision measurement [33,34,35]. For example, AI has helped researchers find the relationship between the angle of an object to be measured and the position of a light spot [36]. AI models are also used in signal processing [37,38,39,40,41]. The present work likewise uses AI to learn, from a limited coding dataset, rules that are difficult to establish with traditional methods, to generate new codes, and to use these codes for subsequent performance analysis.
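
The size of this search space can be checked with a binomial coefficient. The short sketch below counts the ways of placing 17 transparent bits among 150 positions; the evaluation rate used to convert the count into a run time is purely hypothetical and serves only to illustrate the scale.

```python
import math

n_bits = 150         # code length quoted in the text
n_transparent = 17   # number of transparent bits quoted in the text

# Number of ways to place 17 transparent bits among 150 positions.
n_arrangements = math.comb(n_bits, n_transparent)
print(f"C({n_bits}, {n_transparent}) = {n_arrangements:.2e}")

# Hypothetical evaluation rate, used only to illustrate the scale of the search.
evals_per_second = 1e6
years = n_arrangements / evals_per_second / (3600 * 24 * 365)
print(f"about {years:.1e} years at {evals_per_second:.0e} evaluations per second")
```

The count agrees with the order of magnitude stated above, which is why exhaustive enumeration is not a practical design route.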

Generative adversarial networks (GANs) are generative models that learn through two neural networks competing with each other. A GAN consists of a generator and a discriminator. The generator takes filtered database codes and labels as input, and its output needs to imitate the real samples in the training set as closely as possible. The input of the discriminator is either a real code and label or the output of the generator, and its purpose is to distinguish the output of the generator from the real samples. The generator and discriminator compete with each other and learn continuously. The ultimate goal is for the generator to learn a model of the underlying rules and generate new codes and labels, which are then input into the traditional diffraction propagation model to verify and analyze their performance. Therefore, this study chooses a GAN model to generate codes that meet the performance target. Compared with traditional algorithms, such as those in references [19, 20], this method can provide a richer variety of codes more quickly and is not limited to a single code scheme. Compared with the existing machine learning method based on the genetic algorithm [28, 29], this method trains faster and does not require a genetic algorithm to be designed manually.

Furthermore, previous related studies typically assumed ideal working conditions with fixed distances and thus did not analyze the performances of zero position codes in spatially varying working postures. Therefore, a set of posture intervals is urgently needed as a theoretical guide for code installation and an integrated absolute MDOF measurement scheme design. This work provides a method for code design and working parameter optimization. After the code is generated with the help of AI, specific working parameters are optimized and analyzed while still relying on the traditional diffraction propagation model. The use of AI allows this study to avoid the large amount of time and computing power required for code design and generation.

In summary, current traditional mathematical methods find optimizing designs for binary absolute zero position codes beyond 100 bits challenging. Additionally, the lack of performance analysis for spatially varying working postures leads to limitations in code installation and the design of MDOF measurement schemes. With the aim of addressing these issues, this study proposes a design scheme based on GANs for binary zero position codes considering diffraction (the first letters of “zero” and “diffraction” were selected to form the abbreviation ZD-GAN with GAN). For the first time, coordinated optimization analysis is conducted on the generated schemes for working parameters. The tolerance of the working range is analyzed, and the optimal working state is determined. This study provides valuable guidance for the design, installation, and integration of absolute MDOF measurement schemes by using absolute zero positioning binary codes.

2 Principle of the Zero Position Code Design

The principle of this study is mainly based on those of deep learning GAN and diffraction propagation. The diffraction propagation principle allows for the simulation and generation of a sufficient number of performance labels for corresponding codes, thereby serving as a foundational database for GAN. This approach enables rapid learning and the establishment of its own model.

2.1 Zero Positioning Code and Parameter Introduction

Previous binary code designs generally assumed that diffraction has no effect on the evaluation of generated codes, despite its influence on pulse positioning. Simple correlation algorithms were used to compute the variation in light intensity during the displacement between the mask and reference code. However, simulation and experimental studies have consistently shown that diffraction effects cannot be ignored for applications requiring high precision. Therefore, diffraction effects must be considered during code generation and evaluation. When designing binary codes, the main parameters to consider are the unit code width (b), the working distance (d), and the yaw angle (α).

As shown in Fig. 1a, two reference codes exist on the scale grating. Figure 1a shows the ideal pose of the reference code and mask. Figure 1b depicts the changes in the relative position of the mask and reference code. Figure 1c illustrates how the photodetector (PD) light intensity signal changes with this relative position. Taking 1D motion as an example, the laser emitted by the LD can be decomposed into sub-beams. Analyzing the propagation paths of the sub-beams reveals three types: 1, 2, and 3. Types 1 and 2 pass through the mask, whereas type 3 does not, and its proportion of the overall beam remains unchanged. Regarding the parameters, z is the distance between the reference code and mask at a given x coordinate (shown in Fig. 1b), and d is the distance from the rotation center of the reference code to the mask. As shown in Fig. 1b, the movement of the scale grating in the positive X direction is equivalent to the movement of the mask in the negative X direction. In state S1, the reference code and mask do not overlap, and only type 1 sub-beams are present. In state S2, some light transmission units of the reference code and mask coincide. At this point, some type 1 sub-beams become type 2 sub-beams and are transmitted out, so the energy detected by the PD decreases. In state S3, the reference code and mask completely overlap, and the number of type 2 sub-beams is the highest. The light intensity detected by the PD is then at its minimum, forming a negative pulse. This minimum appears if and only if the reference code and mask completely overlap, and it is defined as the pulse peak. States S4 and S5 mirror S2 and S1, respectively. After describing the light beam in terms of sub-beams, this paper uses the diffraction propagation model to analyze the pulse, which is shown in Fig. 1c.

Combining the reference code with the grating is a suitable approach in the design of the integrated absolute six-degree-of-freedom (6-DOF) measurement scheme. The combination method is shown in Fig. 1a. The scale grating of the whole structure is a 1D grating with a period of 1 µm, and the reference code is directly written on the grating surface. The unit code width exceeds 10 µm, and the diameter of the light spot is generally 2 mm, so the light spot covers numerous grating structures, and writing the reference code does not considerably influence the diffraction effect of the grating. The ratio of light-transmitting units should not be excessively large; otherwise, in the S3 state, a large amount of light is transmitted, and sufficient reflected and diffracted light will not form for subsequent 6-DOF detection. First, a sufficient interference signal intensity is necessary to achieve MDOF and absolute measurements simultaneously. Therefore, the transmittance ratio needs to be kept below 20% (for a 150-bit binary code, fewer than 30 light transmission slits) to avoid reducing the resolution of the interference signal. Under the same working conditions and without considering diffraction, a small unit code width (b) leads to high positioning accuracy. However, under practical conditions, a small unit code width results in strong diffraction effects, increasing uncertainties during actual operation. Hence, an appropriate unit code width ensures positioning accuracy while enhancing system stability and resistance to interference. Second, six spatial posture changes can occur between the zero position mask and the reference code on the scale grating. In-plane motion has no effect on encoding performance, whereas out-of-plane motion alters the working distance (d). In theory, a large working range of d is preferred. Lastly, the rotation angles include yaw (α), pitch (γ), and roll (β). In Fig. 2, the yellow axis is the axis of rotation; the yellow dot is the origin, which is the spot energy center; and rotations about any other axes and origins are non-center rotations. Figure 2a presents the state when the mask tilts, together with the pose parameters that exist during tilting and sub-beam types 4, 5, and 6. α is the tilt angle and is positive in the counterclockwise direction; the angle shown in Fig. 2a is negative. Figure 2b shows that the α rotation axis is along the Y-axis. Figure 2c illustrates that the β rotation axis is along the Z-axis. Figure 2d depicts that the γ rotation axis is along the X-axis.

Changes in α are complex (as shown in Fig. 2a) and affect the reflection direction of the reference code, resulting in different propagation distances for the leftmost and rightmost light rays. For light rays from the zero position mask, the unit code width (b) of the reference code area appears reduced due to inclination, leading to a reduced transmission width of the area and a reduced length of the reference code. The intensity distribution of the light reflected back to the mask is no longer one-to-one, causing the theoretically aligned position to slip. Light that passes through the mask then follows one of three paths: (1) it is reflected back and received by the PD, (2) it is reflected back by the reference code area and then blocked by the mask, or (3) it passes through the reference code. When γ changes about a rotation axis through the center of the light spot, the pulse signal is almost unaffected. However, if the rotation axis lies elsewhere, the change in γ can be regarded as a change in the working distance (d). As for the roll angle β, when the rotation axis passes through the center of the light spot, the zero position mask gradually loses transmission area, leading to the distortion of the pulse signal, the reduced intensity of the pulse peak, and even the loss of the pulse signal. Otherwise, the rotation can be converted into two in-plane motions. Given that β and γ have convertible effects on the pulse signal, this study mainly analyzes the influence of b, d, and α, which are collectively referred to as working parameters.

At different working parameters, the full width at half maximum (FWHM) of the pulse peak is denoted as W (in µm). The ratio of the secondary peak to the main pulse peak is the contrast K. In theory, a small W is desired to confine the zero position interval finely. A high K is preferred to locate the main pulse peak in the presence of disturbances clearly, thereby avoiding the incorrect selection of other pulse peak positions. In this study, K/W is used as a performance label to analyze and compare schemes. When the contrast K does not remarkably differ between two sets of codes and both sets of codes can distinctly and correctly determine the pulse peak position, the scheme with a small W is prioritized for absolute positioning, resulting in the high upper limits of positioning accuracy. The constraints on K and W represent the tolerance of the binary absolute zero position code to the working parameters, theoretically limiting its ultimate precision during practical operation. Therefore, thorough analysis and investigation are required to guide practical absolute measurement schemes.

2.2 Principle of Pulse Signals Considering Diffraction

Before the ZD-GAN network architecture can be built and trained, the original dataset must be generated. In this study, a random generation approach is adopted to create 150-bit codes with less than 20% transparent elements. The labels of these codes are then determined on the basis of diffraction propagation principles.
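
A minimal sketch of such a random generator is given below. The sampling details are assumptions, since the text states only the code length and the 20% limit: the number of transparent bits is drawn uniformly between 1 and the largest count strictly below 20%, and their positions are drawn without replacement. The function name `random_zero_code` is hypothetical.

```python
import random

def random_zero_code(n_bits=150, max_percent=20, rng=random):
    """Return a random binary code with strictly fewer than max_percent % ones.

    Assumption: the count of transparent bits is uniform on [1, max_ones];
    the source does not specify the sampling distribution.
    """
    # Largest integer k with k / n_bits strictly below max_percent / 100.
    max_ones = (n_bits * max_percent - 1) // 100
    n_ones = rng.randint(1, max_ones)
    code = [0] * n_bits
    for position in rng.sample(range(n_bits), n_ones):
        code[position] = 1          # 1 = transparent element, 0 = opaque
    return code


rng = random.Random(0)              # fixed seed so the sketch is repeatable
dataset = [random_zero_code(rng=rng) for _ in range(5)]
for code in dataset:
    print(sum(code), "transparent bits:", "".join(map(str, code)))
```

Each generated code would then be passed through the diffraction propagation model to obtain its label before being used for training.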

The structure of the mask and reference code can be represented by using the matrix a = [a₁, a₂,…, a_n], where n represents the number of code bits, a_i = 1 denotes a transparent element in the code, and a_i = 0 represents an opaque element. Each code bit has a unit code width of b. D is the total width of the code. On the basis of the optical path principle illustrated in Fig. 1a and under the assumption that α is 0 and the laser beam passes through the mask for the first time after the energy beam splitter, the optical field distribution (without considering diffraction effects) can be described as follows:

$$t(x) = \sum\limits_{i = 1}^{n} {a_{i} \times {\text{rect}}\left( {\frac{x - i \times b}{b}} \right)} ,\quad 0 \le x \le 3D,$$

(1)

where rect is the rectangular window function and x represents the horizontal coordinate of the grating in Fig. 1b.
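
Equation (1) can be sampled numerically as in the sketch below, which places a unit-height window of width b at x = i·b for every transparent bit. The function name `transmission`, the sampling grid, and the choice of treating the window edges as opaque are illustrative assumptions.

```python
import numpy as np

def transmission(code, b, x):
    """Sample Eq. (1): t(x) = sum_i a_i * rect((x - i*b) / b), i = 1..n.

    Assumption: rect(u) is 1 for |u| < 1/2 and 0 otherwise (edges opaque).
    """
    x = np.asarray(x, dtype=float)
    t = np.zeros_like(x)
    for i, a_i in enumerate(code, start=1):
        if a_i:
            t[np.abs(x - i * b) < b / 2] = 1.0
    return t


code = [1, 0, 1]                 # toy 3-bit code, for illustration only
b = 10.0                         # unit code width in µm
x = np.arange(0.0, 45.0, 1.0)    # 1 µm sampling grid
t = transmission(code, b, x)
print(t.astype(int))
```

With this convention, bit i is centered at x = i·b, so the first bit of the toy code covers the samples from 6 µm to 14 µm.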

In this study, the angular spectrum theory of light is used to simulate the diffraction effects of light rays. Therefore, the optical field distribution when the beam passes through the mask and reaches the scale grating can be described as follows:

$$I\left( {x,z} \right) = \left| {F^{ - 1} \left\{ {F\left\{ {t\left( x \right)} \right\} \cdot \exp \left( {{\text{i}}2\uppi z\sqrt {1/\lambda^{2} - v^{2} } } \right)} \right\}} \right|^{2} ,$$

(2)

where F{t(x)} represents the Fourier transform of the light intensity after passing through the mask, λ is the wavelength of the laser, v is the spatial frequency, and z is the distance between the mask and the reference code and varies with changes in the yaw angle.
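
The propagation step in Eq. (2) can be sketched with a fast Fourier transform as follows. The wavelength of 0.66 µm, the sampling interval, and the single-slit test field are hypothetical values chosen for illustration because the text does not state them here; the function name is also an assumption.

```python
import numpy as np

def angular_spectrum_intensity(field, dx, z, wavelength):
    """Eq. (2): |F^-1{ F{field} * exp(i*2*pi*z*sqrt(1/lambda^2 - v^2)) }|^2."""
    field = np.asarray(field, dtype=complex)
    v = np.fft.fftfreq(field.size, d=dx)              # spatial frequency, 1/µm
    # A complex square root makes evanescent components decay for z > 0.
    kz = np.sqrt((1.0 / wavelength**2 - v**2).astype(complex))
    transfer = np.exp(2j * np.pi * z * kz)
    return np.abs(np.fft.ifft(np.fft.fft(field) * transfer)) ** 2


dx = 1.0                          # sampling interval in µm (assumed)
wavelength = 0.66                 # µm, hypothetical value
t = np.zeros(1024)
t[507:517] = 1.0                  # a single 10 µm slit

I0 = angular_spectrum_intensity(t, dx, 0.0, wavelength)      # no propagation
Iz = angular_spectrum_intensity(t, dx, 500.0, wavelength)    # d = 500 µm
print("peak intensity at z = 0:", I0.max(), " at z = 500 µm:", Iz.max())
```

Because the FFT treats the field as periodic, a practical simulation would pad the field so that light spreading beyond the window does not wrap around; that step is omitted here for brevity.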

When illuminated by the laser beam, the reference code region on the grating modulates and reflects back the optical field. This reflection characteristic function can be written as

$$f(x) = \left\{ {\begin{array}{*{20}l} {1,} \hfill & {0 \le x < D} \hfill \\ {1 - t(x),} \hfill & {D \le x < 2D} \hfill \\ {1,} \hfill & {2D \le x < 3D} \hfill \\ \end{array} } \right..$$

(3)

When the reference code undergoes displacement along the length direction of the measurement grating, the intensity distribution of the reflected beam can be expressed as follows:

$$I_{f1} \left( {x_{0} ,z} \right) = I\left( {x,z} \right) \cdot f\left( {x + x_{0} } \right),\quad 0 \le x \le D,\quad 0 \le x_{0} \le 2D,$$

(4)

where x₀ represents the value of the relative displacement between the mask and the reference code.

Given a reference code area on the scale grating, the reflected light also shows diffraction effects. In this study, the diffraction effects are also analyzed by using angular spectrum theory. Thus, the optical field distribution illuminating the mask can be expressed as

$$I_{{f2}} (x_{0} ,z) = \left| {F^{ - 1} \left\{ {F\left\{ {I_{f1} (x_{0} ,z)} \right\} \cdot \exp \left( {{\text{i}}2\uppi z\sqrt {1/\lambda^{2} - v^{2} } } \right)} \right\}} \right|^{2} .$$

(5)

The final beam passes through the mask again and is then detected by the PD. The optical field is remodulated by the narrow slit of the mask, and its intensity distribution function can be expressed as

$$I_{m} (x_{0} ,z) = I_{f2} (x_{0} ,z) \cdot t(x).$$

(6)
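
Equations (2) to (6) can be chained into a displacement scan, sketched below under several assumptions. The code is discretized with a fixed number of samples per bit rather than with the window centers of Eq. (1); the reference region in Eq. (3) is taken as 1 − t evaluated relative to the start of the code region; the PD signal is taken as the sum of Eq. (6) over x; propagation is applied to the quantities exactly as the equations are written; and the wavelength is a hypothetical 0.66 µm. All function names are illustrative.

```python
import numpy as np

def propagate_intensity(u, dx, z, wavelength):
    """Eqs. (2)/(5): angular-spectrum propagation over z, returned as |.|^2."""
    pad = u.size // 2                        # zero padding against FFT wrap-around
    up = np.pad(np.asarray(u, dtype=complex), pad)
    v = np.fft.fftfreq(up.size, d=dx)
    kz = np.sqrt((1.0 / wavelength**2 - v**2).astype(complex))
    out = np.fft.ifft(np.fft.fft(up) * np.exp(2j * np.pi * z * kz))
    return np.abs(out[pad:pad + u.size]) ** 2


def pulse_signal(code, b, z, wavelength, samples_per_bit=10):
    """Scan the displacement x0 over [0, 2D] and return (x0, PD signal)."""
    dx = b / samples_per_bit
    t = np.repeat(np.asarray(code, dtype=float), samples_per_bit)  # mask on [0, D)
    n = t.size
    f = np.concatenate([np.ones(n), 1.0 - t, np.ones(n)])   # Eq. (3) on [0, 3D)
    I = propagate_intensity(t, dx, z, wavelength)            # Eq. (2)
    x0 = np.arange(2 * n + 1) * dx
    signal = np.empty(x0.size)
    for k in range(x0.size):
        I_f1 = I * f[k:k + n]                                # Eq. (4): f(x + x0)
        I_f2 = propagate_intensity(I_f1, dx, z, wavelength)  # Eq. (5)
        signal[k] = np.sum(I_f2 * t)                         # Eq. (6), summed over x
    return x0, signal


code = [1, 0, 0, 1, 0, 1, 1, 0, 0, 0]   # short toy code, for illustration only
b = 10.0                                 # unit code width in µm
x0, s_ideal = pulse_signal(code, b, z=0.0, wavelength=0.66, samples_per_bit=4)
x0, s_diff = pulse_signal(code, b, z=500.0, wavelength=0.66, samples_per_bit=4)
print("minimum without diffraction at x0 =", x0[np.argmin(s_ideal)], "µm")
print("minimum with diffraction at x0 =", x0[np.argmin(s_diff)], "µm")
```

With z = 0 the scan reduces to the simple correlation picture, and the minimum falls at x0 = D, where the mask and reference code coincide. A nonzero z shows how diffraction reshapes the pulse around that position.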

2.3 Structure of GANs

The dataset is obtained by simulating the diffraction propagation principles in MATLAB, and a GAN structure is constructed to generate 150-bit codes. As shown in Fig. 3, the GAN model process begins with defining the generator, discriminator, and training functions. A diffraction propagation model is used to establish a dataset that includes several randomly generated codes as inputs. The input content consists of the zero position code and corresponding label (K/W).

The dataset is divided into training and testing sets. The generator’s role is to produce new codes that resemble those in the training set, whereas the discriminator attempts to differentiate between the generated and real training codes. The GAN combines the generator and discriminator as parameters, coordinating their training and sharing their parameters. During this process, the effectiveness of the discriminator is evaluated by using the binary CE loss function, and the GAN model is trained over multiple rounds.

The GAN used in this study has a simple network structure. Its logic is shown in Fig. 3. First, the generator, the discriminator, and the training function are defined. Second, the label and code provided by the dataset are entered into the generator and discriminator, respectively. The network structure of the generator, which receives the label, is fully connected, and its final output is a fake code (150 bits). The discriminator distinguishes between the real codes and the fake codes generated by the generator. Its network structure adopts a series of fully connected blocks (each consisting of a fully connected layer and a nonlinear activation layer) and calculates the loss, which mainly consists of the discriminator and generator judgments on true or fake codes, as shown in Fig. 3. Given that the code has 150 bits, the discriminator learns faster than the generator. Therefore, the initial learning rates adopted by the generator and discriminator here are 0.005 and 0.0001, respectively. This approach ensures that the two can learn against each other and avoids the situation in which the generator loss keeps increasing while the discriminator loss is almost 0. The trained generator receives a set of labels with high values and generates a set of new codes that are then input into the diffraction propagation model to obtain their corresponding real labels. The model calculates K and W and applies non-maximum suppression to select the optimal set of codes.
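
The loss balance described above can be illustrated with the binary CE terms themselves. The sketch below is not the network used in this study; it only evaluates the standard discriminator loss and the commonly used non-saturating generator loss (an assumption, since the text does not give the exact form) for two sets of hypothetical discriminator outputs.

```python
import numpy as np

def bce(p, target, eps=1e-12):
    """Binary cross-entropy between predicted probabilities p and a 0/1 target."""
    p = np.clip(np.asarray(p, dtype=float), eps, 1.0 - eps)
    return float(-np.mean(target * np.log(p) + (1.0 - target) * np.log(1.0 - p)))

def discriminator_loss(d_real, d_fake):
    # Real codes should be scored 1 and generated codes 0.
    return bce(d_real, 1.0) + bce(d_fake, 0.0)

def generator_loss(d_fake):
    # Assumed non-saturating form: the generator wants fakes scored as 1.
    return bce(d_fake, 1.0)


# Balanced case: the discriminator cannot tell real from fake.
d_loss_bal = discriminator_loss([0.5, 0.5], [0.5, 0.5])
g_loss_bal = generator_loss([0.5, 0.5])

# Collapsed case: the discriminator wins outright.
d_loss_col = discriminator_loss([0.999, 0.999], [0.001, 0.001])
g_loss_col = generator_loss([0.001, 0.001])

print(f"balanced:  D loss = {d_loss_bal:.3f}, G loss = {g_loss_bal:.3f}")
print(f"collapsed: D loss = {d_loss_col:.3f}, G loss = {g_loss_col:.3f}")
```

The collapsed case reproduces the failure mode mentioned above, with a discriminator loss near 0 and a large generator loss, which the smaller discriminator learning rate is intended to prevent.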

At this point, the waveform data of the pulse signal can be obtained, and K and W can be analyzed. The standard working parameters are set as d = 500 μm, b = 10 μm, and α = 0 μrad. The GAN model is completed with 16000 sets of training data and 4000 sets of test data, and binary codes are generated by ZD-GAN. For each generation, 10 binary codes are obtained, and the diffraction propagation model performs non-maximum suppression. This process ultimately produces absolute binary code schemes with improved robustness and accuracy. In the original training dataset, the best label is K (1.7920)/W (17.8839) = 0.1002, whereas the best-generated data have K (8.1156)/W (35.3961) = 0.2293 and K (6.5739)/W (28.5469) = 0.2303. This result represents a maximum improvement of 129.84%, demonstrating the effectiveness of the proposed ZD-GAN.
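
The quoted labels and the improvement figure can be reproduced from the K and W values above. The sketch rounds each label to four decimals before computing the improvement, which is an assumption about how the 129.84% figure was obtained.

```python
# K and W values quoted in the text (W in µm).
best_train = round(1.7920 / 17.8839, 4)   # best label in the training dataset
gen_first = round(8.1156 / 35.3961, 4)    # first generated code
gen_second = round(6.5739 / 28.5469, 4)   # second generated code

best_generated = max(gen_first, gen_second)
improvement = (best_generated - best_train) / best_train * 100.0

print(f"labels: {best_train}, {gen_first}, {gen_second}")
print(f"maximum improvement: {improvement:.2f} %")
```

Using the unrounded ratios instead changes the result only in the second decimal place.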

3 Code Performance Simulation and Theoretical Analysis

The use of ZD-GAN provides two absolute codes: A (16 transmissive units) with a contrast of 8.12 and B (25 transmissive units) with a contrast of 6.57 (Table 1).

Table 1 Two codes generated by ZD-GAN

Figure 4a shows the shape distribution of codes A and B (colored regions represent transmissive elements) and their performance with varying b values. Figure 4b, c shows the coordinated effects of d and α on the positioning performance of code A, whereas Fig. 4e, f shows the same effects but on the positioning performance of code B. Figure 4d presents the optimal positions of K/W given by the global optimization algorithm for code B.

Figure 4a demonstrates that when b is less than 10 μm, codes A and B exhibit contrasts close to 1 and a sharp change in the FWHM (W). An observation of the waveforms reveals that the main reason for the sudden increase in W is the secondary pulse peak closely following the main one, resulting in the calculation of W to include both pulses and not only the half-width of the main pulse peak. The considerable change in W indicates a clear decline in the performance of the pulse signal. This decline can considerably affect the search for the main pulse peak given experimental noise and thereby indicates the deterioration of pulse signal performance. For unit code widths beyond 10 μm, only code B shows a bimodal phenomenon, leading to a decrease in contrast to approximately 1. However, W does not change considerably, indicating that the bimodal phenomenon occurs only within a small interval near the valley bottom (i.e., pulse peak) and has no remarkable effect on fitting and positioning performance. Additionally, in general, code B is preferred, given its smaller W than code A.

Figure 4b, e indicates that code A has a larger region than code B in which the contrast exceeds 1. For both codes, the area in which the contrast exceeds 8 is similar, and the tolerance to distance changes is higher under negative angle rotation than under positive angle rotation. However, Fig. 4c, f shows that, for distances less than 1000 μm, the area of large W is smaller in code B than in code A and is less affected by negative rotation angles. Therefore, considering the tolerance to working distance and rotation angles, the overall performance of code B is superior to that of code A.

After the analysis of the three parameters, code B is selected as the experimental subject, and its positioning performance is evaluated. The result of the global optimization algorithm for code B is shown in Fig. 4d. The global optimization aims to provide an optimal working position with increased accuracy. The search is limited to working distances from 500 µm to 1500 μm and α from 1 × 10⁻³ μrad to 40 mrad. A Bayesian optimizer is used with at most 200 evaluations and a time limit of 60000 s, and expected improvement serves as the acquisition function for selecting the next point. Given that the optimizer searches for a minimum, this work adds a negative sign to K. The objective is to maximize the label (contrast) with b fixed at 10 μm and a minimum working distance d of 500 μm. The final results are d = 500.23 μm and α = −39.86 mrad, which fall within the optimal range obtained from traditional theories. The proposed algorithm can serve as a guide for optimal orientation with increased accuracy.

In summary, based on the analysis in Fig. 4, the findings suggest avoiding sharp changes in W by designing zero position code with a width b greater than 10 μm and within the working distance d and yaw angle α of 1000 μm and 20.9 mrad, respectively. Additionally, a working position with a small W as the central parameter is suggested for practical purposes.

4 Zero Position Code Experiment and Result Analysis

In accordance with the principles illustrated in Fig. 1, this study constructed the zero positioning experimental system shown in Fig. 5a, b. Additionally, the mask and reference of code B, as depicted in Fig. 5c, were designed and fabricated. The precision of both components was measured and verified by using a high-magnification microscope, as shown in Fig. 5d, and was found to meet the error requirement of less than 1 μm.

The displacement device used was the Thorlabs Z825B, which has a repeat positioning accuracy of 0.2 μm and a resolution of 0.05 μm. The rotation device, a TD-170-50, has a rotation accuracy of 145.4 μrad and a repeat accuracy of 12.1 μrad. The PD model was the Hamamatsu S5106. The principle of light intensity detection is as follows: a Si PIN photodiode converts optical signals into electrical signals through the absorption of light and the generation of electron–hole pairs. Given that the intensity of this electrical signal is proportional to that of the incident light, it can be used to measure the intensity of light or the change in the intensity of light. This study used an amplifier to amplify the signals and then obtained the corresponding waveform signals.

As shown in Figs. 1a and 5b, the beam starts from the light source and enters the polarizing beam splitter and quarter wave plate. Part of the beam is reflected by the mask and enters the PD, whereas the rest passes through the mask and irradiates the reference code. Subsequently, as shown in Fig. 1b, c, as the reference code moves, the energy of the light beam reflected by the reference code changes continuously, thereby changing the energy received by the PD and finally forming a pulse peak.

During the experiment, the zero position of the rotation served as the zero angle reference, and the distance d and yaw angle α were varied. The yaw angle changed by 3490 μrad in each step, and the waveform and positioning performance of the pulse peaks were recorded.

The light source used in the experiment is a laser whose spot energy follows a Gaussian distribution and exhibits good temporal stability. The light is initially projected onto the mask, and as the reference code moves, the PD generates signals that are received by the computer. Figure 6 compares the simulated and experimental waveforms at d = 880 μm. The close match supports the reliability of the simulation.

In the experiments, the pulse energy was normalized in consideration of a total reflectance of 6% from the reference code (chromium plate) to the PD. Waveforms were then recorded at distances of 1780, 1530, and 1280 μm. At each distance, the reference code was rotated in steps of 3490 μrad around the zero position within a range of ±10.47 mrad, as shown in Fig. 7.
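For readers who think in degrees, the 3490 μrad step is about 0.2°, so the ±10.47 mrad range is about ±0.6°. The short sketch below lists the yaw positions implied by that step and range. It assumes that both end points of the range are included, which the text does not state explicitly; the function names are illustrative only.

```python
import math

STEP_URAD = 3490.0        # yaw step quoted in the text, in microradians
HALF_RANGE_MRAD = 10.47   # half of the rotation range quoted in the text, in milliradians


def urad_to_deg(urad: float) -> float:
    """Convert microradians to degrees."""
    return math.degrees(urad * 1e-6)


def yaw_positions_urad(step_urad: float = STEP_URAD,
                       half_range_mrad: float = HALF_RANGE_MRAD) -> list[float]:
    """Yaw positions in microradians, symmetric about zero.

    Assumption: the end points of the range are themselves measured.
    """
    n_steps = round(half_range_mrad * 1e3 / step_urad)
    return [k * step_urad for k in range(-n_steps, n_steps + 1)]


positions = yaw_positions_urad()
for p in positions:
    print(f"{p:+9.0f} urad = {urad_to_deg(p):+.2f} deg")
```

Under that assumption, each working distance is sampled at seven yaw positions, three on either side of the zero angle reference.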

The analysis reveals that the positive angle rotation causes distortion in the pulse signal, resulting in a decrease in contrast and an increase in the FWHM (W). In turn, this phenomenon reduces the positioning performance. This finding is consistent with the simulation results. Conversely, negative angle rotation enhances the pulse signal but also induces changes in the zero position. Thus, actual calculations require adjustments based on angle measurements (Fig. 7).

Decreasing the distance enhances the pulse signal, indicating that a shorter distance indeed weakens the diffraction effect. This finding is aligned with the simulation results. However, an excessive reduction in distance may lead to collisions between the reference code and the mask during rotation, which made a small d unfeasible in the experiment. In practical use, the length of the structure (the grating or the mask) can be shortened so that the reference code can have a large rotation angle.

As the angle varies, the contrast K and W change, as shown in Fig. 8a. Although the experimental contrast K is generally similar to the simulated values, it does not hold a constant level before decreasing; rather, K starts to decrease early as the angle changes. This behavior indicates that in practical use the pulse waveform is highly sensitive to the rotation angle because of factors such as the energy distribution of the light spot and electrical noise. Similarly, the experimental W increases with angle as in the simulation but at a faster rate. The main reason for this deviation is the design of code B. When the amplitude of the main pulse decreases, the secondary peak immediately becomes significant, so the calculation of W includes both pulses rather than only the width of the main pulse peak. As a result, W increases dramatically in the experiment, as early as 3490 μrad. Therefore, in practical applications, rotations beyond this angle must be avoided.
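The jump in W described above can be reproduced with a toy waveform. The sketch below is only an illustration: it uses two synthetic Gaussian pulses rather than the measured signal of code B, and both width functions are hypothetical helpers, since the text does not specify how W was extracted from the sampled data. One function measures the span between the outermost samples at or above half of the maximum; the other follows only the main lobe.

```python
import math


def gaussian(x: float, center: float, sigma: float, amplitude: float) -> float:
    return amplitude * math.exp(-0.5 * ((x - center) / sigma) ** 2)


def fwhm_outermost(xs: list[float], ys: list[float]) -> float:
    """Span between the first and last samples at or above half of the maximum."""
    half = max(ys) / 2.0
    above = [x for x, y in zip(xs, ys) if y >= half]
    return above[-1] - above[0]


def fwhm_main_lobe(xs: list[float], ys: list[float]) -> float:
    """Width of the contiguous run at or above half maximum that contains the peak."""
    half = max(ys) / 2.0
    peak = ys.index(max(ys))
    lo = peak
    while lo > 0 and ys[lo - 1] >= half:
        lo -= 1
    hi = peak
    while hi < len(ys) - 1 and ys[hi + 1] >= half:
        hi += 1
    return xs[hi] - xs[lo]


# Position axis in arbitrary units, sampled every 0.1 from -50 to 50.
xs = [i * 0.1 for i in range(-500, 501)]


def waveform(secondary_amplitude: float) -> list[float]:
    """Main pulse of unit height at 0 plus a secondary pulse at 20 (synthetic)."""
    return [gaussian(x, 0.0, 5.0, 1.0) + gaussian(x, 20.0, 5.0, secondary_amplitude)
            for x in xs]


for amp in (0.3, 0.6):
    ys = waveform(amp)
    print(f"secondary/main = {amp:.1f}: "
          f"outermost W = {fwhm_outermost(xs, ys):.1f}, "
          f"main-lobe W = {fwhm_main_lobe(xs, ys):.1f}")
```

As long as the secondary pulse stays below half of the main pulse, both measures agree. Once it exceeds that level, the outermost-crossing width spans both pulses and grows abruptly, while the main-lobe width barely changes.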

Figure 8b shows the final evaluation of the positioning performance at 880 and 1780 μm with 0 degrees of rotation. The absolute sum of the positioning variation during the forward and reverse movements of the displacement stage is at the sub-micrometer level, demonstrating that the generated code can achieve precise positioning. The sub-micrometer positioning accuracy provides support for future high-precision hybrid-positioning applications.

5 Summary

This study addresses the limitations in the design and performance of zero position codes for precision absolute measurement. A novel deep learning approach, namely ZD-GAN, is introduced to generate a high-quality 150-bit zero position code. The working parameters that affect the practical performance of the zero position code, including unit code width (b), working distance (d), and yaw angle (α), are analyzed through simulations and experiments. The results demonstrate that the codes generated by ZD-GAN are effective and outperform those in the training datasets. The analysis of two typical codes shows that the contrast (K) is generally positively correlated with b and negatively correlated with d and α. The FWHM (W) is mainly positively correlated with d and b, and its relationship with α becomes more complex at large distances. The findings suggest that for code design, b must be greater than 10 μm, and the values of d and α should ideally be within 1000 μm and 20.94 mrad, respectively, to avoid abrupt changes in W. However, in consideration of noise in practice, the angular range is limited to below 3490 μrad, and the selection of parameters with a small W as the central working parameter is advised. Codes for other requirements, such as a long working distance or a large yaw angle, can be quickly generated with the help of ZD-GAN, and the new code can then be verified, analyzed, and optimized with the help of the diffraction propagation model. The proposed code design and parameter optimization provide valuable guidance for the development and design of absolute positioning measurements.
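The recommended ranges can be expressed as a simple pre-installation check. The following is a minimal sketch, not part of the published method: the class and function names are hypothetical, the treatment of values exactly on a limit is an assumption, and the advice to avoid intervals where W changes sharply is not covered because it depends on the simulated W curves.

```python
from dataclasses import dataclass

MIN_B_UM = 10.0                      # unit code width must exceed this value
MAX_D_UM = 1000.0                    # recommended upper bound on working distance
MAX_ALPHA_SIM_URAD = 20940.0         # yaw limit suggested by the simulations (20.94 mrad)
MAX_ALPHA_PRACTICAL_URAD = 3490.0    # tighter yaw limit suggested by the experiments


@dataclass(frozen=True)
class WorkingPoint:
    b_um: float        # unit code width, micrometers
    d_um: float        # working distance, micrometers
    alpha_urad: float  # yaw angle, microradians


def check_working_point(p: WorkingPoint, practical: bool = True) -> list[str]:
    """Return the list of recommendations that the working point violates.

    Assumption: values exactly on a distance or angle limit are accepted.
    """
    issues = []
    if p.b_um <= MIN_B_UM:
        issues.append(f"b = {p.b_um} um is not greater than {MIN_B_UM} um")
    if p.d_um > MAX_D_UM:
        issues.append(f"d = {p.d_um} um exceeds {MAX_D_UM} um")
    limit = MAX_ALPHA_PRACTICAL_URAD if practical else MAX_ALPHA_SIM_URAD
    if abs(p.alpha_urad) > limit:
        issues.append(f"|alpha| = {abs(p.alpha_urad)} urad exceeds {limit} urad")
    return issues


print(check_working_point(WorkingPoint(b_um=12.0, d_um=880.0, alpha_urad=0.0)))
print(check_working_point(WorkingPoint(b_um=8.0, d_um=1780.0, alpha_urad=6980.0)))
```

Setting `practical=False` applies the looser simulation-based yaw limit instead of the experimental one.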

Availability of Data and Materials

The authors declare that all data will be made available on reasonable request.

Funding

This work was supported by the National Natural Science Foundation of China (No. 62275142) and the Basic and Applied Basic Research Foundation of Guangdong Province (No. 2021B1515120007).

Author information

Authors and Affiliations

Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China
Shengtong Wang, Linbin Luo & Xinghui Li
Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, Shenzhen, 518055, China
Xinghui Li

Authors

Shengtong Wang
Linbin Luo
Xinghui Li

Contributions

Conceptualization, methodology, and software: SW and XL; validation, formal analysis, and investigation: SW, LL, and XL; resources: XL; data curation: SW and LL; writing (original draft preparation; review and editing): SW, LL, and XL; visualization: SW; supervision, project administration, and funding acquisition: XL. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Xinghui Li.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wang, S., Luo, L. & Li, X. Design and Parameter Optimization of Zero Position Code Considering Diffraction Based on Deep Learning Generative Adversarial Networks. Nanomanuf Metrol 7, 2 (2024). https://doi.org/10.1007/s41871-023-00221-6

Received: 12 August 2023
Revised: 06 November 2023
Accepted: 08 November 2023
Published: 05 February 2024
DOI: https://doi.org/10.1007/s41871-023-00221-6
