Abstract
Zernike expansion is an important tool for aberration retrieval in the optical field. The Zernike coefficients in the expansion can be solved in a linear system from those focal region intensity images, which can be modeled by the extended Nijboer–Zernike approach. Here we point out that those coefficients usually follow from different prior distributions, and especially, their variances could be dramatically diverse. To incorporate the prior information, we further introduce customized penalties to those Zernike coefficients and adopt a global adaptive generative adjustment algorithm for estimating coefficients. Based on both simulated and real data, numerical experiments show that our method outperforms other conventional methods, and provides an estimate of Zernike coefficients with a low mean square error.
Similar content being viewed by others
Introduction
The aberration retrieval (AR) from the intensity point-spread function in the focal region is widely used in the optical field. It usually adopts the Zernike expansion to represent the aberration linearly. The phase retrieval1,2, phase diversity3,4 and curvature sensing5,6 are three classical methods for the AR. They solve inverse problems based on the optical mechanism and statistical parameter estimation. The work7 considered an extended Nijboer–Zernike (ENZ) diffraction from an analytic description of the focal field and realizable solutions for the aberration coefficients7,8,9. A further work10 suggested an ENZ AR method for identifying the imperfection of lens from the intensity point spread function (PSF) of the optical system. This ENZ AR method can be further applied to the high-resolution optical lithography11. The general pupil function can be represented by a linear function of Zernike coefficients in the Zernike expansion. Through further diffraction integrals, the light field on the focal plane also has an expansion:
in which the pair \((r,\varphi )\) are polar coordinates on the image plane, the parameter f is the camera distance from the focus plane, \(\{V_n^m(r,f)\}_{n,m}\) are contants relying on the optical system given r and f, and coefficients \(\{\beta _n^m\}\) are extended Nijboer–Zernike coefficients.
In theory, the expansion considers infinite terms, but in practice, only the first Q terms are retained for the aberration retrieval. For example, Q \(=\) 91 was chosen to estimate those Zernike coefficients in the work12. Moreover, it illustrates that those coefficients could follow some prior distribution and introduce the penalty mechanism into the aberration retrieval in the optical field. Here we further study the prior distribution of the Zernike coefficients by decomposing the simulated pupil function. As discussed in the work12, atmospheric wavefronts can be simulated using the method13 to generate 1000 random phases. Furthermore, 1000 generalized pupil functions can be generated with those phases at constant amplitude. We consider the first 91 terms in the Zernike expansion of the generalized pupil function. Since the first term has a real coefficient, there are 181 real coefficients by separating the real and imaginary parts. We set the first coefficient to be one, and compute the other 180 coefficients. By decomposing 1000 simulated pupil functions, we obtain 1000 observations for each coefficient. The box plot on Zernike coefficients is shown in Fig. 1. This box plot illustrates that those coefficients follow from different prior distributions. Moreover, their standard deviation could be dramatically diverse as shown in Fig. 2. Inspired by these findings, we assume that Zernike coefficient \(\beta _{n}^{m}\) follows a customized prior distribution, which is Gaussian with mean zero and variance \(\sigma _{n,m}^2\). The prior mentioned in our paper means a knowledge that the prior distributions for various model parameters could have diverse variances. Since we consider the Gaussian prior with zero mean, the prior distribution is determined only by its variance. The “customized priors” mean the customized variances for model parameters.
For incorporating the prior information, we further introduce customized penalties for those Zernike coefficients and adopt a global adaptive generative adjustment algorithm for estimating coefficients. The experiment results on the simulated and real data illustrate that our method, utilizing customized prior variances, provides an estimate with a lower mean squared error compared to other ENZ aberration retrieval methods.
Methods
Extended Nijboer–Zernike diffraction
In this study, we assume that the optical system is monochromatic and its aperture is circular and unobstructed. The generalized pupil function is usually expressed as14
where \(A(\rho ,\theta )\) and \(\phi (\rho ,\theta )\) are the amplitude and phase of the pupil plane, respectively. \(\mathbf{i }=\sqrt{-1}\) and \((\rho ,\theta )\) are polar coordinates, \(\rho \in [0,1]\), \(\theta \in [0,2\pi ]\). The phase can be linearly expressed by Frits Zernike expansion14
where \(\{Z_k(\rho ,\theta )\}_{k=0}^{K-1}\) are Frits Zernike basis functions, and \(\{\alpha \}_{k=0}^{K-1}\) are the coefficients of the Zernike basis. From the work15, the generalized pupil function can be decomposed by using Zernike coefficients \(\{\beta _n^m\}\) as
where m and n are integers such that \(n \ge 0\) and \(n-|m|\) is even. And \(R_n^{|m|}(\rho )\) is the radial polynomial. The first Zernike coefficient \(\beta _0^0\) is real, and the others \(\{\beta _n^m\}\) are complex for \(m \ne 0, n \ne 0\). In the simulation, we generated random phases \(\{\alpha \}_{k=0}^{K-1}\) using the method13. Thus we also obtained the phase function \(\phi (\rho ,\theta )\) by (3). Furthermore, we computed the generalized pupil function by (2) at constant amplitude. Finally, we consider the first 91 terms in the Zernike expansion (4) of the generalized pupil function.We denote all the Zernike coefficients by a coefficient vector \(\varvec{\beta }\). Thus the pupil function is a linear function on the vector \(\varvec{\beta }\), which is to be estimated as aberration retrieval.
From works11,15, the radial polynomials in (4) can be expressed as
Hence \(R_n^{|m|}(\rho )\) can be constructed before aberration retrieval for a specific choice of m, n, and \(\rho\).
In the work8, the light field on the focal plane is expressed as
where the parameter f is the camera distance from the focus plane, and \((r,\varphi )\) are polar coordinates on the image plane. From the work7, the Bessel series has the form
where \(J_m\) is a Bessel function of the first kind with order m, and
where \(q=\frac{n+|m|}{2}\), \(p=\frac{n-|m|}{2}\), and \(l=1,2,\ldots ; j=0,\ldots ,p\).
Aberration retrieval model using the ENZ approach
Using the formula (1), the PSF intensity in the focal region can be expressed as
where
is the sum of the remaining second order cross terms. In the summand, symbol \('\) means the omission of \(n=0\) terms in the summand, and symbol \(''\) means the omission of \(n_1=m_1=0\) or \(n_2=m_2=0\) terms. Re() and Im() denote the real and imaginary parts of a complex number. Symbol \(^*\) denote the complex conjugate.
The PSF intensity with an additive detector readout noise \(\varepsilon\) forms an aberration retrieval model:
where the noise \(\varepsilon (r,\varphi ;f) \sim N(0,\sigma ^2)\). The intensity \(I_{b}(r,\varphi ;f)\) can be collected to estimate coefficients \(\{\beta _n^m\}\) using (8), as shown in Fig. 3.
This aberration retrieval process, previously proposed in works10,16,17,18,19, has four main steps:
- (1)
Input the collected PSFs \(I_b\). Set the maximum iteration step K, \(I^{(0)}=I_b\), \(C^{(0)}=0\) and \(k=0\).
- (2)
Assumes that \(I^{(k)}\) can be described as linear combinations of the entrance pupil aberrations with coefficients \(\{\beta _n^m\}\). This is equivalent to omitting the cross terms of (8). And then compute \(\{\beta _n^m\}^{(k)}\).
- (3)
Calculate \(C^{(k+1)}\) using (9) by \(\{\beta _n^m\}^{(k)}\).
- (4)
Set \(I^{(k+1)} = I_b - C^{(k+1)}\), \(k=k+1\). If \(k > K\), exit. Otherwise, go to step (2).
Notice that due to phase wrapping effects occurring in the reconstructed pupil distribution, this retrieval process may fail in case that the aberration magnitude is beyond some large range. So we only considered a small wavefront error \((pv<2\pi )\) in our experiment. From works10,12,16,17, we know that the intensity \(I^{(k)}\) in Step (2) can be linearly transformed into a vector \({\mathbf {L}}^{(k)}\). We further get a classical linear model on the coefficient vector \(\varvec{\beta }\):
where \(\varvec{\delta }\) is a Gaussian random noise with zero mean. The specific forms of \({\mathbf {M}}\), \(\varvec{\beta }\), \(\varvec{\delta }\), \({\mathbf {L}}^{(k)}\) can be referred to the work12.
Usually, the least square estimate:
is used to deal with this retrieval process10,16,17. Recently, considering the potential prior information, the work12 further introduces the penalty mechanism
into the retrieval process, and propose a penalized ENZ AR algorithm.
Global adaptive generative adjustment for estimating coefficients
As shown in the “Introduction” section, those Zernike coefficients follow from different prior distributions. Especially, their standard deviation could be dramatically diverse. So we assume that coefficient \(\beta _j\) has a prior \(N(0,\tau _j^2)\) and the standard deviation \(\tau _j\) could be diverse. From the classical linear model, the observed response \({\mathbf {y}}\) is generated by a linear system \({\mathbf {X}}\varvec{\beta }+\varvec{\epsilon }\), where \(\varvec{\beta }=(\beta _1,\ldots ,\beta _p)^T\) can be viewed as the true signal, the Gaussian noise \(\varvec{\epsilon }\sim N({\mathbf {0}},\sigma ^2{\mathbf {I}})\) and \({\mathbf {I}}\) is an identity matrix.
Considering the posterior distribution of coefficients, we further obtain an objective function
with multiple hyperparameters \(\{b_j\}\). These hyperparameters can be viewed as the prior information of model parameters \(\{\beta _j\}\). In (14), n is the sample size, p is the dimension of the vector \(\varvec{\beta }\), and the hyperparameter \(b_j\), \(j=1,\ldots ,p,\) provides a customized shrinkage on the coefficient \(\beta _j\).
We adopt a global adaptive generative adjustment (GAGA) algorithm to recover a true signal \(\varvec{\beta }\). In Algorithm 1, hyperparameters and the signal are alternatively updated by a data-driven method. The inputs of this algorithm are the response vector \({\mathbf {y}}\), the design matrix \({\mathbf {X}}\), the iteration number K. The output of this algorithm is the signal estimate \(\hat{\varvec{\beta }}=\text {GAGA}({\mathbf {y}},{\mathbf {X}},K)\). The convergency analysis of Algorithm 1 and the large sample properties of its output is discussed in the work20.
The following Algorithm 2 combines the ENZ AR process and the GAGA algorithm, which utilizes the customized prior information and updates model parameters and hyperparameters alternatively.
Results
We suggest a method adopting a global adaptive generative adjustment (GAGA) algorithm for estimating the ENZ coefficients. We call this method the GAGA ENZ AR. In a previos work12, the least absolute shrinkage and selection operator (Lasso) algorithm can also be applied to the aberration retrieval. So we compared the GAGA ENZ to the Lasso ENZ AR and the conventional ENZ AR17 in the simulation with synthesized data. The characteristics of the optical system are shown in Table 1.
The simulations were implemented in three steps:
- (1)
We simulated three PSFs (images intra, in, and extra focus) from (8) with the first Q leading terms \(\{\beta _n^m\}\), where \(\beta _0^0 = 1\). In this paper, we set \(Q=91\) in the AR process. We further added Gaussian white noise to PSFs and simulated four noise levels (40 dB, 35 dB, 30 dB, 25 dB) measured using Signal-Noise Ratio (SNR).
- (2)
Using the simulated images with noise, we estimated \(\{\beta _n^m\}\) by ENZ AR, Lasso ENZ AR and GAGA ENZ AR separately.
- (3)
We divided the estimates \(\{{\hat{\beta }}_n^m\}\) by \({\hat{\beta }}_0^0\) to ensure that the first component of \(\{{\hat{\beta }}_n^m\}\) is 1, and then assessed the experimental results on the residual square error \({\Vert \hat{\varvec{\beta }}-\varvec{\beta }\Vert _2^2}\), where \(\varvec{\beta }\) was the true parameter vector, and \(\hat{\varvec{\beta }}\) was obtained by ENZ AR, Lasso ENZ AR and GAGA ENZ AR separately.
Different noise levels were chosen, and the above procedures were executed hundreds of time for each noise level. The empirical MSE of the true parameter \(\varvec{\beta }\) were calculated by (15):
where i means the ith test. N is the total number of test. The standard deviation of MSE can be further computed by:
Random aberration examples
We used the method13 to generate 100 random phases \((pv<2\pi , rms=0.2\pi )\) by the first 21 terms of Frits Zernike expansion. Some of them are shown in Fig. 4. These phases at constant amplitude further generated 100 generalized pupils, which can be expressed by (4) for \(Q=91\) in ENZ. Then we get 100 groups of \(\{\beta _n^m\}\) as the true parameters in experiments. We compared the empirical MSE (15) for the output of ENZ AR, Lasso ENZ AR and GAGA ENZ AR. In Fig. 5 we show the error bar of the empirical MSE at each given SNR, where the length of the bar is two times the standard deviation STD (16). Our method GAGA ENZ AR produced superior parameter estimates at each noise level compared to ENZ AR and Lasso ENZ AR. Though our algorithm works well for a small wavefront error \((pv<2\pi )\), it may fail to handle the AR for strong wavefront aberrations in the atmospheric disturbances.
Real data example
The phase shown in Fig. 6a was observed from the measurement of non-common path aberrations from a 1.23 m adaptive optics telescope in Changchun China. More detailed descriptions on this optics telescope can be found in these works21,22,23. The field with constant amplitude and phase (Fig. 6a, \(rms = 0.14\pi\)) has NZ coefficients shown in Fig. 6b,c.
We repeated the simulation one hundred times and showed the error bar of the MSE in Fig. 7.
When the noise SNR is 40, the conventional ENZ AR performed marginally than LASSO ENZ AR. However, GAGA ENZ AR outperformed ENZ AR and LASSO ENZ AR at each noise level. Moreover, GAGA ENZ AR also kept the empirical MSE in a low value even at a high noise level.
Conclusion
We find that to represent an optical field, Zernike coefficients have their customized priors. For incorporating the customized prior information, we adopt a global adaptive generative adjustment method for estimating coefficients in the ENZ aberration retrieval. In the simulated and real data experiments, our algorithm provides better performance on the MSE compared to ENZ AR and LASSO ENZ AR.
References
Fienup, J. R. Phase retrieval algorithms: a comparison. Appl. Opt. 21, 2758–2769 (1982).
Lyon, R. G., Miller, P. E. & Grusczak, A. HST phase retrieval: a parameter estimation. Proc. SPIE 1567, 317–326 (1991).
Paxman, R. G. & Fienup, J. R. Misalignment sensing and images reconstruction using phase diversity. J. Opt. Soc. Am. A 5, 914–923 (1988).
Blanc, A., Fusco, T., Hartung, M., Mugnier, L. M. & Rousset, G. Calibration of NAOS and CONICA static aberrations. Application of the phase diversity technique. Astron. Astrophys. 399, 373–383 (2003).
Roddier, F. Curvature sensing and compensation: a new concept in adaptive optics. Appl. Opt. 27, 1223–1225 (1988).
Forbes, F. F. & Roddier, N. A. Adaptive optics using curvature sensing. Proc. SPIE 1542, 140–147 (1991).
Janssen, A. J. E. M. Extended Nijboer–Zernike approach for the computation of optical point-spread functions. J. Opt. Soc. Am. A 19, 849–857 (2002).
Braat, J. J. M., Dirksen, P. & Janssen, A. J. E. M. Assessment of an extended Nijboer–Zernike approach for the computation of optical point-spread functions. J. Opt. Soc. Am. A 19, 858–870 (2002).
Braat, J. J. M., Dirksen, P., van Haver, S. & Janssen, A. J. E. M. Extended Nijboer–Zernike (ENZ) analysis and aberration retrieval. http://www.nijboerzernike.nl.
Dirksen, P., Braat, J. J. M., Janssen, A. J. E. M. & Juffermans, C. A. H. Aberration retrieval using the extended Nijboer–Zernike approach. J. Microlithogr. Microfabr. Microsyst. 2, 61–68 (2003).
Dirksen, P., Braat, J. J. M. & Janssen, A. J. E. M. Estimating resist parameters in optical lithography using the extended Nijboer–Zernike theory. J. Microlithogr. Microfabr. Microsyst. 5, 013005 (2006).
Wang, B., Diao, H. A., Guo, J. H., Liu, X. Y. & Wu, Y. H. Adaptive variable selection for extended Nijboer–Zernike aberration retrieval via lasso. Opt. Commun. 15, 78–86 (2017).
Roddier, N. Atmospheric wavefront simulation using Zernike polynomials. Opt. Eng. 29, 1174–1180 (1990).
Born, M. & Wolf, E. Principles of optics (Cambridge University Press, Cambridge, 1999).
van Haver, S., Braat, J. J. M., Dirksen, P. & Janssen, A. J. E. M. High-NA aberration retrieval with the extended Nijboer–Zernike vector diffraction. J. Eur. Opt. Soc. 1, 06004 (2006).
Riaud, P., Mawet, D. & Magette, A. Nijboer–Zernike phase retrieval for high contrast imaging principle, on-sky demonstration with NACO, and perspectives in vector vortex coronagraphy. Astron. Astrophys. 545, A150 (2012).
van der Avoort, C., Braat, J. J. M., Dirksen, P. & Janssen, A. J. E. M. Aberration retrieval from the intensity point-spread function in the focal region using the extended Nijboer–Zernike approach. J. Mod. Opt. 52, 1695–1728 (2005).
Braat, J. J. M., Dirksen, P., Janssen, A. J. E. M. & van de Nes, A. S. Extended Nijboer–Zernike representation of the vector field in the focal region of an aberrated high aperture optical system. J. Opt. Soc. Am. A 20, 2281–2292 (2003).
Braat, J. J. M., Dirksen, P., Janssen, A. J. E. M., van Haver, S. & van de Nes, A. S. Extended Nijboer–Zernike approach to aberration and birefringence retrieval in a high-numerical-aperture optical system. J. Opt. Soc. Am. A 22, 2635–2650 (2005).
Wang, B., Wang, X. F. & Guo, J. H. Global adaptive generative adjustment. arXiv:1911.00658 (2019).
Ma, X. X., Wang, J. L., Wang, B. & Li, H. Z. Phase diversity for calibrating noncommon path aberrations of adaptive optics system under nonideal measurement environment. Optik 125, 5029–5035 (2014).
Wang, B. et al. Calibration of no-common path aberration in AO system using multi-channel phase-diversity wave-front sensing. Opt. Precis. Eng. 21, 1683–1692 (2013) (in Chinese).
Wang, Z. Y. et al. Calibration of non-common path static aberrations by using phase diversity technology. Acta Opt. Sin. 32, 0701007 (2012) (in Chinese).
Acknowledgements
This work is supported by the Fundamental Research Funds for the Central Universities (Grant Numbers 2412019FZ030) and Jilin Provincial Science and Technology Development Plan funded Project (Grant Numbers 20180520026JH).
Author information
Authors and Affiliations
Contributions
X.W. wrote the main manuscript. B.W. conceived the idea and conducted all the experiments. Q.A. did the simulation on the curvature sensor. All authors discussed and reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, B., Wang, X. & An, Q. Aberration retrieval by incorporating customized priors for estimating Zernike coefficients. Sci Rep 10, 11137 (2020). https://doi.org/10.1038/s41598-020-68012-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-68012-3
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.