The use of artificial neural networks in electrostatic force microscopy

Castellano-Hernández, Elena; Rodríguez, Francisco B; Serrano, Eduardo; Varona, Pablo; Sacha, Gomez Monivas

doi:10.1186/1556-276X-7-250

Nano Express
Open access
Published: 15 May 2012

Volume 7, article number 250, (2012)
Nanoscale Research Letters

Elena Castellano-Hernández¹,
Francisco B Rodríguez¹,
Eduardo Serrano¹,
Pablo Varona¹ &
Gomez Monivas Sacha¹

Abstract

The use of electrostatic force microscopy (EFM) to characterize and manipulate surfaces at the nanoscale usually faces the problem of dealing with systems where several parameters are not known. Artificial neural networks (ANNs) have proven to be a very useful tool for tackling this type of problem. Here, we show that the use of ANNs allows us to quantitatively estimate magnitudes such as the dielectric constant of thin films. To improve thin film dielectric constant estimations in EFM, we first increase the accuracy of numerical simulations by replacing the standard minimization technique with a method based on ANN learning algorithms. Second, we use the improved numerical results to build a complete training set for a new ANN. The results obtained by the ANN suggest that accurate values for the thin film dielectric constant can only be estimated if the thin film thickness and sample dielectric constant are known.

PACS: 07.79.Lh; 07.05.Mh; 61.46.Fg.

Background

When electrostatic force microscopy (EFM) [1–6] is working at the nanoscale, several interacting parameters have a strong influence on the signal [7]. Since the electrostatic force is a long-range interaction, macroscopic parameters such as the shape of the tip or the sample thickness can strongly modify the electrostatic interaction [8, 9]. However, in many experimental situations, it is not possible to obtain accurate values for all of these parameters, and it is very difficult to achieve quantitative experimental results [10]. Previous results [11] have shown that artificial neural networks (ANNs) [12] are a useful tool to characterize dielectric samples in highly undetermined EFM systems. Using known force vs. distance curves as inputs for their training, ANNs have been able to estimate the dielectric constant of a semi-infinite sample in a system where the tip radius and shape were not known.

In this paper, we demonstrate that ANNs can be used to improve the accuracy of numerical simulations in EFM and to quantitatively estimate the thin film dielectric constant from vertical force curves. First, we compare standard minimization and ANN techniques, demonstrating that ANN techniques provide better control of the final result of the simulation. Second, the improved numerical results are used to create a complete training set for an ANN that estimates the dielectric constant of a thin film placed over a dielectric sample.

As has been shown before [11], ANNs are able to estimate physical magnitudes in highly undetermined systems. In this article, we train an ANN with a complete thin film sample to study whether the geometry of the sample must be known in order to estimate the thin film dielectric constant. Although the influence of the thin film thickness is much larger than that of the substrate dielectric constant, we demonstrate that accurate values of the thin film dielectric constant can only be obtained when both magnitudes are known.

Methods

Artificial neural network formalism for the calculation of electric fields

To briefly illustrate how ANNs can be related to the problem of estimating unknown parameters in EFM setups, let us focus on the scheme shown in Figure 1a. Here, we have a set of metallic objects that are connected to a battery that provides a constant electric potential. The calculation of electric magnitudes such as the electrostatic potential or the force between these elements is, in general, very difficult, and only a few specific geometries can be analytically calculated [13]. To solve electrostatic problems with arbitrary geometries, an algorithm called the generalized image charge method (GICM) [14, 15] has been developed. The GICM replaces the surface charge density with a set of charges inside the metallic objects. The value, position, and number of charges are obtained after a standard least-squares minimization (LSM) routine for the electrostatic potential at the metallic surfaces. An alternative to the LSM is to use the ANN formalism by considering the value of the charges q_i as the weights w_i and the potential at the metallic surfaces V_j as the expected output values y_j (see inset of Figure 1a). The input patterns x_ij play the role of the Green functions G_i(r_ij, σ_i), where r_ij is the distance between the i-charge element and the j-surface point, and σ_i represents the geometrical parameters that may be needed to adequately calculate the electrostatic potential generated by the i-charge element (for example, if the element is a charged line, σ_i would represent the length of the line). Following this formalism, the electrostatic potential V_j can be expressed as

V_{j} = \sum_{i = 1}^{N_{C}} q_{i} G_{i}(r_{ij}, \sigma_{i}) = \sum_{i = 1}^{N_{C}} w_{i} x_{ij}, \qquad (1)

where N_C is the number of charged elements q_i inside the tip. The rightmost term in Equation 1 represents the electrostatic potential in the notation of a single-output ANN, where x_ij represents the inputs to the output neuron y_j, and w_i are the connection weights from the inputs (i = 1,…,N_C) to this neuron (see Figure 1a). A neural network learns by example. The task of the learning algorithm of the network (e.g., the delta rule or backpropagation [12, 16]) is to determine w_i (i.e., q_i) from the available x_ij data. Previous knowledge can help us to decide the best values for N_C (by the selection of the number of neurons) and G_i (by the selection of input patterns).
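The correspondence in Equation 1 can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the winGICM implementation: the geometry (a unit sphere held at V₀ = 1 with three point charges on its axis), the 1/r Green function in units where 1/(4πϵ₀) = 1, the batch form of the delta rule, and the names `green_matrix` and `train_delta_rule` are all hypothetical choices made for the example.

```python
import numpy as np

def green_matrix(surface_pts, charge_pts):
    """Input patterns x_ij = G_i(r_ij) = 1 / r_ij for point charges.

    Assumption: units in which 1 / (4 * pi * eps0) = 1.
    Returns an array of shape (n_surface_points, n_charges).
    """
    diff = surface_pts[:, None, :] - charge_pts[None, :, :]
    return 1.0 / np.linalg.norm(diff, axis=2)

def train_delta_rule(x, v_target, learning_rate=0.1, n_iterations=20_000):
    """Batch delta rule for one linear output neuron, starting from q_i = 0."""
    w = np.zeros(x.shape[1])
    for _ in range(n_iterations):
        error = v_target - x @ w              # V_j minus the network output y_j
        w += learning_rate * x.T @ error / len(v_target)
    return w

# Hypothetical test geometry: unit sphere at potential V0 = 1,
# sampled along one meridian, with three point charges on the z axis.
theta = np.linspace(0.0, np.pi, 50)
surface = np.column_stack([np.sin(theta), np.zeros_like(theta), np.cos(theta)])
charges = np.array([[0.0, 0.0, -0.5], [0.0, 0.0, 0.0], [0.0, 0.0, 0.5]])

x = green_matrix(surface, charges)
v_target = np.ones(len(surface))

q_ann = train_delta_rule(x, v_target)                 # weights w_i = charges q_i
q_lsm, *_ = np.linalg.lstsq(x, v_target, rcond=None)  # standard least squares

max_err_ann = np.max(np.abs(x @ q_ann - v_target))
max_err_lsm = np.max(np.abs(x @ q_lsm - v_target))
print("ANN charges:", q_ann, "max surface error:", max_err_ann)
print("LSM charges:", q_lsm, "max surface error:", max_err_lsm)
```

In this toy case a single unit charge at the centre reproduces the surface potential exactly, so both routines should end up at essentially the same charges; the realistic tip geometries discussed in the text are where the two approaches start to differ.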

To compare both minimization techniques (LSM and ANN), we have simulated the EFM shown in Figure 1b. To illustrate the advantages of using the ANN minimization routine, we have calculated the q_i coefficients for the tip-sample distance described in Figure 1b with both the ANN and standard LSM routines. We have used the winGICM software v1.1 [17], which also uses ANNs to estimate the number of point charges and the number of segments (4 and 12, respectively). The lowest LSM error at the tip surface (located at x = 0.9876 R, z = 0.8896 R) was 0.0019 V₀, where V₀ is the voltage applied to the tip. In Figure 2, we show the electrostatic potential distribution obtained for different numbers of iterations N_it in the training process. We initialized q_i = 0 and fixed the learning rate to 0.1. When N_it = 100,000, the equipotential distribution looks identical to that obtained by the LSM. However, the highest error at the tip surface is 0.0076 V₀ (four times larger than that from the LSM). At this point, it seems that the LSM is a better minimization technique since it gives a lower error and does not use any iterative process. However, the q_i values obtained by the LSM are not adequate for several physical applications. As we can see in Table 1, the ANN q_i absolute values are much smaller than those from the LSM. This fact is not important when the q_i do not have any physical meaning. However, in our case, the q_i correspond to the charge inside the tip and must be used to calculate electric magnitudes such as the capacitance or the electrostatic force F (used in the following section). By using the ANN q_i values, these magnitudes can be calculated with improved accuracy since the low values of the charges strongly reduce the numerical noise. In conclusion, the ANN minimization allows the user to choose the balance between numerical accuracy and physical meaning of the simulations and to adapt them to the needs of the problem.
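The trade-off between surface error and charge magnitude can be reproduced qualitatively with a toy model. The sketch below rests on an assumption that the text does not state: that the smaller ANN charges come from starting at q_i = 0 and stopping after a finite number of iterations, which behaves like a regularizer on an ill-conditioned fit. The geometry (eight closely spaced point charges inside a unit sphere), the step size derived from the largest eigenvalue (instead of the fixed 0.1 used in the text), and the function names are hypothetical.

```python
import numpy as np

def potential_matrix(n_charges, n_surface=60):
    """x_ij = 1 / r_ij for point charges on the z axis of a unit sphere.

    Surface points lie on one meridian, given as (x, z) pairs; units
    with 1 / (4 * pi * eps0) = 1 are assumed.
    """
    theta = np.linspace(0.0, np.pi, n_surface)
    surface = np.column_stack([np.sin(theta), np.cos(theta)])
    z_q = np.linspace(-0.6, 0.6, n_charges)
    charges = np.column_stack([np.zeros(n_charges), z_q])
    r = np.linalg.norm(surface[:, None, :] - charges[None, :, :], axis=2)
    return 1.0 / r

def gradient_descent(x, v, n_iterations):
    """Batch delta rule from q_i = 0 with a step size below the stability limit."""
    gram = x.T @ x / len(v)
    lr = 0.5 / np.linalg.eigvalsh(gram).max()
    w = np.zeros(x.shape[1])
    for _ in range(n_iterations):
        w += lr * x.T @ (v - x @ w) / len(v)
    return w

def rms_error(x, w, v):
    return np.sqrt(np.mean((x @ w - v) ** 2))

x = potential_matrix(n_charges=8)
v = np.ones(x.shape[0])                      # surface held at V0 = 1

q_lsm, *_ = np.linalg.lstsq(x, v, rcond=None)
norm_lsm, rms_lsm = np.linalg.norm(q_lsm), rms_error(x, q_lsm, v)

results = {}                                 # n_it -> (charge norm, rms error)
for n_it in (100, 1_000, 10_000):
    q = gradient_descent(x, v, n_it)
    results[n_it] = (np.linalg.norm(q), rms_error(x, q, v))
    print(f"N_it={n_it:6d}  |q|={results[n_it][0]:.4g}  rms={results[n_it][1]:.3g}")
print(f"LSM          |q|={norm_lsm:.4g}  rms={rms_lsm:.3g}")
```

With this step size, each additional iteration can only lower the fitting error and raise the charge norm toward the least-squares values, so N_it acts as the knob that trades accuracy at the surface against the size of the charges.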

Table 1 Coefficients obtained by the ANN and LSM algorithms for an EFM system

Results and discussion

Thin film dielectric constant estimation

In this section, we use the GICM force vs. tip-sample distance (F vs. D) curves for an EFM tip over a thin film to train an ANN that is able to estimate the dielectric constant ϵ₁ of a thin film with thickness h₁ (see Figure 1b). The thin film is placed over a semi-infinite dielectric substrate characterized by its dielectric constant ϵ₂. It is worth noting that in realistic systems where h₁ is very small, ϵ₁ should be considered an effective [6] dielectric constant since several nanoscale effects can modify the response of the thin material and change the ϵ₁ value. Some examples of physical phenomena that could affect ϵ₁ are the roughness of the thin film surface, the presence of a water layer over the thin film [5], or the finite amount of free charge due to the small size of the film. The ANN architecture is shown in Figure 3. We used a multilayer perceptron with sigmoid activation functions. The input layer is composed of 20 neurons for the F vs. D curves, which are calculated for D = {2.5, 5,…,50} nm. Additional neurons are added in the cases where ϵ₂ and h₁ are included as input values. We used two hidden layers composed of 10 neurons with no bias applied. The output layer contains a single neuron which provides the estimated values for ϵ₁. We have considered three different inputs: the F vs. D curves, h₁, and ϵ₂. The GICM F vs. D curves included in the training were calculated for ϵ₁ = {5, 15, 25,…,105}, ϵ₂ = {5, 15, 25,…,105}, and h₁ = {1, 2,…,10}. The ANN has been tested with 100 F vs. D curves (not used during the training) with randomly selected ϵ₁, ϵ₂ (between 5 and 105), and h₁ (between 1 and 10) values. As we can see in Figure 4a, although the ANN is able to estimate ϵ₁ when F vs. D and h₁ (excluding ϵ₂) are used as input parameters, it gives the best results when all the input parameters are included. In Figure 4b, we show the error obtained by the ANN when all the inputs are included. The error is always smaller than 9% for all the ϵ₁ values.
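The architecture and training grid described above can be sketched as follows. Several details are assumptions made only for illustration: each hidden layer is taken to have 10 neurons, the output neuron is given a sigmoid whose value is rescaled to the range 5 to 105, the full grid of (ϵ₁, ϵ₂, h₁) combinations is assumed to be used, and the weights and the force curve are random placeholders rather than trained values or GICM output. In practice the inputs would also need to be normalized before being fed to the network.

```python
import numpy as np

rng = np.random.default_rng(0)

# Sampling grids quoted in the text.
distances_nm = 2.5 * np.arange(1, 21)     # D = 2.5, 5, ..., 50 nm (20 points)
eps1_grid = np.arange(5, 106, 10)         # 5, 15, ..., 105
eps2_grid = np.arange(5, 106, 10)         # 5, 15, ..., 105
h1_grid = np.arange(1, 11)                # 1, 2, ..., 10

# Assumption: every combination of the three grids is one training curve.
n_training_curves = len(eps1_grid) * len(eps2_grid) * len(h1_grid)

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def init_weights(n_inputs, n_hidden=10, scale=0.1):
    """Random placeholder weights for an n_inputs-10-10-1 perceptron without biases."""
    sizes = [n_inputs, n_hidden, n_hidden, 1]
    return [rng.normal(0.0, scale, size=(m, n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def estimate_eps1(weights, force_curve, h1=None, eps2=None,
                  eps_min=5.0, eps_max=105.0):
    """Forward pass; h1 and eps2 are appended as extra input neurons if given."""
    extras = [value for value in (h1, eps2) if value is not None]
    a = np.concatenate([force_curve, extras])
    for w in weights:
        a = sigmoid(a @ w)                # no bias term, as stated in the text
    # Assumption: the sigmoid output in (0, 1) is mapped linearly to eps1.
    return eps_min + (eps_max - eps_min) * a[0]

# Placeholder curve standing in for a GICM F vs. D curve.
force_curve = rng.random(len(distances_nm))

weights_full = init_weights(n_inputs=len(distances_nm) + 2)
eps1_estimate = estimate_eps1(weights_full, force_curve, h1=3.0, eps2=45.0)
print("training curves:", n_training_curves)
print("untrained estimate of eps1:", eps1_estimate)
```

Dropping `h1` or `eps2` from the call (and shrinking `n_inputs` accordingly) gives the reduced-input variants compared in Figure 4a.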

The ANN can be used with realistic experimental curves without any previous treatment, which is one of the advantages of using this technique [11]. In this case, experimental curves with a high error could make the ANN give wrong ϵ₁ estimations. This problem can be easily solved by training the ANN with a mixture of experimental and numerical F vs. D curves. This strategy would make the ANN more robust against experimental noise (by the use of experimental curves) and still effective on the ϵ₁ estimations (by the use of a whole set of numerical curves).

Recently, a simple analytical expression has been developed that demonstrates that a sample composed of a thin film over a dielectric substrate gives the same response as a semi-infinite uniform dielectric sample [18]. The fact that different combinations of ϵ₁, ϵ₂, and h₁ can correspond to the same effective dielectric constant is in agreement with the results found in Figure 4a since including ϵ₂ and h₁ as input values improves the ANN performance in the ϵ₁ estimations.

Conclusions

We have demonstrated that ANNs can strongly improve the efficiency of the characterization of samples by electrostatic force microscopy. First, we have demonstrated that the generalized image charge method can be modified to use a neural network minimization algorithm. Using this technique, we have increased the accuracy of the electrostatic force and capacitance calculations. By using electrostatic force simulations, we have been able to train an ANN to estimate the dielectric constant of thin films. The analysis of the results of the ANN suggests that the thin film dielectric constant can only be obtained when the thin film thickness and the dielectric nature of the sample are known. Note that the methods explained in this paper can be easily applied to experimental data by providing this kind of input to the ANN. If enough data are available, experimental curves can be used for the ANN training alone or together with theoretical curves.

Abbreviations

ANNs: Artificial neural networks
EFM: Electrostatic force microscopy
F vs. D: Force vs. tip-sample distance
GICM: Generalized image charge method
LSM: Least-squares minimization

References

Kalinin SV, Jesse S, Rodriguez BJ, Eliseev EA, Gopalan V, Morozovska AN: Quantitative determination of tip parameters in piezoresponse force microscopy. Appl Phys Lett 2007, 90: 212905. 10.1063/1.2742900
Lyuksyutov SF, Vaia RA, Paramonov PB, Juhl S, Waterhouse L, Ralich RM, Sigalov G, Sancaktar E: Electrostatic nanolithography in polymers using atomic force microscopy. Nat Mater 2003, 2: 468–472. 10.1038/nmat926
Guriyanova S, Golovko DS, Bonaccurso E: Cantilever contribution to the total electrostatic force measured with the atomic force microscope. Measurement Science & Technology 2010, 21: 025502. 10.1088/0957-0233/21/2/025502
Palacios-Lidon E, Abellan J, Colchero J, Munuera C, Ocal C: Quantitative electrostatic force microscopy on heterogeneous nanoscale samples. Appl Phys Lett 2005, 87: 154106. 10.1063/1.2099527
Hu J, Xiao XD, Salmeron M: Scanning polarization force microscopy - a technique for imaging liquids and weakly adsorbed layers. Appl Phys Lett 1995, 67: 476–478. 10.1063/1.114541
Morozovska AN, Eliseev EA, Kalinin SV: The piezoresponse force microscopy of surface layers and thin films: effective response and resolution function. J Appl Phys 2007, 102: 074105. 10.1063/1.2785824
Butt HJ, Cappella B, Kappl M: Force measurements with the atomic force microscope: technique, interpretation and applications. Surf Sci Rep 2005, 59: 1–152. 10.1016/j.surfrep.2005.08.003
Sacha GM, Gomez-Navarro C, Saenz JJ, Gomez-Herrero J: Quantitative theory for the imaging of conducting objects in electrostatic force microscopy. Appl Phys Lett 2006, 89: 173122. 10.1063/1.2364862
Sacha GM, Saenz JJ: Cantilever effects on electrostatic force gradient microscopy. Appl Phys Lett 2004, 85: 2610–2612. 10.1063/1.1797539
Sacha GM, Verdaguer A, Martinez J, Saenz JJ, Ogletree DF, Salmeron M: Effective tip radius in electrostatic force microscopy. Appl Phys Lett 2005, 86: 123101. 10.1063/1.1884764
Sacha GM, Rodriguez FB, Varona P: An inverse problem solution for undetermined electrostatic force microscopy setups using neural networks. Nanotechnology 2009, 20: 085702. 10.1088/0957-4484/20/8/085702
Haykin S: Feedforward Neural Networks: An Introduction. Prentice-Hall, Englewood; 1999.
Hänninen JJ, Lindell IV, Nikoskinen KI: Electrostatic image theory for an anisotropic boundary of an anisotropic half-space. Progress in Electromagnetics Research-Pier 2004, 47: 236–262.
Sacha GM, Sahagun E, Saenz JJ: A method for calculating capacitances and electrostatic forces in atomic force microscopy. J Appl Phys 2007, 101: 024310. 10.1063/1.2424524
Sacha GM: Página de Sacha. [www.ii.uam.es/~sacha]
Tetko IV, Livingstone DJ, Luik AI: Neural-network studies.1.Comparison of overfitting and overtraining. Journal of Chemical Information and Computer Sciences 1995, 35: 826–833. 10.1021/ci00027a006
Sacha GM, Rodriguez FB, Serrano E, Varona P: Generalized image charge method to calculate electrostatic magnitudes at the nanoscale powered by artificial neural networks. Journal of Electromagnetic Waves and Applications 2010, 24: 1145–1155. 10.1163/156939310791586160
Castellano-Hernández E, Sacha GM: Ultrahigh dielectric constant of thin films obtained by electrostatic force microscopy and artificial neural networks. Applied Physics Letters 2012, 100: 023101. 10.1063/1.3675446

Acknowledgments

This work was supported by TIN2010-19607 and BFU2009-08473. GMS acknowledges support from the Spanish Ramón y Cajal Program.

Author information

Authors and Affiliations

Grupo de Neurocomputación Biológica, Departamento de Ingeniería Informática, Escuela Politécnica Superior, Universidad Autónoma de Madrid, Campus de Cantoblanco, Madrid, 28049, Spain
Elena Castellano-Hernández, Francisco B Rodríguez, Eduardo Serrano, Pablo Varona & Gomez Monivas Sacha

Corresponding author

Correspondence to Gomez Monivas Sacha.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

ECH carried out the numerical simulations. FBR and ES participated in the design of the artificial neural networks and mathematical formalism. PV participated in the design of the artificial neural networks and helped draft the manuscript. GMS conceived the study, participated in its design and coordination, and drafted the manuscript. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Castellano-Hernández, E., Rodríguez, F.B., Serrano, E. et al. The use of artificial neural networks in electrostatic force microscopy. Nanoscale Res Lett 7, 250 (2012). https://doi.org/10.1186/1556-276X-7-250

Received: 06 January 2012
Accepted: 15 May 2012
Published: 15 May 2012
DOI: https://doi.org/10.1186/1556-276X-7-250