1 Introduction

Numerical methods allow finding solutions of non-linear equations that cannot be solved by algebraic procedures. The development and improvement of these methods and their behaviour constitute a field of intense research and a vast literature related to this topic can be found (see, for example [21, 24, 25, 28] [26] and references therein).

Iterative methods solve non-linear equations by generating successive approximations that may eventually converge to the solution. The so-called one-point-methods (see [28]) start with an initial guess of the solution to proceed iteratively so that each approximation is used to obtain the next one, until a desired level of convergence is reached. That is, an approximation of the solution x of an equation f(x) = 0 can be found by applying an iterative scheme of the form:

$$ x_{k+1}=\phi (x_{k}), \ \ k=0,1,2,... $$

where xk is an approximation of x and ϕ is the iteration operator. The function ϕ may depend on derivatives of f in order to increase the order of convergence.

The simplest and most popular method using this scheme is the well-known Newton’s method, also known as Newton–Raphson’s method (see [4]), given by:

$$ x_{k+1}=N_{f}(x_{k}):=x_{k}-\frac{f(x_{k})}{f^{\prime}(x_{k})}. $$

To improve the efficiency of one-point methods, multi-point methods were introduced. For them, each step is not exclusively based on the previous iteration, but also includes information of intermediate evaluations. These schemes, mainly variants of Newton’s method, have recently gained interest because they provide root-finding algorithms which improve both convergence order and computational efficiency in comparison with one-point ones. Such advantages allow to optimise computational resources. However, the radii of convergence which ensure that the solution of the method is correct decrease when the order of the method increases (see [23], for example).

One way to assess the balance between benefits and drawbacks is to study the dynamical behaviour of these methods. By considering a discrete dynamical system and performing a qualitative study of it, we can obtain dynamical spaces that identify the regions where the method displays good behaviour. Moreover, if the method has parameter dependence constituting a family of methods, we can plot the corresponding parameter spaces to find the members of the family with better behaviour (see [6, 14], for example).

The first works in this sense ([19], for example) show the dynamical planes of the best known numerical methods acting on polynomials of degree two or three. In [14], the authors began the dynamical study of the one-parameter family called Chebyshev-Halley.

In the present paper, we study multipoint Newton-like methods, that is, methods where the intermediate evaluations are variations of Newton’s scheme.

As can be observed in the literature (see [2, 5, 7, 8, 12,13,14, 20, 30], for example), when Newton-like algorithms are applied to the quadratic polynomials p(z) = z2c, the operator obtained is conjugated (via a Möbius map that sends the \(\pm \sqrt {c}\) to 0 and \(\infty \)) with a rational map which has the following generic expression:

$$ O(z)=z^{n} \frac{a_{k}+a_{k-1} z+...+a_{1} z^{k-1}+ z^{k}}{1+a_{1} z+...+a_{k-1} z^{k-1}+a_{k} z^{k}}. $$
(1)

The main goal of this paper is to justify why this kind of operator is obtained regardless of the Newton-like algorithm used. This is done in Theorem 2.7. This theorem follows from a more general symmetry property of the iterative schemes obtained from Newton-like algorithms applied to the degree d polynomials p(z) = zdc (Theorem 2.2). The symmetry was noted by Chun et al. [11], but they did not draw conclusions.

The idea of how to justify that operator (1) is obtained when applying Newton-like algorithms to p(z) = z2c is the following. In Theorem 2.2, we describe a recurrent procedure which can be used to describe the operators obtained from Newton-like root-finding algorithms when applied to the degree d polynomials p(z) = zdc. Moreover, we prove that the maps obtained from Theorem 2.2 are symmetric with respect to rotation by a d th root of the unity. By restricting to d = 2 and applying a conjugacy, we conclude (Theorem 2.7) that the operators obtained for the different families of Newton-like methods when they are applied on quadratic polynomials are given by (1).

In Section 3, we carry out a dynamical study of the operator (1) in order to obtain generic conclusions of this type of methods. In Proposition 3.4, we show that the points z = 0, \(z=\infty \) and z = 1 are always fixed points of this operator and z = − 1 is also a fixed point if n + k is odd. The points z = 0 and \(z=\infty \) correspond to the zeros of the quadratic polynomial, which are superattracting fixed points of local degree n (the numerical methods have order of convergence n to the roots). In order to establish the existence of stable behaviour other than the basins of attraction of the roots, we also study the sets of parameters where z = 1 and z = − 1 are attracting fixed points (Propositions 3.5 and 3.6). This study is done under the extra hypothesis that the relation between the coefficients of the rational function and the parameter of the family is linear, which is a common phenomenon in the literature.

Section 4 is devoted to display some known examples of Newton-like algorithms that correspond to families with one free critical point, which allows us to draw the parameter planes of the families. The operators of this section follow (1). It is also observed that, under given conditions, two different methods whose operators, expressed as (1), have the same values of n and k present similar dynamics (Section 4.1). Moreover, in Section 4.3, we study the particular case in which n = k and all the coefficients of the rational function, except ak, are real.

Finally, we want to emphasise that the functions introduced in Theorem 2.2 can be used to generate new Newton-type algorithms for solving nonlinear equations. In fact, in Section 2, we show that the algorithms of the most usual methods, such as Newton, Traub, Halley, Chebyshev and Jarratt, are obtained from the functions introduced in Theorem 2.2. Although we do not intend to carry out an exhaustive verification, it is easy to extend this assertion to other Newton-like algorithms.

In the construction of new Newton-type algorithms, parameters must be introduced in the iterative procedure applied to functions given in Theorem 2.2. These parameters must be adjusted so that the method obtained has the desired order of convergence.

We also want to emphasise that the main goal of this paper is to study general properties of Newton-like algorithms and to make a qualitative study of them. The construction of methods using the iterative procedure provided by Theorem 2.2 is the aim of a coming paper.

For the sake of completeness, in Section 1.1, we present some of the best known Newton-like methods of different degree and in Section 1.2, we recall some basic concepts of complex dynamics.

1.1 Newton-like methods

As mentioned before, multi-point iteration methods were introduced to improve the order of convergence and the efficiency of iterative methods and they are described by means of the expressions w1(xk),w2(xk),...,wn(xk). The iteration function ϕ, defined as

$$ x_{k+1}=\phi \left( x_{k},w_{1}(x_{k}),w_{2}(x_{k}),...,w_{n}(x_{k})\right) $$

is called a multi-point iteration function without memory.

The simplest examples are Steffensen’s method, with w1(xk) = xk + f(xk):

$$ x_{k+1}=x_{k}-\frac{ f(x_{k})^{2}}{f(x_{k}+f(x_{k}))-f(x_{k})} $$

and Traub-Steffensen’s method, with w1(xk) = xk + γf(xk):

$$ x_{k+1}=S(x_{k}):=x_{k}-\frac{\gamma f(x_{k})^{2}}{f(x_{k}+\gamma f(x_{k}))-f(x_{k})}. $$

There are many multi-point methods and we do not pretend here to do an exhaustive study of them; we only point out those that we consider best known. A more comprehensive study can be seen in [24, 25] and [28], for example.

Following the summary provided in [26], different two-step methods can be built by using Newton’s method as pre-conditioner, such as Traub’s scheme [28]:

$$ \begin{array}{@{}rcl@{}} y_{k} &=& x_{k}-\frac{f(x_{k})}{f^{\prime}(x_{k})} \\ x_{k+1} &=& y_{k} - \frac{f(y_{k})}{f^{\prime}(x_{k})} \end{array} $$

or Ostrowski’s scheme [24]:

$$ \begin{array}{@{}rcl@{}} y_{k} &=& x_{k}- \frac{f(x_{k})}{f^{\prime}(x_{k})} \\ x_{k+1} &=& y_{k}-\frac{f(y_{k})}{f^{\prime}(x_{k})} \frac{f(x_{k})}{f(x_{k})-2f(y_{k})}. \end{array} $$

The latest was generalised by King [18], who defined the scheme:

$$ \begin{array}{@{}rcl@{}} y_{k} &=& x_{k}- \frac{f(x_{k})}{f^{\prime}(x_{k})}\\ x_{k+1} &=& y_{k}-\frac{f(y_{k})}{f^{\prime}(x_{k})} \frac{f(x_{k})+\beta f(y_{k})}{f(x_{k})+(\beta-2)f(y_{k})}, \end{array} $$
(2)

obtaining a family of iterative methods depending on one parameter. It was shown by Chun et al. [11] that the best parameter is β = 0.

If instead of using the Newton’s method as the first step, the so-called Jarratt method step is used, a method of order of convergence four is obtained. The resulting scheme is called Jarratt method [16]:

$$ \begin{array}{@{}rcl@{}} y_{n} &=& x_{n}- \frac{2}{3} \frac{f(x_{n})}{f^{\prime}(x_{n})} \\ x_{n+1} &=& x_{n}- J_{f}(x_{n}) \frac{f(x_{n})}{f^{\prime}(x_{n})} \end{array} $$
(3)

where:

$$ J_{f}(x_{n}) = \frac{3f^{\prime}(y_{n})+f^{\prime}(x_{n})}{2\left( 3f^{\prime}(y_{n})-f^{\prime}(x_{n})\right)}. $$

From the two-step methods, other three-step methods can be obtained, with general scheme:

$$ \begin{array}{@{}rcl@{}} y_{k} &=& x_{k}-\frac{f(x_{k})}{f^{\prime}(x_{k})} \\ z_{k} &=& \phi (x_{k},y_{k}) \\ x_{k+1} &=& z_{k}-\frac{f(z_{k})}{f^{\prime}(z_{k})}. \end{array} $$

Some examples of schemes obtained using this procedure are the sixth-order convergence method deduced by Wang et al. [29],

$$ \begin{array}{@{}rcl@{}} x_{n+1} &=&x_{n}-J_{f}(x_{n}) \frac{f(x_{n})}{f^{\prime}(x_{n})}\\ y_{n} &=& x_{n}- \frac{2}{3} \frac{f(x_{n})}{f^{\prime}(x_{n})} \\ J_{f}(x_{n}) &=& \frac{3f^{\prime}(y_{n})+f^{\prime}(x_{n})}{6f^{\prime}(y_{n})-2f^{\prime}(x_{n})}, \end{array} $$

the fourth-order family of iterative method introduced by Amat et al. in [1],

$$ \begin{array}{@{}rcl@{}} u_{f}(z)&=& \frac{f(x_{n})}{f^{\prime}(x_{n})} \\ h_{f}(z) &=& \frac{f^{\prime}\left( z-\frac{2}{3}u_{f}(z)\right)-f^{\prime}(z)}{f^{\prime}(z)}\\ J(z) &=& z - u_{f}(z)+\frac{3}{4}u_{f}(z)h_{f}(z) \frac{1+\beta h_{f}(z)}{1+\left( \frac{3}{2}+\beta\right)h_{f}(z)}, \end{array} $$
(4)

and the family of sixth-order methods defined by Chun [10],

$$ \begin{array}{@{}rcl@{}} y_{n} &=& x_{n}- \frac{2}{3} \frac{f(x_{n})}{f^{\prime}(x_{n})} \\ z_{n} &=&x_{n}-J_{f}(x_{n}) \frac{f(x_{n})}{f^{\prime}(x_{n})}\\ x_{n+1} &=& z_{n} - \frac{f(z_{n})}{\alpha (z_{n}-x_{n})(z_{n}-y_{n})+\frac{3}{2}J_{f}(x_{n}) f^{\prime}(y_{n})+\left( 1-\frac{3}{2}J_{f}(x_{n})\right)f^{\prime}(x_{n})}. \end{array} $$

One of the best known Newton-like method corresponds to the one-parameter family called Chebyshev-Halley.

$$ \begin{array}{@{}rcl@{}} y_{k} &=& x_{k}-\frac{f(x_{k})}{f^{\prime}(x_{k})} \\ L_{f}(x_{k}) &=& \frac{f(x_{k}) f^{\prime\prime}(x_{k})}{(f^{\prime}(x_{k}))^{2}}\\ x_{k+1} &=& y_{k}-\frac{1}{2}\frac{L_{f}(x_{k})}{1-\alpha L_{f}(x_{k})}\frac{f(x_{k})}{f^{\prime}(x_{k})}. \end{array} $$
(5)

This family includes the Chebyshev’s method when the parameter α is equal to 0, Halley’s scheme for α = 1/2 and Newton’s method when α tends to \( \infty \). In [14], the authors began the dynamical study of this family applied on arbitrary polynomials of degree two.

This type of dynamical study has been extended to other families of numerical methods, such as the King family [12] and the c-family [7]. Nowadays, there is a wide literature expanding this study to methods with higher order of convergence (see, for example [2, 11, 17] and references therein). In these papers, the dynamical behaviour of these families applied on quadratic polynomials is considered. Dynamical studies of methods, or families of numerical methods, applied on polynomials of higher degree can be found (see [5, 6, 15], for example).

1.2 Basic concepts of complex dynamics

Before starting the dynamical study of these operators, we briefly recall the basic concepts of complex (as opposed to real) dynamics that we use in this paper. For a more detailed introduction to the topic of complex dynamics, we refer to [22].

Given a rational map \(R:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\), where \(\widehat {\mathbb {C}}\) denotes the Riemann sphere, we consider the dynamical system given by the iterates of R. We say that a point \(z_{0}\in \widehat {\mathbb {C}}\) is fixed if R(z0) = z0 and periodic of period p if Rp(z0) = z0, where p is minimal. The multiplier of a fixed point is given by \(\lambda (z_{0})=R^{\prime }(z_{0})\). Analogously, the multiplier of a periodic point is given by \(\lambda (z_{0})=(R^{p})'(z_{0})=R^{\prime }(z_{0})\cdot R^{\prime }(R(z))\cdot \ldots \cdot R^{\prime }(R^{p-1}(z))\). A fixed or periodic point z0 is called attracting if |λ(z0)| < 1 (superattracting if |λ(z0)| = 0), repelling if |λ(z0)| > 1 and indifferent if |λ(z0)| = 1. An indifferent point is called parabolic if λ(z0) = e2πip/q, where \(p,q\in \mathbb {N}\). All attracting and parabolic fixed points have a basin of attraction associated to them which consists of the set of points which converge to z0 under iteration of R. Analogously, every attracting or parabolic periodic point z0 has a basin of attraction associated to it which consists of the set of points which converge to the cycle 〈z0〉 = {z0,R(z0),⋯ ,Rp− 1(z0)}.

The dynamics of R provides a totally invariant partition of the Riemann sphere. The Fatou set \(\mathcal {F}(R)\) consists of the set of points \(z\in \widehat {\mathbb {C}}\) for which the family of iterates of R(z), {R(z),R2(z),…,Rn(z),…}, is normal (or equivalently equicontinuous) in some open neighbourhood of z. The Fatou set is open and consists of the set of points for which the dynamics presents stable behaviour. Its complement, the Julia set \(\mathcal {J}(f)\), is closed and consists of the set of points which present chaotic behaviour. The connected components of the Fatou set, called Fatou components, are mapped amongst themselves under iteration of R. In view of Sullivan’s No-Wandering Theorem (see [27]), all Fatou components are either periodic or preperiodic. Every periodic Fatou component either belongs to the basin of attraction of an attracting or parabolic point, or is a simply connected rotation domain (a Siegel disk), or is a doubly connected rotation domain (a Herman ring). Moreover, periodic Fatou components of rational maps can be related to critical points, i.e. points where \(R^{\prime }(z)=0\). Indeed, every cycle of attracting or parabolic Fatou components contains, at least, a critical point. On the other hand, Siegel disks and Herman rings have critical points whose orbits accumulate on their boundaries.

Along the paper, we use these concepts to draw both dynamical and parameter planes. Dynamical planes show the dynamics of points in a given range. They are drawn by creating a grid of points in the specified range. Afterwards, the point is iterated up to 150 times. If the point converges to a root (the distance to the root is smaller than 10− 4), the iteration stops and we use a scaling from red (fast convergence), to yellow, to green, to blue and to grey (slow convergence) to draw the point. If after 150 iterates the point has not converged to a root, the point is plotted in black.

When a family depends on parameters, it also makes sense to draw parameter planes. Parameter planes describe the possible dynamics of a map depending on the parameter. Since all Fatou components of rational maps are related to critical points, in order to draw parameter planes, it is enough to study the orbits of the critical points, the critical orbits. In this paper, we restrict our drawings to families which only have one free critical orbit, i.e. an orbit of a critical point other than the superattracting fixed points of the method, up to symmetry (compare Section 2). Therefore, in order to draw a parameter plane, we restrict to a grid of points in a given range of parameters and iterate a free critical point. If the orbit of the critical point converges to a root (the distance to the root is smaller than 10− 4), we conclude that there can be no stable behaviour other than convergence to the roots. In that case, the iteration stops and we use a scaling from red (fast convergence), to yellow, to green, to blue and to grey (slow convergence) to draw the point. If after 150 iterates the point has not converged to a root, the point is plotted in black. These points correspond to parameters for which there might be other stable behaviour than convergence to the roots.

2 Symmetries of operators

Symmetries in the dynamical planes of Newton-like root-finding algorithms applied on polynomials zdc have been observed for different families (see, e.g. [6] and [9]). In this section, we prove the need for this symmetry to appear and use it to obtain the operator that appears by applying Newton-like root-finding algorithms on degree 2 polynomials. We start studying the families of maps presenting these symmetry properties and we show that these maps which can be used to write the expressions of Newton-like root-finding algorithms. First, we introduce the concepts of λd-odd and λd-even.

Definition 2.1

Let d ≥ 2. We say that a map \(f:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\) is λd-odd if f(λz) = λf(z) for all \(\lambda \in \mathbb {C}\) such that λd = 1 and all \(z\in \mathbb {C}\). Respectively, we say that f is λd-even if f(λz) = f(z).

Notice that λ2-odd and λ2-even maps are, precisely, odd and even maps (the only second roots of the unity are 1 and -1). The dynamics of a λd-odd map is symmetric with respect to rotation by d th roots of the unity. Indeed, λd-odd maps are conjugated with themselves by multiplication with a d th-root of the unity: f(z) = λ− 1f(λz). We refer to this property as the symmetry of λd-odd maps. This symmetry is relevant since it allows us to decrease the degrees of freedom of λd-odd maps. It is easy to show that if \(\kappa \neq 0,\infty \) is a critical point of a λd-odd map f, then the points λdκ, where λd = 1, are also critical points of f. Moreover, their dynamics is tied by the symmetry, so it is enough to control the dynamics of a single critical orbit to know the asymptotic behaviour of d different critical points. The next proposition shows different ways in which we can use the polynomials p(z) = zdc, \(c\in \mathbb {C}\setminus \{0\}\), to obtain λd-odd maps.

Theorem 2.2

Let p(z) = zdc, where d ≥ 2, \(c\in \mathbb {C}\setminus \{0\}\). Let \(g,h:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\) be λd-odd maps and let \(H:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\) be a λd-even map. Then, the following maps are λd-odd:

  1. i)

    (the identity) f(z) = z;

  2. ii)

    (the linear combination) f(z) = ag(z) + bh(z), where \(a,b\in \mathbb {C}\);

  3. iii)

    (the composition) f(z) = g(h(z));

  4. iv)

    \( f(z)=\frac {p(g(z))}{p^{\prime }(h(z))}\);

  5. v)

    f(z) = h(z) ⋅ H(z). In particular, H(z) can be obtained as follows:

    1. v.1)

      \(H(z)={\prod }_{i=1}^{k} \frac { p^{(n_{i})}\left (g_{i}(z)\right )}{ p^{(m_{i})}\left (h_{i}(z)\right )}\) is λd-even, where p(n) denotes the n th derivative of p, k ≥ 1, 0 ≤ ni,mid, \({\sum }_{i=1}^{k} m_{i}={\sum }_{i=1}^{k} n_{i}\) and \(g_{i},h_{i}:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\) are λd-odd maps.

    2. v.2)

      \(H(z)=\frac { \sum \limits _{i=1}^{k} a_{i} \cdot p^{(n)}\left (g_{i}(z)\right ) }{ \sum \limits _{i=1}^{\ell } b_{i}\cdot p^{(n)}\left (h_{i}(z) \right )}\), where n ≥ 0, p(n) denotes the n th derivative of p, k, ≥ 1, \(a_{i},b_{i}\in \mathbb {C}\setminus \{0\}\) and \(g_{i},h_{i}:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\) are λd-odd maps.

    3. v.3)

      H(z) = a1H1(z) + a2H2(z), where \(H_{1}, H_{2}:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\) are λd-even and \(a_{1},a_{2}\in \mathbb {C}\).

    4. v.4)

      H(z) = H1(z)/H2(z), where \(H_{1},H_{2}:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\) are λd-even.

Proof

The cases i), ii) and iii) are straightforward. We prove now case iv). Let \(f(z)=p(g(z))/p^{\prime }(h(z))\). Using that λd = 1 and λd− 1 = λ− 1, we have

$$ \begin{array}{@{}rcl@{}} f(\lambda z)&=&\frac{p(g(\lambda z))}{p^{\prime}(h(\lambda z))}=\frac{p(\lambda g(z))}{p^{\prime}(\lambda h(z))}= \frac{(\lambda g(z))^{d}-c}{d (\lambda h(z))^{d-1}}= \frac{\lambda^{d} g(z)^{d}-c}{d \cdot \lambda^{d-1} h(z)^{d-1}}=\frac{ g(z)^{d}-c}{d \cdot \lambda^{-1} h(z)^{d-1}}\\ &=&\lambda \frac{ g(z)^{d}-c}{d\cdot h(z)^{d-1}}=\lambda f(z). \end{array} $$

For v), the fact that f(z) = h(z) ⋅ H(z) is λd-odd follows directly from the definition of λd-odd and λd-even. We have to see that the proposed functions H(z) are λd-even. We start with

$$ \prod\limits_{i=1}^{k} \frac{p^{(n_{i})}\left( g_{i}(z)\right)}{p^{(m_{i})}\left( h_{i}(z)\right)}. $$

Notice that if ni = 0 (or mi = 0) then

$$ p^{(0)}\left( \lambda g_{i}(z)\right)= \left( \lambda g_{i}(z)\right)^{d}-c= p^{(0)}(g_{i}(z)). $$

If 0 < nid, then

$$ p^{(n_{i})}\left( \lambda g_{i}(z)\right)=\frac{d!}{(d-n_{i})!}(\lambda g_{i}(z))^{d-n_{i}}= \lambda^{-n_{i}}p^{(n_{i})}\left( g_{i}(z)\right). $$

Analogously, if 0 < mid, then

$$ p^{(m_{i})}\left( \lambda g_{i}(z)\right)= \lambda^{-m_{i}}p^{(m_{i})}\left( g_{i}(z)\right). $$

The same equalities work if we replace gi by hi. Then, the fact that H is λd-even follows then easily from \({\sum }_{i=1}^{k} m_{i}={\sum }_{i=1}^{k} n_{i}\).

Similarly, the fact that the function H in v.2) is λd-even holds easily using that \(p^{(n)}\left (\lambda g_{i}(z)\right )= \lambda ^{-n}p^{(n)}\left (g_{i}(z)\right )\). The cases v.3) and v.4) follow directly from the definition of λd-even. □

Remark 2.3

An important property of the maps constructed in Theorem 2.2 is that they can be used to obtain new Newton-like algorithms.

Another important feature of the maps constructed in the previous Theorem is that they generally have \(z=\infty \) as a simple fixed point: f is a rational map and the degree of its numerator equals the degree of its denominator plus 1. In the next proposition, we describe how this property is preserved under the different constructions in Theorem 2.2. The proof is straightforward by computing the degree of the numerator and the denominator of the resulting maps.

Proposition 2.4

Let p(z) = zdc, where d ≥ 2, \(c\in \mathbb {C}\setminus \{0\}\). Let \(g,h:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\) be two maps that have \(z=\infty \) as simple fixed point. Then, the maps f(z) = g(h(z)) and \( f(z)=\frac {p(g(z))}{p^{\prime }(h(z))}\) have \(z=\infty \) as simple fixed point. Moreover:

  1. i.)

    The map f(z) = ag(z) + bh(z), where \(a,b\in \mathbb {C}\), has \(z=\infty \) as simple fixed point unless the linear combinations cancel out the highest order term of the numerator.

  2. ii.)

    The map f(z) = h(z) ⋅ H(z), where \(H:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\), has \(z=\infty \) as simple fixed point provided that H(z) is a rational map whose numerator and the denominator have the same degree.

Remark 2.5

We can analyse whether the different options for H in Theorem 2.2 v) satisfy the conditions of Proposition 2.4 ii). The numerator and denominator of the map H obtained from v.1) have the same degree. The map obtained from v.2) also satisfies the property provided that the lineal combinations do not decrease the degree at the numerator or the denominator. Analogously, the map H in v.3) satisfies the property provided that H1 and H2 also do and there are no simplifications of the higher order term at the numerator. Finally, the numerator and denominator of the map H in v.4) have the same degree provided that H1 and H2 also do.

Since the base map used in Theorem 2.2, f(z) = z (the identity map), has \(z=\infty \) as simple fixed point, we can conclude that the maps obtained from Theorem 2.2 have \(z=\infty \) as simple fixed point provided that the linear combinations given by ii), v.2) and v.3) keep the maximal degree. Otherwise, the degree of \(z=\infty \) as a fixed point may increase (it becomes a superattracting fixed point) or decrease (\(z=\infty \) is no longer a fixed point).

As it has been stated, the different steps in Theorem 2.2 are relevant for root-finding algorithms since they can be used to obtain many different methods. For instance:

  • Newton’s method \(N_{p}(z)=z-p(z)/p^{\prime }(z)\) can be obtained from i), ii) and iv);

  • Traub’s method \(T_{p}(z)=N_{p}(z)-p\left (N_{p}(z)\right )/p^{\prime }(z)\) can be obtained from i), ii) and iv);

  • Chebyshev’s method \(C_{p}(z)=z-\left (1+\frac {1}{2}L(z)\right )p(z)/p^{\prime }(z)\), where \(L(z)=p(z)p^{\prime \prime }(z)/p^{\prime }(z)^{2}\), can be obtained from i), ii), iv) and v.1).

  • Halley’s method \(H_{p}(z)=z-L(z)p(z)/p^{\prime }(z)\), where \(L(z)=\left (1-\frac {p(z)\cdot p^{\prime \prime }(z)}{2p^{\prime }(z)^{2}}\right )^{-1}\), can be obtained from i), ii), iv) and v) (L(z) is obtained combining v.1), v.3) and v.4)).

  • Jarratt’s method (see (3)) can be obtained from i), ii), iii), iv) and v.2).

Let f be a map which is obtained by applying a root finding algorithm that can be built using Theorem 2.2 with p(z) = z2c. Then, f is odd (by definition λ2-odd maps are odd) and \(f(\pm \sqrt {c})=\pm \sqrt {c}\) (the roots need to be attracting fixed points of f ). Following Proposition 2.4, we may also assume that \(f(\infty )=\infty \). By taking a conjugation τ which sends \(\sqrt {c}\) to \(\infty \), \(-\sqrt {c}\) to 0 and \(\infty \) to 1, we obtain a new map \(\tilde {f}\) with the same dynamics (we just move the roots to 0 and \(\infty \)). In the next proposition, we describe some properties of \(\tilde {f}\).

Proposition 2.6

Let \(c\in \mathbb {C}\setminus \{0\}\) and let \(f:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\) be such that \(f(\sqrt {c})=\sqrt {c}\), \(f(-\sqrt {c})=-\sqrt {c}\), \(f(\infty )=\infty \) and f(−z) = −f(z) for all \(z\in \mathbb {C}\). Let \(\tau (z)=(z+\sqrt {c})/(z-\sqrt {c})\). Then the map \(\tilde {f}(z)=\tau \circ f\circ \tau ^{-1}(z)\) satisfies \(\tilde {f}(1)=1\) and \(\iota \circ \tilde {f}\circ \iota ^{-1}(z)=\tilde {f}(z)\), where ι(z) = 1/z.

Proof

Notice that \(\tau ^{-1}(z)=\sqrt {c}(z+1)/(z-1)\) and ι(z) = ι− 1(z). Let ς(z) = −z. We have

$$ \begin{array}{@{}rcl@{}} {\varsigma}\circ\tau^{-1}\circ \iota (z)&=&-\sqrt{c}\frac{1/z+1}{1/z-1}=-\sqrt{c}\frac{1+z}{1-z}=\sqrt{c}\frac{1+z}{z-1}=\tau^{-1}(z); \\ \iota\circ\tau \circ{\varsigma}(z)&=&\frac{1}{\frac{-z+\sqrt{c}}{-z-\sqrt{c}}}=\frac{-z-\sqrt{c}}{-z+\sqrt{c}}=\frac{z+\sqrt{c}}{z-\sqrt{c}}=\tau(z). \end{array} $$

Using that f(z) = ςfς(z) we conclude:

$$ \iota\circ \tilde{f}\circ \iota^{-1}(z)=\iota\circ \tau \circ f \circ \tau^{-1}\circ \iota(z)=\iota\circ \tau \circ {\varsigma}\circ f\circ{\varsigma} \circ \tau^{-1}\circ \iota(z)=\tau\circ f\circ \tau^{-1}(z)=\tilde{f}. $$

In Proposition 2.6, we have proved that \(\tilde {f}\) satisfies \(\tilde {f}(1)=1\) and \(\iota \circ \tilde {f}\circ \iota (z)=\tilde {f}(z)\), where ι(z) = 1/z. In the next theorem, we provide a classification of all rational maps satisfying these properties.

Theorem 2.7

Let \(R:\widehat {\mathbb {C}}\rightarrow \widehat {\mathbb {C}}\) be a degree n rational map and let ι(z) = 1/z. Then, the following items are equivalent:

  1. 1.

    R(z) satisfies (i) R(1) = 1 and (ii) ιRι− 1(z) = R(z).

  2. 2.

    \( R(z)=\prod \limits _{i=1}^{n}\frac {z-r_{i}}{1-r_{i}z}\), where \(r_{i}\in \mathbb {C}\), i = 1,...,n.

  3. 3.

    \( R(z)=\frac {a_{n} +a_{n-1} z+...+a_{1} z^{n-1}+a_{0} z^{n}}{a_{0}+a_{1}z+...+a_{n-1} z^{n-1}+a_{n} z^{n}}=\frac {P(z)}{\widehat {P}(z)}\), where \(a_{i}\in \mathbb {C}\), i = 0,...,n.

Proof

1 ⇒ 2) As R(z) is a degree n rational map, it can be written as:

$$ R(z)=C\frac{(z-r_{1})\cdot ...\cdot (z-r_{n_{1}})}{(z-s_{1})\cdot ...\cdot (z-s_{n_{2}})}, $$
(6)

where at least n1 = n or n2 = n and \(C\in \mathbb {C}\). Since ι− 1(z) = ι(z), we have that:

$$ \begin{array}{@{}rcl@{}} \iota\circ R \circ \iota^{-1}(z)&=&\iota\left( R\left( 1/z\right)\right)=\iota\left( C \frac{(\frac{1}{z}-r_{1})\cdot ...\cdot (1/z-r_{n_{1}})}{(1/z-s_{1})\cdot ...\cdot (1/z-s_{n_{2}})}\right)\\ &=&\frac{1}{C}\frac{(1/z-s_{1})\cdot ...\cdot (1/z-s_{n_{2}})}{(1/z-r_{1})\cdot ...\cdot (1/z-r_{n_{1}})}=R(z). \end{array} $$

From this expression and comparing with (6), we have that:

$$ \left. \begin{array}{l} z=1/s_{i},\text{ }i=1,...,n_{2},\text{ are roots of }R \\ z=r_{i},\text{ }i=1,...,n_{1},\text{ are roots of }R \end{array} \right\} \Longrightarrow n_{2}=n_{1}=n \text{ and } \frac{1}{s_{i}}=r_{j} \text{ for some } j. $$

Let us write \(\frac {1}{s_{i}}=r_{i},\) i = 1,...,n. Then, we have that:

$$ R(z)=\frac{1}{C}\frac{(1/z-s_{1})\cdot ...\cdot (1/z-s_{n})}{ (1/z-r_{1})\cdot ...\cdot (1/z-r_{n})}=\frac{1}{C}\frac{ (1-zs_{1})\cdot ...\cdot (1-z s_{n})}{(1-z r_{1})\cdot ...\cdot (1-z r_{n})} $$
(7)

Denominators in (6) and (7) are two degree n polynomials p(z) and q(z) with the same roots (including multiplicity), then p(z) = Kq(z) with \(K\in \mathbb {C}\ \) and

$$ R(z)=C\frac{(z-r_{1})\cdot ...\cdot (z-r_{n})}{K(1-zr_{1})\cdot ...\cdot (1-zr_{n})}. $$

Since R(1) = 1, we have that \(\frac {C}{K}=1\). We obtain

$$ R(z)=\frac{(z-r_{1})\cdot ...\cdot (z-r_{n})}{(1-zr_{1})\cdot ...\cdot (1-zr_{n})}. $$

2 ⇒ 3) For this implication, we use the induction method. This is obviously true for n = 1:

$$ \frac{(z-a)}{(1-a z)}. $$

Let us suppose that it is true for n − 1:

$$ \frac{a_{n-1} +a_{n-2} z+...+a_{1} z^{n-2}+a_{0} z^{n-1}}{a_{0}+a_{1}z+...+a_{n-1} z^{n-1}}={\prod}_{i=1}^{n-1} \frac{(z-r_{i})}{(1-r_{i} z)} $$

and we check that it is true for n:

$$ \begin{array}{@{}rcl@{}} & &\frac{(z-r_{1})(z-r_{2})...(z-r_{n-1})(z-r_{n})}{(1-r_{1} z)(1-r_{2} z)...(1-r_{n-1} z)(1-r_{n} z)} \\ &=&\frac{a_{n-1} +a_{n-2} z+...+a_{1} z^{n-2}+a_{0} z^{n-1}}{a_{0}+a_{1}z+...+a_{n-2} z^{n-2}+a_{n-1} z^{n-1}}\frac{z-r_{n}}{1-r_{n} z} \\ &=& \frac{-r_{n} a_{n-1}+(a_{n-1}-r_{n} a_{n-2})z+...+(a_{1}-r_{n} a_{0})z^{n-1}+a_{0} z^{n}}{a_{0}+(a_{1} -r_{n} a_{0})z+...+(a_{n-1}-r_{n} a_{n-2})z^{n-1}-a_{r-1} r_{n} z^{n}}\\ &=& \frac{b_{n} +b_{n-1} z+...+b_{1} z^{n-1}+b_{0} z^{n}}{b_{0}+b_{1}z+...+b_{n-1} z^{n-1}+b_{n} z^{n}}. \end{array} $$

3 ⇒ 1) Any rational function that satisfies 3) also satisfies 1). Indeed, it verifies R(1) = 1 and

$$ \begin{array}{@{}rcl@{}} \iota\circ R \circ \iota^{-1}(z)&=& \iota\left( R\left( \frac{1}{z}\right)\right)=\iota\left( \frac{a_{n}+a_{n-1}\frac{1}{z}+...a_{1} \frac{1}{z^{n-1}}+a_{0} \frac{1}{z^{n}}} {a_{0}+a_{1} \frac{1}{z}+...+a_{n-1} \frac{1}{z^{n-1}}+a_{n} \frac{1}{z^{n}}}\right)\\ &=& \iota\left( \frac{a_{n} z^{n}+a_{n-1} z^{n-1}+...+a_{1} z+a_{0}}{a_{0} z^{n}+a_{1} z^{n-1}+...+a_{n-1}z+a_{n}} \right)\\ &=& \frac {a_{0} z^{n}+a_{1} z^{n-1}+...+a_{n-1}z+a_{n}}{a_{n} z^{n}+a_{n-1} z^{n-1}+...+a_{1} z+a_{0}}=R(z). \end{array} $$

3 Dynamical study of Newton-like operators

In Section 2, we have justified that Newton-like methods applied on complex quadratic polynomials p(z) = z2c lead to an operator O(z) of the form (1), where n denotes the order of convergence to the roots of p(z). Examples of this fact can be found in [5, 7, 8, 13, 14, 30], among many other papers. In [5], we study this family of operators when the polynomials of the rational function have k equal roots. In this paper, we delve deeper into the dynamical study of the operator O(z). Following the results from Section 2, we can write the operator O(z) as follows:

$$ O(z)=z^{n} \frac{a_{k}+a_{k-1} z+...+a_{1} z^{k-1}+ z^{k}}{1+a_{1} z+...+a_{k-1} z^{k-1}+a_{k} z^{k}}=z^{n} \prod\limits_{i=1}^{k} \frac{(z-r_{i})}{(1-r_{i} z)}, $$
(8)

with ak≠ 0. Its derivative can be written as follows:

$$ O^{\prime}(z)=z^{n-1} \prod\limits_{i=1}^{k}\frac{(z-r_{i})}{(1-r_{i} z)} \left( n+z \sum\limits_{j=1}^{k} \frac{1-{r_{j}^{2}}}{(z-r_{j})(1-r_{j} z)} \right). $$
(9)

The relations among the coefficients ai and the roots ri of the polynomial \(P(z)=a_{k}+a_{k-1} z+...+a_{1} z^{k-1}+ z^{k}={\prod }_{i=1}^{k}(z-r_{i})\) are given by:

$$ \begin{array}{@{}rcl@{}} a_{k} &=& (-1)^{k}\prod\limits_{i=1}^{k}(r_{i}) \\ a_{k-1} &=& (-1)^{k-1}(r_{1}r_{2}...r_{k-1}+r_{1}r_{3}...r_{k}+...) \\ ...\\ a_{2} &=& r_{1}r_{2}+r_{1}r_{3}+.... \\ a_{1} &=& -\sum\limits_{i=1}^{k}r_{i}. \end{array} $$
(10)

Observe that the subscript of the coefficients indicates the number of roots that appear in the different products.

Remark 3.1

Along this paper, we suppose that z = 1 is not a root of the polynomial P(z) since, if z = 1 were a root of P(z), we could simplify the rational function of the operator. Let us suppose that rk = 1. Then,

$$ \begin{array}{@{}rcl@{}} O(z)&=&z^{n} \frac{a_{k}+a_{k-1} z+...+a_{1} z^{k-1}+ z^{k}}{1+a_{1} z+...+a_{k-1} z^{k-1}+a_{k} z^{k}}=z^{n} \prod\limits_{i=1}^{k} \frac{(z-r_{i})}{(1-r_{i} z)}\\ &=& - z^{n} \prod\limits_{i=1}^{k-1}\frac{(z-r_{i})}{(1-r_{i} z)}= -z^{n} \frac{b_{k-1}+...+b_{1} z^{k-2}+ z^{k-1}}{1+b_{1} z+...+b_{k-1} z^{k-1}}. \end{array} $$

In this case, the point z = 1 is not a fixed point of the operator, it forms an orbit of period 2 with z = − 1 if k + n even or it is a pre-image of z = − 1 if k + n odd. Therefore, in this paper, we assume that 1 + a1 + ... + ak≠ 0.

In the following lemma, we provide another relation between the roots and the coefficients of the polynomial.

Lemma 3.2

Given a polynomial P(z) = ak + ak− 1z + ... + a1zk− 1 + zk with roots ri, i = 1,...,k, the following relationship holds:

$$ \sum\limits_{j=1}^{k} \frac{1+r_{j} }{1-r_{j}} = k - 2 \frac{a_{1}+2a_{2}+...+k a_{k}}{1+a_{1}+a_{2}+...+a_{k}}. $$

Proof

Let us prove this relation by induction. For k = 1, we have the polynomial P1(z) = a1 + z with a1 = −r1. Therefore,

$$ \frac{1+r_{1}}{1-r_{1}}=1+\frac{2r_{1}}{1-r_{1}}= 1- 2 \frac{a_{1}}{1+a_{1}}. $$

Now we assume that for the degree k − 1 polynomial

$$ P_{k-1}(z)=a_{k-1}+a_{k-2} z+...+a_{1} z^{k-2}+ z^{k-1} $$

the relationship

$$ \sum\limits_{j=1}^{k-1} \frac{1+r_{j} }{1-r_{j}} = (k-1) - 2 \frac{a_{1}+2a_{2}+...+(k-1) a_{k-1}}{1+a_{1}+a_{2}+...+a_{k-1}} $$

holds. Finally, we prove the relation for the degree k polynomial Pk(z) = (zrk)Pk− 1(z) = bk + bk− 1z + ... + b1zk− 1 + zk:

$$ \begin{array}{@{}rcl@{}} \sum\limits_{j=1}^{k} \frac{1+r_{j} }{1-r_{j}} &=& {\sum}_{j=1}^{k-1} \frac{1+r_{j} }{1-r_{j}} +\frac{1+r_{k}}{1-r_{k}}=k -1- 2 \frac{a_{1}+2a_{2}+...+(k-1) a_{k-1}}{1+a_{1}+a_{2}+...+a_{k-1}}+1+\frac{2r_{k}}{1-r_{k}}\\ &=& k-2\frac{(a_{1}+2a_{2}+...+(k-1) a_{k-1})(1-r_{k})-r_{k}(1+a_{1}+a_{2}+...+a_{k-1})}{(1+a_{1}+a_{2}+...+a_{k-1})(1-r_{k})} \\ &=& k-2\frac{a_{1}-r_{k}+2(a_{2} - r_{k}a_{1})+3(a_{3}-r_{k}a_{2})+...+(k-1)(a_{k-1} - r_{k}a_{k-2})- k a_{k-1} r_{k}}{(1-r_{1})...(1-r_{k-1})(1-r_{k})} \\ &=& k - 2 \frac{b_{1}+2b_{2}+...+(k-1) b_{k-1}+k b_{k}}{1+b_{1}+b_{2}+...+b_{k}}, \end{array} $$

where the relations (10) have been used. □

Lemma 3.3

Given a polynomial P(z) = ak + ak− 1z + ... + a1zk− 1 + zk with roots ri, i = 1,...,k, the following relationship holds:

$$ \sum\limits_{j=1}^{k} \frac{1-r_{j} }{1+r_{j}} = k - 2 \frac{-a_{1}+2 a_{2}-...+(-1)^{k} k a_{k}}{1-a_{1}+a_{2}+...+(-1)^{k} a_{k}}. $$

Proof

The proof is similar to the previous lemma. □

We continue by studying some properties of the Newton-like operators. We focus on fixed points. Let f be an operator obtained by applying a root finding algorithm, that can be described by Theorem 2.2, on a quadratic polynomial p(z) = z2c. Then, the points \(\pm \sqrt {c}\) are attracting fixed points of f. Assume also that \(f(\infty )=\infty \). In Proposition 2.6, the operator (8) was obtained by applying a conjugacy that sends the fixed points \(-\sqrt {c}, \sqrt {c},\) and \(\infty \) to 0, \(\infty \), and 1, respectively. Therefore, 0 and \(\infty \) are fixed points under (8) which correspond to the roots of the polynomial while 1 is a strange fixed point (a fixed point which does not correspond to a root). Moreover, the term zn in (8) indicates that the method has order of convergence n to the roots. Next statement, whose proof is straightforward, indicates that the points 0, 1 and \(\infty \) are fixed points of (8) (even if the operator does not come from a root finding algorithm).

Proposition 3.4

The points z = 0, \(z=\infty \) and z = 1 are fixed points of operator (8). Moreover, z = − 1 is also a fixed point of (8) if n + k is odd and z = − 1 is a preimage of z = 1 if n + k is even.

When a Newton-like operator (8) comes from a one-parameter family of numerical methods, the coefficients ai usually have a linear dependence with respect to the parameter of the family, as it can be observed in the following subsections. In paper [5], we study a family of Newton-like operators where the coefficients have a quadratic dependence on the parameter that produces a duplicity of the information in the parameter plane. This duplicity was avoided by redefining the parameter in order to obtain a linear dependence.

As mentioned before, the point z = 1 is a strange fixed point of the operator (8), so it is important to analyse its dynamical behaviour. Assuming a linear relationship between the coefficients and the parameter of the family and using Lemma 3.2, we obtain the regions where the point z = 1 is attracting in terms of the coefficients.

Proposition 3.5

Let ai = Ai + Biα be the linear relationships between the coefficient ai of the operator (8) and the parameter α of the family of rational operators, with \(A_{i}, B_{i}\in \mathbb {R} \) and \(\alpha \in \mathbb {C}\). Let us denote:

$$ \begin{array}{@{}rcl@{}} A&=& n+k+ \sum\limits_{j=1}^{k} (n+k-2j)A_{j}, \ \ \ B= \sum\limits_{j=1}^{k} (n+k-2j)B_{j}, \\ A^{\prime}&=& 1+ \sum\limits_{j=1}^{k} A_{j}, \ \ \ B^{\prime}= \sum\limits_{j=1}^{k} B_{j}, \\ c&=&(AB-A^{\prime}B^{\prime})/(B^{2}-B^{\prime2}), \ \ \ r=(A^{\prime}B-AB^{\prime})/(B^{2}-B^{\prime2}). \end{array} $$

Then, the fixed point z = 1 satisfies the following statements:

  1. 1.

    For \(B^{2}-B^{\prime 2} \neq 0 \),

    1. i)

      z = 1 is indifferent on the circle C: \( \left | \alpha +c \right | =\left | r\right | \),

    2. ii)

      z = 1 is attracting inside C if \(B^{2}-B^{{\prime}{2}} > 0\) and outside C if \(B^{2}-B^{\prime 2} < 0\).

    3. iii)

      z = 1 is repelling outside C if \(B^{2}-B^{\prime 2} > 0\) and inside C if \(B^{2}-B^{\prime 2} < 0\).

  2. 2.

    For \(B^{2}-B^{\prime 2} =0\):

    1. a)

      If \(B=B^{\prime }\neq 0\), then

      1. i)

        z = 1 is indifferent if \(A=A^{\prime }\),

      2. ii)

        z = 1 is attracting if \(Re(\alpha )<-\frac {A+A^{\prime }}{2B}\) and \(B(A-A^{\prime })>0\) or HCode \(Re(\alpha )>-\frac {A+A^{\prime }}{2B}\) and \(B(A-A^{\prime })<0\).

      3. (i)

        z = 1 is repelling in other case.

    2. b)

      If \(B=-B^{\prime }\neq 0\), then

      1. i)

        z = 1 is indifferent if \(A=-A^{\prime }\),

      2. ii)

        z = 1 is attracting if \(Re(\alpha )<\frac {A^{\prime }-A}{2B}\) and \(B(A+A^{\prime })>0\) or \(Re(\alpha )>\frac {A^{\prime }-A}{2B}\) and \(B(A+A^{\prime })<0\).

      3. iii)

        z = 1 is repelling in other case.

    3. c)

      If \(B=B^{\prime }= 0\), then

      1. i)

        z = 1 is indifferent if \(|A|=|A^{\prime }|\),

      2. ii)

        z = 1 is attracting if \(|A|<|A^{\prime }|\),

      3. iii)

        z = 1 is repelling \(|A|>|A^{\prime }|\).

Moreover, if \( \alpha =-\frac {A}{B} \) the fixed point z = 1 is superattracting.

Proof

By sustituting the point z = 1 in (9) and by applying the result obtained in Lemma 3.2, we have:

$$ \left| O^{\prime} (1)\right|= \left|n+\sum\limits_{j=1}^{k}\frac{1+r_{j}}{1-r_{j}} \right|=\left|n+k-2 \frac{a_{1}+2a_{2}+...+k a_{k}}{1+a_{1}+a_{2}+...+a_{k}}\right|. $$

Assuming the relation ai = Ai + Biα, we obtain:

$$ \left| O^{\prime} (1)\right|= \left|\frac{(n+k)(1+A_{1}+B_{1} \alpha +...+A_{k}+B_{k} \alpha)-2 (A_{1}+B_{1} \alpha+...+k (A_{k}+B_{k} \alpha))}{1+A_{1}+B_{1} \alpha+...+A_{k}+B_{k} \alpha}\right| $$

that can be written as:

$$ \left| O^{\prime} (1)\right|=\left|\frac{n + k\!+{\sum}_{j=1}^{k} (n + k - 2j)A_{j} \!+\alpha {\sum}_{j=1}^{k} (n\!+k - 2j)B_{j}}{1+{\sum}_{j=1}^{k} A_{j} +\alpha {\sum}_{j=1}^{k}B_{j}}\right|=\left|\frac{A +\alpha B}{ A^{\prime} +\alpha B^{\prime}}\right|. $$

Now, we look for the values of the parameter that make \(\left | O^{\prime } (1)\right |<1\). As α is a complex parameter, we can write it as α = p + iq, \(\ p,q\in \mathbb {R} \). Then,

$$ \left|A + B(p + i q)|\!\!<\!|A^{\prime} + B^{\prime\!}(p\!+i q)\right| \ \ \!\!\Rightarrow\!\!\! \ \ p^{2}(B^{2}-B^{\prime2})+2p(AB-A^{\prime}B^{\prime})+q^{2}(B^{2}-B^{\prime2})\!<\!A^{\prime2}-A^{2}. $$

For the case \(B^{2}-B^{\prime 2} \neq 0\), this expression can be divided by \(B^{2}-B^{\prime 2}\) and, after looking for a perfect square, it can be written, in the complex plane, as the equation of the disk:

$$ \left( p+\frac{AB-A^{\prime}B^{\prime}}{B^{2}-B^{\prime2}}\right)^{2}+ q^{2}<\left( \frac{A^{\prime}B-AB^{\prime}}{B^{2}-B^{\prime2}}\right)^{2} $$

if \(B^{2}-B^{\prime 2}>0\), or the disk

$$ \left( p+\frac{AB-A^{\prime}B^{\prime}}{B^{2}-B^{\prime2}}\right)^{2}+ q^{2}>\left( \frac{A^{\prime}B-AB^{\prime}}{B^{2}-B^{\prime2}}\right)^{2} $$

if \(B^{2}-B^{\prime 2}<0\), obtaining the regions where the point z = 1 is attractive.

The point z = 1 is indifferent on the circle

$$ \left( p+\frac{AB-A^{\prime}B^{\prime}}{B^{2}-B^{\prime2}}\right)^{2}+ q^{2}=\left( \frac{A^{\prime}B-AB^{\prime}}{B^{2}-B^{\prime2}}\right)^{2} $$

and the point z = 1 is repelling for the other regions.

For the case \(B^{2}-B^{\prime 2} =0\), the proof of the latter cases is straightforward without more than substituting the conditions in \(\left | O^{\prime }(1)\right |\).

Finally, from \(O^{\prime }(1)=0\), we obtain that the point z = 1 is superattracting for α = −A/B. □

Moreover, if z = − 1 is also a fixed point of (8), i.e. n + k is odd, we have the following result. The proof is similar to the previous proposition from the substitution of z = − 1 in (8) and using Lemma 3.3.

Proposition 3.6

Let ai = Ai + Biα be the lineal relationship between the coefficients ai of the operator (8) and the parameter α of the family of rational operators, with \(A_{i}, B_{i} \in \mathbb {R} \), \(\alpha \in \mathbb {C}\) and n + k odd. Let us denote:

$$ \begin{array}{@{}rcl@{}} C&=& n+ k+ \sum\limits_{j=1}^{k} (-1)^{j} (n+ k-2j)A_{j}, \ \ \ D= {\sum}_{j=1}^{k} (-1)^{j} (n+ k-2j)B_{j}, \\ C^{\prime}&=& 1+ {\sum}_{j=1}^{k} (-1)^{j} A_{j}, \ \ \ D^{\prime}= \sum\limits_{j=1}^{k}(-1)^{j} B_{j},\\ c&=&(CD-C^{\prime}D^{\prime})/(D^{2}-D^{\prime2}), \ \ \ r=(C^{\prime}D-CD^{\prime})/(D^{2}-D^{\prime2}). \end{array} $$

Then, the fixed point z = − 1 satisfies the following statements:

  1. 1.

    For \(D^{2}-D^{\prime 2} \ne 0 \),

    1. i.)

      z = − 1 is indifferent on the circle S: \( \left | \alpha +c \right | =\left | r\right | \).

    2. ii.)

      z = − 1 is attracting inside the circle S if \(D^{2}-D^{\prime 2} > 0 \) and outside the circle S if \(D^{2}-D^{\prime 2}<0 \).

    3. iii.)

      z = − 1 is repelling outside the circle S if \(D^{2}-D^{\prime 2}>0 \) and inside the circle S if \(D^{2}-D^{\prime 2}<0 \).

  2. 2.

    For \(D^{2}-D^{\prime 2} =0\),

    1. a.)

      If \(D=D^{\prime }\neq 0\), then

      1. i.)

        z = − 1 is indifferent if \(C=C^{\prime }\).

      2. ii.)

        z = − 1 is attracting if \(Re(\alpha )<-\frac {C+C^{\prime }}{2D}\) and \(D(C-C^{\prime })>0\) or \(Re(\alpha )>-\frac {C+C^{\prime }}{2D}\) and \(D(C-C^{\prime })<0\).

      3. iii.)

        z = − 1 is repelling in other case.

    2. b.)

      If \(D=-D^{\prime }\neq 0\), then

      1. i.)

        z = − 1 is indifferent if \(C=-C^{\prime }\),

      2. ii.)

        z = − 1 is attracting if \(Re(\alpha )<\frac {C^{\prime }-C}{2D}\) and \(D(C+C^{\prime })>0\) or \(Re(\alpha )>\frac {C^{\prime }-C}{2D}\) and \(D(C+C^{\prime })<0\).

      3. iii.)

        z = − 1 is repelling in other case.

    3. c.)

      If \(D=-D^{\prime }= 0\), then

      • i.) z = − 1 is indifferent if \(|C|=|C^{\prime }|\),

      • ii.) z = − 1 is attracting if \(|C|<|C^{\prime }|\),

      • iii.) z = − 1 is repelling \(|C|>|C^{\prime }|\).

Moreover, if \( \alpha =-\frac {C}{D} \) the fixed point z = − 1 is superattracting.

As was observed in [5], the dynamical behaviour of the rational operators depends on the relationship between the exponent n of z and the degree k of the polynomial in the rational function of the operator. In fact, the numerical behaviour of the method is very bad when n < k and the better numerical behaviour occurs when n > k + 1.

4 Examples of Newton-like operators

In this section, we overview different methods in the literature. For the case n = k, we show some results for a particular case of dependence of the parameter. Along the paper, we mention examples of numerical methods for which there is exactly one free critical point, modulo symmetry; so, we can plot their parameter planes iterating a single critical point.

4.1 The case n > k

The first family of numerical methods that we studied satisfying these conditions is the Chebyshev-Halley family, defined by (5). A more complete study of the dynamical behaviour of this family can be found in [14] when this family is applied on quadratic polynomials and in [6], when it is applied on a family of n-degree polynomials.

After applying the corresponding algorithm on two-degree polynomials p(z) = z2c and enforcing the conjugacy map, we obtained the operator:

$$ O_{\alpha}(z)=z^{3} \frac{z-2(\alpha-1)}{1-2(\alpha-1)z}, $$
(11)

which corresponds to the operator (8) for n = 3 and k = 1. The parameter plane can be seen in Fig. 1.

Fig. 1
figure 1

Parameter plane of the Chebyshev-Halley family, n = 3, k = 1

As described in Section 1.2, the black zones correspond to values of the parameter for which the corresponding dynamical planes have attractors that do not correspond to the roots of the polynomial p(z) = z2c.

King’s family is another example of a family that leads to an operator of the form (8) for n = 4 and k = 2, see [12]. The operator of this iterative scheme is given in (2) which leads to:

$$ O_{\beta}(z)=z^{4} \frac{5+2\beta+(4+\beta)z+z^{2}}{1+(4+\beta)z+(5+2\beta)z^{2}} $$
(12)

after applying it on a quadratic family of polynomials. The parameter plane is shown in Fig. 2 (left).

Fig. 2
figure 2

Parameter planes of King’s family (left) and a family coming from Jarrat method (right). In both cases, n = 4, k = 2

As mentioned in the introduction, many methods built from Jarratt’s method also satisfy the conditions of Theorem 2.2. For example, the family of order four studied by Amat et al. in [1] with scheme (4). Applying it on two-degree polynomials p(z) = z2c and enforcing the conjugacy map, the operator obtained is:

$$ JA_{\beta} \left( z\right) = z^{4}\frac{8\beta-3+(4\beta-6)z-3z^{2}}{-3+(4\beta-6)z+(8\beta-3)z^{2}}. $$
(13)

The parameter plane can be seen in Fig. 2 (right).

Remark 4.1

Let us notice that the operators given in (12) and (13) correspond to the case n = 4 and k = 2. In fact, a change \(\beta \Rightarrow -\frac {4}{3}\beta -2\) leads to the same operator. So, the parameter planes are the same except for the symmetry induced by the negative sign in the change.

We find an example of n = 5 and k = 3 in the subfamily S2 coming from the Ostrowski-Chun methods (see [8]), whose operator is

$$ \begin{array}{@{}rcl@{}} OS2_{a}\left( z\right) &=& z^{5}\frac{14+5a+2(7+2a)z+(6+a)z^{2}+z^{3}}{1+(6+a)z+2(7+2a)z^{2}+(14+5a)z^{3}}. \end{array} $$
(14)

The parameter plane can be seen in Fig. 3.

Fig. 3
figure 3

Parameter plane of the subfamily S2, n = 5, k = 3

Let us notice that the numerical behaviour of these families is quite good due to the fact that the operators with presence of strange attractors correspond to values of the parameter included in the small black zones of the parameter plane.

4.2 A degenerate case

An example that coincides with the degenerate case of Remark 3.1 can be seen in paper [8] for the subfamily S5 of the Ostrowski-Chun methods. The operator is:

$$ \begin{array}{@{}rcl@{}} OS5_{a}\left( z\right) &=& z^{4} \frac{-5(7+2a)+(14+5a)z+(14+4a)z^{2}+(6+a)z^{3}+z^{4}}{1+(6+a)z+(14+4a)z^{2}+(14+5a)z^{3}-5(7+2a)z^{4}}\\ &=& -z^{4}\frac{5(7+2a)+(21+5a)z+(7+a)z^{2}+z^{3}}{1+(7+a)z+(21+5a)z^{2}+5(7+2a)z^{3}}. \end{array} $$
(15)

The points z = 1 and z = − 1 form an attractive 2-cycle of this operator.

The parameter plane can be seen in Fig. 4 (top). In this figure, green color corresponds to values of the parameter where the free critical point is located in the basin of attraction of the 2-cycle. Black color corresponds to values of the parameter where the orbit of the free critical point does not converge neither to any of the roots of the polynomial nor to the 2-cycle {− 1,1}.

Fig. 4
figure 4

The subfamily S5. The parameter plane (top) and dynamical planes for a = 2 − 9.3i (bottom left) and a = 0 (bottom right)

In Fig. 4 (bottom), we also show two dynamical planes of this family. Colour green indicates the basin of attraction of the 2-cycle {− 1,1}. The left figure corresponds to a value of the parameter where the critical point belongs to the basin of attraction of another attracting cycle. The right figure corresponds to a value of the parameter for which there is no other stable behaviour than the basins of attraction of the roots of the polynomial or to the 2-cycle {− 1,1}.

4.3 The case n = k

For n = k, the operator is:

$$ O_{\alpha}(z)=z^{n} \frac{a_{n}+a_{n-1} z+...+a_{1} z^{n-1}+ z^{n}}{1+a_{1} z+...+a_{n-1} z^{n-1}+a_{n} z^{n}}= z^{n} \prod\limits_{i=1}^{n}\frac{(z-r_{i})}{(1-r_{i} z)}. $$
(16)

This operator is obtained in many cases (see [7, 8, 13, 30], for example).

Next, we consider that all the coefficients of the operator (8) are real except an; that is, \(a_{i}\in \mathbb {R}\) for 1 = 1,2,...,n − 1 and \(a_{n}=\alpha \in \mathbb {C}\) and we delve deeper into the study of this type of operators.

In this case, the point z = − 1 is a preimage of z = 1. In the following result, we establish the set of parameters where the fixed point z = 1 is attracting for operator (16). We obtain that z = 1 is attracting outside a disk with radius related with the coefficients of the polynomial in the numerator of (16). The next proposition is a particular case of Proposition 3.5.

Proposition 4.2

The fixed point z = 1 of Oα satisfies the following statements:

  1. i)

    z = 1 is attracting if \(\left | \alpha +A^{\prime } \right | > \left |A\right | \),

  2. ii)

    z = 1 is indifferent if \(\left |\alpha +A^{\prime } \right | = \left |A\right | \),

  3. iii)

    z = 1 is repelling if \(\left | \alpha +A^{\prime } \right | < \left |A\right | \),

where A and \(A^{\prime }\neq 0\) are defined as follows:

$$ A= 2n+ 2\sum\limits_{j=1}^{n-1} (n-j)a_{j}, \ \ \ A^{\prime}= 1+ \sum\limits_{j=1}^{n-1} a_{j}. $$

Moreover, if A = 0 the fixed point z = 1 is superattracting.

Proof

The proof is straightforward from Proposition 3.5 taking into account that, in this case, an = α, ai = Ai for i = 1...n − 1, An = 0, Bi = 0 for i = 1...n − 1, Bn = 1; so B = 0 and \(B^{\prime }=1\) and \(A= 2n+ 2{\sum }_{j=1}^{n-1} (n-j)a_{j}\) and \(A^{\prime }= 1+ {\sum }_{j=1}^{n-1} a_{j}\). Then,

$$ |O^{\prime}(1)|=|\frac{A}{A^{\prime}+\alpha}| $$

From the previous expression, it is easy to see that z = 1 is superattracting, i.e. HCode \(O_{\alpha }^{\prime }(1)=0\), for A = 0. □

This type of operator is obtained in [7], where the c-family of iterative methods is applied on quadratic polynomials. After applying the conjugacy map (see [3]), the operator is:

$$ O_{c}\left( z\right)=z^{3}\frac{2\left( 1-2c\right)+5z+4z^{2}+z^{3} }{1+4z+5z^{2}+2\left( 1-2c\right) z^{3}} $$
(17)

and the parameter plane can be observed in Fig. 5 (left). Notice that the set of parameters for which z = 1 is attracting corresponds to the unbounded black disk.

Fig. 5
figure 5

Parameter plane of c −family for n = k = 3 (left) and the M4 family for \(\alpha =\frac {-1+5\beta }{\beta }\) n = k = 4 (right)

A three-step iterative method for solving non-linear equations is studied in [13]. It is obtained by using the technique of undetermined coefficients and the composition of Newton’s scheme with itself, with frozen Jacobian. The authors denote it as M4 and its iterative expression is:

$$ \begin{array}{@{}rcl@{}} y_{k} &=& x_{k}-F^{\prime}(x_{k})^{-1}F(x_{k}), \\ z_{k} &=& y_{k}-\frac{1}{\beta}F^{\prime}(x_{k})^{-1}F(y_{k}), \\ x_{k+1} &=& z_{k}- F^{\prime}(x_{k})^{-1}((2-1/\beta -\beta)F(y_{k})+\beta F(z_{k})). \end{array} $$

After applying it on quadratic polynomials and enforce the conjugacy map, the operator obtained is:

$$ \begin{array}{@{}rcl@{}} O_{\beta}(z)=z^{4} \frac{-1+5\beta+14\beta z +14\beta z^{2}+6\beta z^{3}+ \beta z^{4}}{\beta+6\beta z+14\beta z^{2}+14\beta z^{3}+(-1+5\beta)z^{4}} . \end{array} $$
(18)

This operator can be transformed into an operator of the type (16) by means of the change of the parameter \(\alpha =\frac {-1+5\beta }{\beta }\):

$$ O_{\alpha}(z)=z^{4} \frac{\alpha+14 z +14 z^{2}+6 z^{3}+ z^{4}}{1+6 z+14 z^{2}+14 z^{3}+\alpha z^{4}}. $$

Its parameter plane (see Fig. 5 right) is similar to the previous one (Fig. 5 left). Despite they correspond to different values of n, the main difference between these parameter planes comes from the different sign of the parameters c and α. This leads to a change of positions of the main interior bulb.

Another example of n = 4 is obtained from a sub-family of operators coming from the Ostrowski- Chun methods (see [8]). Applying this method on quadratic polynomials, and studying one of the subfamilies obtained, the resulting operator is:

$$ \begin{array}{@{}rcl@{}} OS4_{b}\left( z\right) &=& z^{4}\frac{4b-3-6 z-2 z^{2}+2z^{3}+z^{4}}{ 1+2z-2z^{2}-6z^{3}+\left( -3+4b\right) z^{4} }. \end{array} $$

The parameter plane of this family is shown in Fig. 6 (top). Let us realise that, in this case, the fixed point z = 1 is superattracting. Therefore, this family has bad initial conditions for every parameter. Since z = 1 is superattracting for all values of the parameter, we cannot observe the disks that appears in the parameter planes of Fig. 5. Since z = 1 is superattracting, z = 1 does not need to have a critical point (other than itself) in its basin of attraction. As in Fig. 4, we use the green colour when the critical orbit converges to the basins of attraction of the fixed point z = 1.

Fig. 6
figure 6

The subfamily S4 for n = 4, k = 4. The parameter plane (top) and dynamical planes for b = 5 (bottom left) and b = 1 + 1i (bottom right)

In Fig. 6 (bottom), we also show two dynamical planes of this family. Colour green indicates the basin of attraction of z = 1. The left figure corresponds to a value of the parameter where the critical point belongs to the basin of attraction of another attracting cycle. The right figure corresponds to a value of the parameter for which there is not other stable behaviour than the basins of attraction of the roots of the polynomial or z = 1.

Let us remark that in Fig. 5 the region of parameters where z = 1 is attracting corresponds to the exterior black zone. Similar parameter planes are also obtained in [13, 30]. Although in each of these papers a different numerical method is studied, the operator obtained after applied it on quadratic polynomials is of the type given in Eq. (16). We also want to point out that for Fig. 6 (top) the unbounded region corresponds to a set of parameters for which the free critical points are captured by the basin of attraction of z = 1.

Another example where the operator is given by (8) for n = k = 4 can be found in the S3 family studied in paper [8], whose operator is:

$$ \begin{array}{@{}rcl@{}} OS3_{a}\left( z\right)\! &=&\! z^{4}\frac{5(14+5a)^{2}+(196 + 76a+9a^{2})((14+5a)z+2(7+2a)z^{2}+(6+a)z^{3}+z^{4})}{(1+(6+a)z+2(7+2a)z^{2}+(14+5a)z^{3})(196 + 76a+9a^{2})+5(14+5a)^{2} z^{4}}. \end{array} $$
(19)

Let us notice that, in this case, there is not a linear relationship between the coefficients of the operator and the parameter of the family. The parameter plane can be seen in Fig. 7.

Fig. 7
figure 7

Parameter plane of the subfamily S3, n = 4, k = 4

5 Conclusions

When Newton-like algorithms are applied on quadratic polynomials, an intrinsic symmetry appears in the operator obtained.

Firstly, we study some symmetry properties of certain families of maps which can be used to obtain the expressions of Newton-like finding algorithms. We show that many of the current numerical methods can be obtained from the functions defined in the Theorem 2.2.

Moreover, the functions introduced in Theorem 2.2 can be used to generate new Newton-type algorithms for solving nonlinear equations.

We conclude from Theorem 2.7 that the operators obtained for the different families of numerical Newton-like methods have the same expression, regardless of the method used, when they are applied on quadratic polynomials p(z) = z2c.

We carry out a dynamical study for this generic operator. Moreover, except for the degenerate cases, z = 1 is always a fixed point of the operator and we study its stability region (Proposition 3.5). We also prove that z = − 1 is a fixed point when the degree of the operator is odd and we locate its stability region (Proposition 3.6).

Along this paper, we exhibit some parameter planes coming from different papers of literature where predicted results are observed.