A Deeper Analysis of Infinite Exponentials

(The analysis that follows is a light introduction. For more austere results, consult Paper 1).

Having seen some clues about the behavior of z^{z^{z^...}} for complex z, in the fractal in the previous section, it would be nice to investigate it a bit further. In fact, quoting a reference by John Bailey and Dave L. Renfro, which was posted in sci.math:

Baker and Rippon, "Convergence of infinite exponentials" Ann Acad Sci Fenn Ser AI Math 1983 179-186 showed that z^{z^{z^...}} converges for log(z) contained in {t*e^-t: |t| < 1 or tⁿ = 1 for some n= 1, 2, ..} and that it diverges^[1] elsewhere.

Let's look at it closer:

Assume that Log(z) is the principal branch of the complex log(z) in all the following steps.

Let also W denote the principal branch of the Lambert's W function.

Then:

It would be wiser to examine the behavior of the function f(z)=c^z, instead of looking at the exponential through recursive sequences as I had done above.

First, we need a second Lemma:

Lemma #2:

if z is such that: Re(z)>-Im(z)*cot(Im(z)) and |Im(z)|<Pi, then W(z*e^z)=z

We have good reason to believe this. In fact, Maple says:

>series(W(z*exp(z)), z, 7);

= z + O(z^7).

(It is tempting here to want to claim that W(z*e^z)=z for ALL values of z. Unfortunately, this is NOT true. We will see later down why).

Proof:

From the definition of Lambert's W function:

W(z)*e^W(z)=z

Here we are tempted to say:

using z=W^-1(w) we get: (15)

W(W^-1(w))*e^{W(W^(-1)(w))}=W^-1(w),

so:

w*e^w=W^-1(w)

and therefore:

W(w*e^w)=W(W^-1(w)) = w.

But we have to be careful with step (15). Since W has a branch cut on real values of z in: (-oo,-1/e], we need to examine the behavior of the inverse: z*e^z on those values a bit more carefully.

First let's look at the graph of f(z)=z:

>complexplot3d(z,z=-1-I..1+I);

Now let us see if the function W(z*e^z) has any branch cuts. Indeed it does:

>h:=z->W(z*exp(z));
>complexplot3d(h(z),z=-1-I..1+I);

Here is the same function in more detail and different angles:

It is obvious that something strange happens at a well defined boundary of the W(z*e^z) and outside this boundary the function differs radically from the function f(z)=z, so we need to investigate this boundary a bit further.

Since W has a branch cut on real values of z in (-oo, -1/e], we expect this branch cut to give rise to new branch cuts for W(z*e^z). And indeed it does. The new branch cuts will come precisely from those values of z, for which z*e^z (W's inverse) is in (-oo,-1/e].

Those values then, will be those for which:

z*e^z=w is in (-oo,-1/e]

But the solution to the above equation is none other (by the definition of W!!) than:

z=W(w) with w in (-oo,-1/e].

Since W's branch cut starts at -1/e, we can force w to range in (-oo,-1/e] and see what happens to the image:

>complexplot(W(w),w=-exp(-1)..-10);

The curve above defines a branch cut for all such z with Arg(z) in [0, Pi]. Another branch cut for the function W(z*e^z) is defined for those z with Arg(z) in [Pi, 2*Pi].

And the surprise here is that the two cuts are none other than the curve -y*cot(y)+y*I, which is the image of (-oo,-1/e] under the LambertW! The figure below is the graph of this curve, but not to scale.

Do we indeed have a branch cut at -y*cot(y)+y*I? Let's try to see this since we have bothered with it so far:

Note that if k>0 then
W(z*e^z) is close to -y*cot(y)+y*I, if
z*e^z is close to -1/e-k+/-epsilon*I,
which happens if:
z is close to W(-1/e-k+/-epsilon*I).

First we need a third Lemma:

Lemma #3

conjugate(W(z))=W(conjugate(z));

Proof:

This is one of the standard properties of Lambert's W function and is contained in a paper by Corless which describes the function in detail.

Now, let w be a real number in the range (-oo,-1/e] and let z₁ and z₂ be:
z₁=W(w+epsilon*I),
z₂=W(w-epsilon*I).

Now by the definition of W:

z₁*exp(z₁)=w+epsilon*I, and
z₂*exp(z₂)=w-epsilon*I.

Now if f(z)= W(z*e^z),

|f(z₁)-f(z₂)|=|W(z₁*exp(z₁))-W(z₂*exp(z₂))| (16)

Since z₂ falls inside the -y*cot(y)+y*I curve,
W(z₂*exp(z₂))=z₂ by Lemma #2, and

W(z₁*exp(z₁))=
W(w+epsilon*I)=
W(conjugate(w-epsilon*I))=
conjugate(W(w-epsilon*I))=
conjugate(z₂), using Lemma #2 and #3.

So (16) becomes:

|conjugate(z₂)-z₂| >=
2*Im(z₂) = constant.

So indeed, the curve -y*cot(y)+y*I defines a branch cut. Let us verify this, using some Maple code:

> f:=z->W(z*exp(z));
> epsilon:=0.001;
> k:=13; #set any appropriate positive k here.
> z1:=W(-exp(-1)-k+epsilon*I);
> z2:=W(-exp(-1)-k-epsilon*I);
> evalf(f(z1));
> evalf(f(z2));
> evalf(abs(f(z2)-f(z1)));
> evalf(2*Im(z2));

and indeed, |f(z₁)-f(z₂)|>=2*Im(z₂).

To conclude:

if z is inside the curve, then W(z*e^z)=z,

in other words: if z is such that: Re(z)>-Im(z)*cot(Im(z)) and |Im(z)|<Pi, then W(z*e^z)=z.

The restriction |Im(z)|<Pi, ensures that z is inside the MAIN region (that which includes the origin) that is defined by the curve -y*cot(y)+y*I. The condition Re(z)>-Im(z)*cot(Im(z)) is satisfied by other z's as well that fall outside this region. I haven't checked the validity of the lemma for those z's, but it appears that it is false for those z's. In any case, we don't care, since we need the lemma for the unit disk only. See below. The value Pi for the restriction comes from the fact that the curve -y*cot(y)+y*I has two asymptotes, at +/-Pi.

Now, we can go back to the proof of Lemma #2 and complete it.

(Note that an immediate side consequence of the proof for the lemma is that if z is such that: Re(z)>-Im(z)*cot(Im(z)) and |Im(z)|<Pi then W^-1(w)=w*e^w).

Continuing with the rest of the proof:

We now iterate the function:

f(z)=c^z

Our fixed points z are given by:

-W(-Log(c))/Log(c), (2)

then, using the part of the hypothesis that Log(c)=t*e^-t, (2) becomes:

W(-t*e^-t)/(-t*e^-t) (3)

Using the second Lemma: W(-t*e^-t)=-t, when t is inside the unit circle, since the unit circle falls INSIDE the curve that defines the branch cut (-y*cot(y)+y*I) (below),

(3) then becomes:

=-t/(-t*e^-t)

=e^t.

Iterating the function f(z) at its fixed point e^t, we get:

[e^{(t/e^t)}]^{[e^{(t/e^t)}]^{[e^{(t/e^t)}]^...}}=e^t.

It is interesting to note that the fixed points e^t are indeed attractors:

if c₀ is a fixed point of f(z), c₀ it is an attractor, if:

|f'(c₀)|<1 (4)

Using the hypothesis:

|f'(c₀)|

=|t*e^-t*e^{Log(c)*e^t}|

=|t*e^-t*e^{t*e^(-t)*e^t}|

=|t*e^-t*e^t|

=|t|<1

which is true in view of the hypothesis. Convergence follows from a theorem by Shell:

Theorem (Shell): c^{c^...} converges to e^t in some neighborhood of e^t if |t|<1 and can do so only if |t|<=1.

Now let t_k=e^2*k*Pi/n*i be a complex n-th root of unity,

with k in {0,1,2,...n-1}.

Using c=e^Log(c),

and since Log(c)=t*e^-t from the hypothesis,

consider the functions f_k(z)=c_k^z, where c_k=e^{(t_k/e^t_k)}

The fixed points for the functions f_k(z) are again calculated through the expression -W(-Log(c_k))/Log(c_k):

z_k=-W(-Log(c_k))/Log(c_k)

-W(-t_k*e^-t_k)/(t_k*e^-t_k) (5)

Now, t_k is a complex n-th root of unity, thus it is ON the unit circle, so:
W(-t_k*e^-t_k)=-t_k, from Lemma #2, so (5) becomes:

-t_k/(-t_k*e^-t_k)

=e^t_k

Then, it is immediately obvious, that the points z_k= e^t_k

are fixed points for their corresponding functions, f_k(z).

Iterating the functions f_k(z) at their corresponding fixed points z_k,

and using a similar analysis to that above, we get:

[e^{(t_k/e^t_k)}]^{[e^{(t_k/e^t_k)}]^{[e^{(t_k/e^t_k)}]^...}}=e^t_k.

A couple of observations:

It is interesting to note that the fixed point condition (4), fails badly with the case 2: tⁿ=1,

since if we attempt to use it (using e^t_k as the fixed points on the derivative of f_k(z)), we get:

|f_k'(e^t_k)|=

=|Log(c_k)*c_k^{e^t_k}|

=|t_k*e^-t_k*e^t_k|

=|t_k|

=1, since all the n-th roots t_k lie on the unit circle.

(Actually the fact that it fails is a good indicator that the fixed points are quite "sensitive", most likely unstable)

After looking at the corresponding fractal above (the one with the double bulb near the origin) it becomes apparent that the case tⁿ=1 concerns the periphery of the double bulb itself. In fact, the n roots of unity, get mapped onto the periphery of the bulb, under the map h(z)=e^{z*e^-z}.

Here is some Maple code to verify this:

First, the roots themselves:

>restart;
>with(plots):
>t:=(k,n)->exp(2*k*Pi*I/n);
>UR:=[evalf(seq( [Re(t(k,60)),Im(t(k,60))], k=0..59 ))]:
>plot(UR,style=point);

And then, the roots under the appropriate transformation:

>h:=z->exp(z*exp(-z));
>URT:=[evalf(seq([Re(h(t(k,60))),Im(h(t(k,60)))], k=0..59 ))]:
>plot(URT,style=point);

In particular, -1 gets mapped onto (1/e)^e and 1 gets mapped onto e^1/e. That's why the real range of the double bulb IS exactly, [(1/e)^e,e^1/e]

Note also that the fixed points e^t_k of the functions f_k(z) are unstable.

p-cycle

If we perturb z away from e^t_k, the iterates of the function f(z)=c^z will eventually form a p-cycle where p depends on k and n. (The results here can be very deep, as number theory enters the game...)

That is, the sequence {z, c^z, c^{c^z}, ...} will eventually stabilize onto the p-cycle: {c₀, f_k(c₀), f_k⁽²⁾(c₀), ...f_k^(p-1)(c₀)}, where c₀ is a solution to the equation: "f_k^(p)(z)=z" (p total c's).

(Note that there exist solutions to f_k^(p)(z)=z, distinct from the trivial one: W(-Log(c))/(-Log(z))), but Maple cannot find them since it can only solve the equation: f⁽ⁿ⁾(z)=z, for n=1, but not for any n>1. To see these solutions, consult the article on Solving the Second Real Auxiliary Equation)

p, as stated, depends on both k and n and now GCD's and other nice number theoretical goodies enter the game.

As above, c_k= e^t_k is a fixed point of f⁽ⁿ⁾(z), for all n. (since it is a fixed point of f⁽¹⁾(z)=f(z))

To try to see the behavior of f around those known fixed points e^tk, we define:

f⁽¹⁾(z)=c^z, as above, and
f⁽ⁿ⁾(z)=f(f^(n-1)(z)),
and look at |[f^(p)(c₀)]'|.

On the other hand, it can be shown by induction that:

[f⁽ⁿ⁾(z)]'=[Log(c)]ⁿ*Prod(f^(k)(z), k=1..n), so:

|[f^(p)(e^t_k)]'|
=|Log(c_k)^p*(e^t_k)^p|
=|[Log(c_k)*e^t_k]^p|, but this is:
|[t_k/e^t_k*e^t_k]^p|
=|t_k^p|
=1,

and this stubbornly refuses to give us any information.

However the above actually gives us a small clue about what happens on the boundary:
|tⁿ|=1 => {|t|=1 and tⁿ=1 OR |t|=1 and tⁿ <>1, n in N}

Here is some Maple code to actually view the cycles. The code constructs the corresponding functions f_k and iterates them (perturbed slightly) at their fixed points e^t_k. (Set n and k to desired values).

> t:=(k,n)->exp(2*k*Pi/n*I);

>n:=5;

>for k from 0 to n-1 do
>c[k]:=exp(t(k,n)*exp(-t(k,n))); #calculate the corresponding c_k's
>od:

>for k from 0 to n-1 do
>f[k]:=z->c[k]^z; #calculate the corresponding f_k's
>od:

>perturb:=3.5; #purturb a bit

>for k from 0 to n-1 do
>z[k]:=exp(t(k,n))+perturb; #calculate some points close to the fixed points
>od:

>k:=2; #set for iterations with specific k

> rLim:=evalf(Re(exp(t(k,n))));
> iLim:=evalf(Im(exp(t(k,n)))); # calculate actual limit point

> URL:=[[rLim,iLim]]: #graph of limit point

>UR:=[evalf(seq([Re((f[k]@@m)(z[k])),Im((f[k]@@m)(z[k]))], m=1..50 ))]:
>plot({UR,URL},-1..1,-1..1,style=point,color=red);

Here is the cycle along with the limit point for the third 5-th root of unity (k=2, n=5), perturbed 3.5 away from the limit point:

And here is the cycle trajectory:

(An interesting connection exists between those trajectories above and a certain optics problem. For details, check this page).

An Example of Chaos

Use exactly the same Maple code as above, but replace the line:

> t:=(k,n)->exp(2*k*Pi/n*I);

with:

> t:=(k,n)->exp(2*sqrt(2)*Pi*I); #an irrational multiple of 2*π

Result:

(The resultant chaotic turbulence of the orbit is directly proportional to the magnitude of the "perturb" factor).

On the other hand, let K={z: Log(z)=t*e^-t, |t| <=1} be the main kidney-shaped region along with its boundary on the figure above and let c not in K.

e^{-W(-Log(c))/(e^-W(-Log(c)))}
=e^{-W(-Log(c))/(W(-Log(c))/(-Log(c)))}
=e^Log(c)
=c

so t=-W(-Log(c)) is a pre-image of the point c, under the map h(z)=e^{z*e^-z}. Necessarily then:

|t|=|W(-Log(c))|> 1 (16)

for otherwise, h(t)=c in K and this is not the case by assumption.

Since f(z)'s fixed points c₀ are always given by -W(-Log(c))/Log(c):

|f '(c₀)|
=|Log(c)*c^c₀|
=|Log(c)*e^-W(-Log(c))|
=|Log(c)*W(-Log(c))/(-Log(c))|
=|W(-Log(c))|>1 (17)

using (16). But the later implies that f⁽ⁿ⁾(z), n in N, c not in K cannot possibly converge to a limit l, for if it did, f(l)=l and |f'(l)| <= 1, contradicting (17).

Notes/References

For what happens when it diverges and in particular on the boudary of the unit circle, see math.stackexchange.com.