Complex roots by Bairstow

Complex roots of a polynomials

Basically

The basic idea to find a root of a function is to start at a particular point and run along the graph of the function till it crosses the cero line. In a computer program this can be done by starting at point P1 and laying a tangent onto the graph. This tangent should cross the cero line somewhere at x2 and there where it crosses it is already a bit closer to the cero crossing of the graph than where we started. From x2 we go to P2 and do the same again: Lay another tangent onto the graph and go to its cero crossing x3 and do the same again till we are close enough at the cero crossing. Close enough means the difference between one such iteration and the next is small enough..

In mathematics that means with

We get

And so on…that works for real roots. In Electronic Science often complex roots are interesting as well and for these complex roots principally the same is valid. Only the numbers are complex now. That’s a bit more complex :-)

There can be an easier approach if the numbers in the polynomial are not complex but real. That’s quite often the case if an electronic filter shall be calculated and therefore a Laplace transformation of a transfer function shall be carried out.

The basic procedure is the same as Newton Maehly’s algorithm. We start a certain point and run down the function till it crosses the cero line and the equation for this is still

only the approach to get f(x-1) and f'(x-1) is a bit different now.

Bairstow

The method of Bairstow to find the complex roots of a polynomial with real parameters uses the fact that to a complex root of polynomial always exists its conjugate complex root. Otherwise the parameters of the polynomial wouldn’t be real. And so these two roots can be multiplied to get a polynomial with real values to look for.

The complex root x – (u + jv) multiplied by its conjugate counterpart x - (u – jv) becomes

(x - (u + jv)) * (x - (u – jv)) = x² – 2ux + u² + v²

With a little substitution this can be written as

x² – 2ux + u² + v² = x² – px – q

The next step is almost same as in the Newton Maehly algorithm: The original polynomial

Shall be divided by the polynomial x² – px – q (instead of x – p at Newton Maehly) and in the result should be a sub polynomial like

To provide that the entire polynomial becomes

Expanding this and doing a parameter comparison leads to

a₀ = b₀
a₁ = b₁ – p*b₀
a₂ = b₂ – p*b₁ – q*b₀
…,
a_i = b_i – p*b_i-1 – q*b_i-2

This solved for b gives an algorithm to calculate the b parameters in a iterative loop:

b₀ = a₀
b₁ = a₁ + p*b₀
b₂ = a₂ + p*b₁ + q*b₀
…
b_j = a_j + p*b_j-1 + q*b_j-2 for j = 2,3,...,n

Like this the b parameters can be calculated.

Now x² – px – q is a divider of Pn(x) if b_n-1 and b_n both are 0.

The equations for the parameters b_n-1 and b_n can be regarded as two independent functions and the roots of both can be found in a similar iterative function as it is done in the Newton Maehly algorithm. Only it has to be a 2 dimensional algorithm. Therefore a linearization is carried out to get an approximation for the iterations:

If the Δx and Δy of a function f(x, y) is small, the function value f(x + Δx, y + Δy) can be approximated by:

and as we are looking for the root of f(x), we set f_k+1(x,y) = 0 and write

and with our 2 independent functions b_n-1(p,q) and b_n(p,q) we get

We get 2 equations with 2 unknown values: Δp and Δq. These two values we are looking for to start a next iteration to find the root. This is basically the tangent that we lay onto the graph of the function. Only the function is 2 dimensional now :-)

All we need now is d b_n-1/dp, d b_n-1/dq, d b_n /dp and d b_n /dq. Therefor we go back to the formulation

b_j = a_j + p*b_j-1 + q*b_j-2

build the derivative of this in iterations and do some substitutions:

And for q

Now we have all the parts to solve the equations from above and to determine Δp and Δq: With the method of Cramer (see determinant)

and with these Δp and Δq we get the next p_k+1 and q_k+1 for the next iteration like:

These are all the components in theory. Now let’s put that into code.

First we build a loop to determine the values for b_n, b_n-1, b_n-2, c_n-1, c_n-2 and c_n-3 according to the sequence from above:

The following loop can be implemented:

bn2 = a[0];
cn2 = bn2;
bn1 = a[1] + xp * bn2;
cn1 = bn1 + xp * cn2;
cn = 0;
bn = 0;
for (i = 2; i < order; i++)
{
     bn = xp * bn1 + xq * bn2 + a[i];
     if (i < order - 1)
     {
         cn = xp * cn1 + xq * cn2 + bn;
         if (i < order - 2)
         {
              cn2 = cn1;
              cn1 = cn;
         }
         bn2 = bn1;
         bn1 = bn;
     }
}

I use bn = b_n, bn1 = b_n-1, bn2 = b_n-2, cn = c_n-1…

With these values the next p and q can be calculated as:

p = p + (bn * cn2 - bn1 * cn1) / (cn1 * cn1 - cn * cn2);
q = q + (bn1* cn - bn*cn1) / (cn1 * cn1 - cn * cn2);

But we have to make sure the divider is not 0. In this case the matrix equation would not be solvable.

I take the delta from one iteration to the next as

dxp = p - xp_old;
dxq = q - xq_old;

and put all this into a while loop checking

while ((Math.Abs(dxp) > 1E-10) && (Math.Abs(dxq) > 1E-10))

{…}

At the end of this loop we get p and q of x2 – px – q. This we have to decompose into

(x - (u + jv)) * (x - (u – jv)) = x2 – 2ux + u2 + v2

A parameter comparison shows:

That’s the complex case.

With this the algorithm finds all kinds of paired roots: Complex and real roots. In case of real roots it’s enough to compute the roots of the polynomial

x² + px + q

And that’s

and the roots are u + v and u-v.

(see how to solve a quadratic equation)

Next step

After one pair of roots has been found, we should not stop and be happy. There could be more than one pair them :-)

So we need to go on searching. Therefore we have to divide the original polynomial by x² – px – q and start the same procedure again. This we repeat till the remaining polynomial is shorter or as long as x² – px – q. Then we are finally done.

If the remaining polynomial is exactly x² – px – q. We don’t have to search for its roots but can directly solve the quadratic equation to get the last 2 roots.

Annotation

When I was implementing this algorithm, an interesting point was that there seems no exit criteria to be needed in the loops as it is in the Newton Maehly algorithm. Bairstow always finds something and goes on searching till the polynomial is deflated enough. That’s quite nice and makes live uncomplicated :-)

In my sample project I use the function FindRoot(…) to find one pair of real or complex roots. The parameters p and q of this pair is stored in the list

public List<SRoot>root

In a loop all paired roots are searched and stored in this list. Afterwards I run though this list, check the p and q parameter of each pair and accordingly the real and complex roots of them are extracted.

At end the remaining polynomial is displayed as well as the found roots.

Here the remaining polynomial is x + 3. That means there is one last root = -3.

In the list box above for each paired root the parameter p and q and the found roots to them are displayed.

C# Demo Project Bairstow

Bairstow.zip