Cubic equation

From Exampleproblems

Jump to: navigation, search
Image:Polynomialdeg3.png
Graph of a cubic polynomial:
y = x3/4 + 3x2/4 − 3x/2 − 2
= (1/4)(x + 4)(x + 1)(x − 2)

In mathematics, a cubic equation is a polynomial equation in which the highest occurring power of the unknown is the third power. An example is the equation

2x3 − 4x2 + 3x − 4 = 0

and the general form may be written as:

α3x3 + α2x2 + α1x + α0 = 0.

Usually, the coefficients α0, ..., α3 are real numbers. However, most of the theory is also valid if they belong to a field of characteristic other than two or three. We will always assume that α3 is non-zero (otherwise it is a quadratic equation).

Solving a cubic equation amounts to finding the roots of a cubic function.

(This article discusses cubic equations in one variable. For a discussion of cubic equations in two variables, see elliptic curve.)

Contents

History

The Iranian mathematician Omar Khayyám (1048–1123) constructed solutions of cubic equations by intersecting a conic section with a circle. He showed how this geometric solution could be used to get a numerical answer by consulting trigonometric tables.

In the early sixteenth century, the Italian mathematician Scipione dal Ferro found a method for solving a class of cubic equations, namely those x3 + mx = n. In fact, all cubic equations can be reduced to this form if we allow m and n to be negative, but negative numbers were not known at that time. Dal Ferro kept his achievement secret until just before his death, when he told his student about it. Tartaglia heard about this and soon found a method himself. He revealed his method to Gerolamo Cardano, who published it in Ars Magna 1545.

Cardano noticed that Tartaglia's method sometimes required him to extract the square root of a negative number. He even included a calculation with these complex numbers in Ars Magna, but he did not really understand it. Rafael Bombelli studied this issue in detail and is therefore often considered as the discoverer of complex numbers.

The nature of the roots

Every cubic equation with real coefficients has at least one solution x among the real numbers; this is a consequence of the intermediate value theorem. We can distinguish several possible cases using the discriminant,

 \Delta = 4\alpha_1^3\alpha_3 - \alpha_1^2\alpha_2^2 + 4\alpha_0\alpha_2^3 - 18\alpha_0\alpha_1\alpha_2\alpha_3 + 27\alpha_0^2\alpha_3^2.

The following cases need to be considered.

  • If Δ < 0, then the equation has three distinct real roots.
  • If Δ > 0, then the equation has one real root and a pair of complex conjugate roots.
  • If Δ = 0, then (at least) two roots coincide. To decide how many distinct roots there are, we define
 \Delta_2 = 2\alpha_2^3 - 9\alpha_1\alpha_2\alpha_3 + 27\alpha_0\alpha_3^2,
and consider two further cases.
  • If Δ2 = 0, then all three roots coincide and we have a triple real root.
  • Otherwise, the equation has a double real root and a single real root.
The number Δ2 is the resultant of the cubic and its second derivative.
See also: multiplicity of a root of a polynomial

Cardano's method

The solutions can be found with the following method due to Scipione dal Ferro and Tartaglia, published by Gerolamo Cardano in 1545.

We first divide the given equation by α3 to arrive at an equation of the form

x^3 + ax^2 + bx +c = 0. \qquad (1)

The substitution x = t - a/3 eliminates the quadratic term; in fact, we get the equation

 t^3 + pt + q = 0, \quad\mbox{where } p = b - \frac{a^2}3 \quad\mbox{and}\quad q = c + \frac{2a^3-9ab}{27}. \qquad (2)

This is called the depressed cubic.

Suppose that we can find numbers u and v such that

 u^3-v^3 = q \quad\mbox{and}\quad uv = \frac{p}{3}. \qquad (3)

A solution to our equation is then given by

t = vu,

as can be checked by directly substituting this value for t in (2), as a consequence of the third order binomial identity

 (v-u)^3+3uv(v-u)+(u^3-v^3)=0  \ .

The system (3) can be solved by solving the second equation for v, which gives

 v = \frac{p}{3u}.

Substituting this in the first equation in (3) yields

 u^3 - \frac{p^3}{27u^3} = q.

This can be seen as a quadratic equation for u3. If we solve this equation, we find that

 u=\sqrt[3]{{q\over 2}\pm \sqrt{{q^{2}\over 4}+{p^{3}\over 27}}}. \qquad (4)

Since t = vu and t = x + a/3, we find

x=\frac{p}{3u}-u-{a\over 3}.

Note that there are six possibilities in computing u with (4), since there are two solutions to the square root (\pm), and three complex solutions to the cubic root (the principal root and the principal root multiplied by -1/2 \pm i\sqrt{3}/2). However, which sign of the square root is chosen does not affect the final resulting x, although care must be taken in two special cases to avoid divisions by zero. First, if p = 0, then one should choose the sign of the square root that gives a nonzero value for u, i.e. u = \sqrt[3]{q}. Second, if p = q = 0, then we have the triple real root x = −a/3.

Lagrange resolvents

The symmetric group S3 of order three has the cyclic group of order three as a normal subgroup, which suggests making use of the discrete Fourier transform of the roots, an idea due to Lagrange. Suppose that r0, r1 and r2 are the roots of equation (1), and define \zeta = (-1+i\sqrt{3})/2, so that ζ is a primitive third root of unity. We now set

s_0 = r_0 + r_1 + r_2,\,
s_1 = r_0 + \zeta r_1 + \zeta^2 r_2,\,
s_2 = r_0 + \zeta^2 r_1 + \zeta r_2.\,

The roots may then be recovered from the three si by inverting the above linear transformation, giving

r_0 = (s_0 + s_1 + s_2)/3,\,
r_1 = (s_0 + \zeta^2 s_1 + \zeta s_2)/3,\,
r_2 = (s_0 + \zeta s_1 + \zeta^2 s_2)/3.\,

We already know the value s0 = −a, so we only need to seek values for the other two. However, if we take the cubes, a cyclic permutation leaves the cubes invariant, and a transposition of two roots exchanges s13 and s23, hence the polynomial

(z-s_1^3)(z-s_2^3) \qquad (5)

is invariant under permutations of the roots, and so has coefficients expressible in terms of (1). Using calculations involving symmetric functions or alternatively field extensions, we can calculate (5) to be

{z}^{2}+ \left( -9\,ba+2\,{a}^{3}+27\,c \right) z+ \left( {a}^{2}-3\,b\right)^{3}.

The roots of this quadratic equation are

\frac92\,ab-{a}^{3}- \frac{27}{2}\,c \pm \frac32\,\sqrt{3\Delta},

where Δ is the discriminant defined above. Taking cube roots give us s1 and s2, from which we can recover the roots ri of (1).

Factorization

If r is any root of (1), then we may factor using r to obtain

(xr)(x2 + (a + r)x + b + ar + r2) = x3 + ax2 + bx + c.

Hence if we know one root we can find the other two by solving a quadratic equation, giving

 \frac12 \left(-a-r \pm \sqrt{-3r^2-2ar+a^2-4b}\right)

for the other two roots. If we are finding the roots of a polynomial with real coefficients and one real root, we can find the real root purely in terms of the real (rather than complex) cube root function, or alternatively stated we can find the root by extracting cube roots only of positive quantities. The complex conjugate roots can then be found as above.

Chebyshev radicals

The cube root function is in some respects not a well-behaved function, or one convenient for the purposes of finding the roots of a cubic equation. While cube roots are well-known and traditional, it is possible to use other algebraic functions to determine the roots, and avoid some of the problems of cube roots. The cube root function has a branch singularity at zero, as a result of which the real cube root function does not extend nicely to a complex cube root function. Moreover, when using cube roots to find the roots of a polynomial with three real roots we must take the roots of complex numbers, which introduces complex numbers into a situation which does not, in fact, require them.

We can get around these problems by using Chebyshev cube roots in place of ordinary cube roots. The polynomial C3 = x3 − 3x is the third Chebyshev polynomial normalized to obtain a monic polynomial. The Chebyshev cube root is then defined as a (suitably chosen) root (depending on t) of the polynomial equation

x^3 - 3x = t \ .

The polynomial C3(x) satisfies the third order addition relations

\, 2\,\cos(3x)= C_3(2 \cos x)

and (as \, 2\cos(ix)=2\cosh(x) )

 2\,\cosh(3x)=C_3(2\cosh x) \ .

If t is represented as t = 2cosy, then the polynomial equation x3 − 3x = t can now be transformed into

 t=2\,\cos y=C_3(2\cos (y/3)) \ .

The function C_{1\over3}(t) is then defined as (a branch of) the algebraic function of the third order which transforms 2cos(x) into 2cos(x / 3). It is given (inverting the relation t = 2cos(x) to x = arccos(t / 2)) as

C_{1\over3}(t) = 2 \,\operatorname{cos}\left(\operatorname{arccos}\left({t\over2}\right)/3\right) ,

if t lies in the real interval [−2, 2]. If t lies in the interval  [2,\infty] , then the Chebyshev root is given as

C_{1\over3}(t) = 2 \,\operatorname{cosh}\left(\operatorname{arccosh}\left({t\over2}\right)/3\right) \ .

The branch is uniquely defined by the value at t = 0, which is  2\, \operatorname{cos}\left(\operatorname{arccos}(0)/3\right)=2\, \operatorname{cos}(\pi/6)=\sqrt{3}, corresponding to the positive solution of x3 − 3x = x(x2 − 3) = 0.

This procedure is precisely analogous to the definition of the cube root in terms of logarithms and exponentials, with arccosh(x/2) resp. arccos(x/2) in the place of ln(x), and 2cosh(x) resp. 2cos(x) in the place of exp(x). The Chebyshev cube root can be constructed as an analytic function on the cut plane \mathbb{C}\setminus [-\infty,-2] and is the unique branch of the algebraic function C_{1\over3}(t) with this property. In the domain D_1 :=  \{z \in \mathbb{C}\,  | \, \Re{z}>2\} it can be defined as

 C_{1\over3}(t)= 2\,\operatorname{cosh}\left(\operatorname{arccosh}\left({t\over2}\right)/3\right)

where \operatorname{arccosh}(z/2)=\ln{{z+\sqrt{z^2-4}}\over 2},  
using the branch of the logarithm which is real on the positive real line and the branch of the square root which is positive on the real axis. On the domain D_2 :=\mathbb{C}\setminus{\{[-\infty,-2] \cup [2,\infty]\}} it can be defined as

 C_{1\over3}(t)= 2 \,\operatorname{cos}\left(\operatorname{arccos}\left({t\over2}\right)/3\right) ,

where \operatorname{arccos}(z/2)={\pi \over 2}+i\ln{{iz+\sqrt{4-z^2}}\over 2}  \ . Both D1 and D2 are simply-connected domains in \mathbb{C} on which the functions  \operatorname{arccos}(z) and  \operatorname{arccosh}(z) 
are well-defined analytic functions (because the square roots  \sqrt{\pm (z^2-4)} exist as analytic functions on D1 resp. D2 and the argument functions  {z+\sqrt{z^2-4}}\over 2 and  {iz+\sqrt{4-z^2}}\over 2 of the logarithm do not vanish on each domain). Both (partially overlapping) definitions of the Chebyshev cube root on the domains D1 and D2 can be put together to define the Chebyshev cube root unambiguously as an analytic function on the larger domain D= \mathbb{C}\setminus [-\infty,-2]. In fact, if one approaches the critical value t = 2 from either the left or the right on the real axis the value of each representative will tend to 2. Because x = 2 is a simple root of the polynomial x3 − 3x − 2 the branch of the Chebyshev root (defined as the algebraic function F(t)=2+G(t) satisfying

\, F(t)^3-3F(t)-2=9G(t)+6G(t)^2+G(t)^3=0

and F(2) = 2 exists locally as an analytic function in a (sufficiently small) neighbourhood U of t = 2 (according to the (complex-analytic ) inverse function theorem) and takes real values if  t=2\pm \epsilon, \,\epsilon >0 . Then it must coincide (on the intersection  U \cap D_1 and  U \cap D_2 ) with each of the two representatives (in terms of arccos z resp. arccosh z) constructed above. Therefore the Chebyshev cube root is in fact an analytic function on the whole of the domain D.

An alternative construction of the Chebyshev cube root in terms of hypergeometric functions is sketched in the next subsection.

The Chebyshev cube root as a hypergeometric function

The expression

 2 \,\operatorname{cos}\left(\operatorname{arccos}\left({t\over2}\right)/3\right)=2\,\operatorname{cos}\left({\pi\over 6}-\operatorname{arcsin}\left({t\over2}\right)/3\right)

can be transformed (using the difference-to-product trigonometric identity for the cosine) into the representation

 \sqrt{3} \,\operatorname{cos}\left(\operatorname{arcsin}\left({t\over2}\right)/3\right)+ \operatorname{sin }\left(\operatorname{arcsin}\left({t\over2}\right)/3\right) \ .

For general complex parameter  \lambda \ne 0 the functions 2\,\operatorname{cos}\,(\lambda\, \operatorname{arcsin}(x/2)) and 2\,\operatorname{sin}\,(\lambda \,\operatorname{arcsin}(x/2)) are two linearly independent solutions of the second-order linear differential equation

\, (4-x^2)y''-xy'+\lambda^2 y=0

which can be obtained by differentiating the functional relations \, f(2\sin x)=2\sin(\lambda x) resp. \, f(2\sin x)=2\cos(\lambda x) twice with respect to x. The differential equation

\, (4-x^2)y''-xy'+\lambda^2 y=0

is equivalent (under the affine substitution  x \mapsto (2-4x) ) to the hypergeometric differential equation

 x(1-x) \,y''+{{1-2x}\over 2}\,y'+\lambda^2 y=0

with parameters  c={1\over 2},\, a=\lambda,\, b=-\lambda . According to the general theory of the hypergeometric equation it has (unless c is zero or a negative integer) a uniquely defined solution g which is analytic in x=0 and satisfies  \,g(0)=1 . It is given by the hypergeometric series (see hypergeometric function)

 \,F(a,b,c;z):=\,_2F_1 (a,b;c;z) = \sum_{n=0}^\infty 
\frac{(a)_n(b)_n}{(c)_n} \, \frac {z^n} {n!} \ .

Transforming back to the original differential equation one finds a solution g(x)=F(\lambda,-\lambda,{1\over2} ;{{2-x}\over 4}) of the differential equation

\, (4-x^2)y''-xy'+\lambda^2 y=0

which is analytic at x = 2 (unique up to scalar multiple). The representation

C_{{1\over 3}}(t)= \sqrt{3} \,\operatorname{cos}\left(\operatorname{arcsin}\left({t\over2}\right)/3\right)+ \operatorname{sin}\left(\operatorname{arcsin}\left({t\over2}\right)/3\right)

obtained above shows that the Chebyshev cube root is a solution of the differential equation

\, (4-x^2)y''-xy'+\lambda^2 y=0

for  \lambda={1\over 3} which is analytic at x = 2. It must be proportional to the argument-shifted hypergeometric series and thus

C_{{1\over 3}}(t)=2F({1\over 3},-{1\over 3},{1\over2} ;{{2-t}\over 4}) = \sum_{n=0}^\infty \frac{2}{1-3n} {3n \choose n}\left(\frac{2-t}{27}\right)^n\ ,

where the last series converges if | t − 2 | < 4. All three roots r1,r2,r3 of the equation x3 − 3xt = 0 are linear combinations of the two functions f_1(t)=2\operatorname{sin}\,\left({1\over 3} \operatorname{arcsin}{t\over2}\right) and f_2(t)=2\operatorname{cos}\,\left({1\over 3} \operatorname{arcsin}{t\over2}\right) \ . By construction

 r_1=C_{{1\over 3}}(t)=\sqrt{3} 
\,\operatorname{cos}\left(\operatorname{arcsin}\left({t\over2}\right)/3\right)+ \operatorname{sin}\,\left(\operatorname{arcsin}\left({t\over2}\right)/3\right)\, ,

the other two roots are

 r_2=-C_{{1\over 3}}(-t)=-\sqrt{3} 
\,\operatorname{cos}\left(\operatorname{arcsin}\left({t\over2}\right)/3\right)+ \operatorname{sin}\,\left(\operatorname{arcsin}\left({t\over2}\right)/3\right)

and

 r_3=-r_1-r_2= -2 \,\operatorname{sin}\left(\operatorname{arcsin}\left({t\over2}\right)/3\right) \ .

One derives the further relations

 r_1={\sqrt{3}\over 2}\sqrt{4-r_3^2}-{r_3\over 2} , \qquad  r_2=-{\sqrt{3}\over 2} \sqrt{4-r_3^2}- {r_3\over 2}

which can be verified independently by calculating the other two roots ( here r1,r2 ) given one root (here r3 ) by means of the relation

 t=x^3-3x=y^3-3y \Longrightarrow (y-x)(y^2+xy+x^2-3)=0,

solving the quadratic equation \, y^2+xy+(x^2-3)=0 for y, given x.

Solving a general cubic equation using Chebyshev cube roots

If we have a cubic equation which is already in depressed form, we may write it as \,x^3 - 3px - q = 0. Substituting x = \sqrt{p} z we obtain z^3 - 3z - p^{-\frac{3}{2}}q = 0 or equivalently

z^3 - 3z = p^{-\frac{3}{2}}q \ .

From this we obtain solutions to our original equation in terms of the Chebyshev cube root as

r_1 = \sqrt{p}\,C_{1\over3}(p^{-\frac{3}{2}}q),\,
r_2 = -\sqrt{p}\,C_{1\over3}(-p^{-\frac{3}{2}}q),\,
r_3 = -r_1 - r_2 \ .

If now we start from a general equation

x^3 + ax^2 + bx +c = 0 \qquad (1)

and reduce it to the depressed form under the substitution x = ta/3, we have \, p = (a^2-3b)/9 and \, q = -(2a^3-9ab+27c)/27, leading to

t_{a;b;c} = p^{-\frac{3}{2}}q = -\frac{2a^3-9ab+27c}{(a^2-3b)^{3/2}}.

This gives us the solutions to (1) as

r_1 = \sqrt{p}\,C_{1\over3}(t_{a;b;c})-{a\over 3} ,\,
r_2 = -\sqrt{p}\,C_{1\over3}(-t_{a;b;c})-{a\over 3},\,
r_3 = -r_1 - r_2 - {a\over 3}\ .

The case of a real equation

Suppose the coefficients of (1) are real. If s is the quantity q/r from the section on real roots, then s = t2; hence 0 < s < 4 is equivalent to −2 < t < 2, and in this case we have a polynomial with three distinct real roots, expressed in terms of a real function of a real variable, quite unlike the situation when using cube roots. If s > 4 then either t > 2 and C_{1\over3}(t) is the sole real root, or t < −2 and -C_{1\over3}(-t) is the sole real root. If s < 0 then the reduction to Chebyshev polynomial form has given a t which is a pure imaginary number; in this case iC_{1\over3}(-it)-iC_{1\over3}(it) is the sole real root. We are now evaluating a real root by means of a function of a purely imaginary argument; however we can avoid this by using the function

S_{1\over3}(t) = iC_{1\over3}(-it)-iC_{1\over3}(it) = 2 \operatorname{sinh}\left(\operatorname{arcsinh}\left({t\over2}\right)/3\right),\,

which is a real function of a real variable with no singularities along the real axis. If a polynomial can be reduced to the form x3 + 3xt with real t, this is a convenient way to solve for its roots.

See also

External links

References

de:Kubische Gleichung da:Tredjegradsligning es:Ecuación de tercer grado ja:三次方程式 pl:Równanie trzeciego stopnia zh:三次方程

Argan Oil
Natural Skin Care
Organic Skin Care
visitor stats