Discriminant
In mathematics, the discriminant of a polynomial is a quantity that depends on the coefficients and allows deducing some properties of the roots without computing them. More precisely, it is a polynomial function of the coefficients of the original polynomial. The discriminant is widely used in polynomial factoring, number theory, and algebraic geometry.
The discriminant of the quadratic polynomial is
the quantity which appears under the square root in the quadratic formula. If this discriminant is zero if and only if the polynomial has a double root. In the case of real coefficients, it is positive if the polynomial has two distinct real roots, and negative if it has two distinct complex conjugate roots.[1] Similarly, the discriminant of a cubic polynomial is zero if and only if the polynomial has a multiple root. In the case of a cubic with real coefficients, the discriminant is positive if the polynomial has three distinct real roots, and negative if it has one real root and two distinct complex conjugate roots.
More generally, the discriminant of a univariate polynomial of positive degree is zero if and only if the polynomial has a multiple root. For real coefficients and no multiple roots, the discriminant is positive if the number of non-real roots is a multiple of 4 (including none), and negative otherwise.
Several generalizations are also called discriminant: the discriminant of an algebraic number field; the discriminant of a quadratic form; and more generally, the discriminant of a form, of a homogeneous polynomial, or of a projective hypersurface (these three concepts are essentially equivalent).
Origin
The term "discriminant" was coined in 1851 by the British mathematician James Joseph Sylvester.[2]
Definition
Let
be a polynomial of degree n (this means ), such that the coefficients belong to a field, or, more generally, to a commutative ring. The resultant of A and its derivative is a polynomial in with integer coefficients, which is the determinant of the Sylvester matrix of A and A′. The nonzero entries of the first column of the Sylvester matrix are and and the resultant is thus a multiple of Hence the discriminant—up to its sign—is defined as the quotient of the resultant of A and A' by :
Historically, this sign has been chosen such that, over the reals, the discriminant will be positive when all the roots of the polynomial are real. The division by may not be well defined if the ring of the coefficients contains zero divisors. Such a problem may be avoided by replacing by 1 in the first column of the Sylvester matrix—before computing the determinant. In any case, the discriminant is a polynomial in with integer coefficients.
Expression in terms of the roots
When the polynomial is defined over a field, it has n roots, r1, r2, ..., rn, not necessarily all distinct, in any algebraically closed extension of the field. (If the coefficients are real numbers, the roots may be taken in the field of complex numbers, where the fundamental theorem of algebra applies.)
In terms of the roots, the discriminant is equal to
It is thus the square of the Vandermonde polynomial times an2n − 2.
This expression for the discriminant is often taken as a definition. It makes clear that if the polynomial has a multiple root, then its discriminant is zero, and that if all the roots are real and simple, then the discriminant is positive. Unlike the previous definition, this expression is not obviously a polynomial in the coefficients, but this follows either from the fundamental theorem of Galois theory, or from the fundamental theorem of symmetric polynomials by noting that this expression is a symmetric polynomial in the roots of A.
Low degrees
The discriminant of a linear polynomial (degree 1) is rarely considered. If needed, it is commonly defined to be equal to 1 (using the usual conventions for the empty product and considering that one of the two blocks of the Sylvester matrix is empty). There is no common convention for the discriminant of a constant polynomial (i.e., polynomial of degree 0).
For small degrees, the discriminant is rather simple (see below), but for higher degrees, it may become unwieldy. For example, the discriminant of a general quartic has 16 terms,[3] that of a quintic has 59 terms,[4] and that of a sextic has 246 terms.[5] This is OEIS sequence A007878.
Degree 2
The quadratic polynomial has discriminant
The square root of the discriminant appears in the quadratic formula for the roots of the quadratic polynomial:
where the discriminant is zero if and only if the two roots are equal. If a, b, c are real numbers, the polynomial has two distinct real roots if the discriminant is positive, and two complex conjugate roots if it is negative.[6]
The discriminant is the product of a2 and the square of the difference of the roots.
If a, b, c are rational numbers, then the discriminant is the square of a rational number if and only if the two roots are rational numbers.
Degree 3
The cubic polynomial has discriminant
In the special case of a depressed cubic polynomial , the discriminant simplifies to
The discriminant is zero if and only if at least two roots are equal. If the coefficients are real numbers, and the discriminant is not zero, the discriminant is positive if the roots are three distinct real numbers, and negative if there is one real root and two complex conjugate roots.[7]
The square root of a quantity strongly related to the discriminant appears in the formulas for the roots of a cubic polynomial. Specifically, this quantity can be −3 times the discriminant, or its product with the square of a rational number; for example, the square of 1/18 in the case of Cardano formula.
If the polynomial is irreducible and its coefficients are rational numbers (or belong to a number field), then the discriminant is a square of a rational number (or a number from the number field) if and only if the Galois group of the cubic equation is the cyclic group of order three.
Degree 4
The quartic polynomial has discriminant
The discriminant is zero if and only if at least two roots are equal. If the coefficients are real numbers and the discriminant is negative, then there are two real roots and two complex conjugate roots. Conversely, if the discriminant is positive, then the roots are either all real or all non-real.
Properties
Zero discriminant
The discriminant of a polynomial over a field is zero if and only if the polynomial has a multiple root in some field extension.
The discriminant of a polynomial over an integral domain is zero if and only if the polynomial and its derivative have a non-constant common divisor.
In characteristic 0, this is equivalent to saying that the polynomial is not square-free (i.e., divisible by the square of a non-constant polynomial).
In nonzero characteristic p, the discriminant is zero if and only if the polynomial is not square-free or it has an irreducible factor which is not separable (i.e., the irreducible factor is a polynomial in ).
Invariance under change of the variable
The discriminant of a polynomial is, up to a scaling, invariant under any projective transformation of the variable. As a projective transformation may be decomposed into a product of translations, homotheties and inversions, this results in the following formulas for simpler transformations, where P(x) denotes a polynomial of degree n, with as leading coefficient.
- Invariance by translation:
- This results from the expression of the discriminant in terms of the roots
- Invariance by homothety:
- This results from the expression in terms of the roots, or of the quasi-homogeneity of the discriminant.
- Invariance by inversion:
- when Here, denotes the reciprocal polynomial of P; that is, if and then
Invariance under ring homomorphisms
Let be a homomorphism of commutative rings. Given a polynomial
in R[x], the homomorphism acts on A for producing the polynomial
in S[x].
The discriminant is invariant under in the following sense. If then
As the discriminant is defined in terms of a determinant, this property results immediately from the similar property of determinants.
If then may be zero or not. One has, when
When one is only interested in knowing whether a discriminant is zero (as is generally the case in algebraic geometry), these properties may be summarised as:
- if and only if either or
This is often interpreted as saying that if and only if has a multiple root (possibly at infinity).
Product of polynomials
If R = PQ is a product of polynomials in x, then
where denotes the resultant with respect to the variable x, and p and q are the respective degrees of P and Q.
This property follows immediately by substituting the expression for the resultant, and the discriminant, in terms of the roots of the respective polynomials.
Homogeneity
The discriminant is a homogeneous polynomial in the coefficients; it is also a homogeneous polynomial in the roots and thus quasi-homogeneous in the coefficients.
The discriminant of a polynomial of degree n is homogeneous of degree 2n − 2 in the coefficients. This can be seen two ways. In terms of the roots-and-leading-term formula, multiplying all the coefficients by λ does not change the roots, but multiplies the leading term by λ. In terms of its expression as a determinant of a (2n − 1) × (2n − 1) matrix (the Sylvester matrix) divided by an, the determinant is homogeneous of degree 2n − 1 in the entries, and dividing by an makes the degree 2n − 2.
The discriminant of a polynomial of degree n is homogeneous of degree n(n − 1) in the roots. This follows from the expression of the discriminant in terms of the roots, which is the product of a constant and squared differences of roots.
The discriminant of a polynomial of degree n is quasi-homogeneous of degree n(n − 1) in the coefficients, if, for every i, the coefficient of is given the weight n − i. It is also quasi-homogeneous of the same degree, if, for every i, the coefficient of is given the weight i. This is a consequence of the general fact that every polynomial which is homogeneous and symmetric in the roots may be expressed as a quasi-homogeneous polynomial in the elementary symmetric functions of the roots.
Consider the polynomial
It follows from what precedes that the exponents in every monomial a0i0. ..., anin appearing in the discriminant satisfy the two equations
and
and also the equation
which is obtained by subtracting the second equation from the first one multiplied by n.
This restricts the possible terms in the discriminant. For the general quadratic polynomial there are only two possibilities and two terms in the discriminant, while the general homogeneous polynomial of degree two in three variables has 6 terms. For the general cubic polynomial, there are five possibilities and five terms in the discriminant, while the general homogeneous polynomial of degree 4 in 5 variables has 70 terms
For higher degrees, there may be monomials which satisfy above equations and do not appear in the discriminant. The first example is for the quartic polynomial ax4 + bx3 + cx2 + dx + e, in which case the monomial bc4d satisfies the equations without appearing in the discriminant.
Real roots
In this section, all polynomials have real coefficients.
It has been seen in § Low degrees that the sign of the discriminant provides a full information on the nature of the roots for polynomials of degree 2 and 3. For higher degrees, the information provided by the discriminant is less complete, but still useful. More precisely, for a polynomial of degree n, one has:
- The polynomial has a multiple root if and only if its discriminant is zero.
- If the discriminant is positive, the number of non-real roots is a multiple of 4. That is, there is a nonnegative integer k ≤ n/4 such that there are 2k pairs of complex conjugate roots and n − 4k real roots.
- If the discriminant is negative, the number of non-real roots is not a multiple of 4. That is, there is a nonnegative integer k ≤ (n − 2)/4 such that there are 2k + 1 pairs of complex conjugate roots and n − 4k + 2 real roots.
Homogeneous bivariate polynomial
Let
be a homogeneous polynomial of degree n in two indeterminates.
Supposing, for the moment, that and are both nonzero, one has
Denoting this quantity by one has
and
Because of these properties, the quantity is called the discriminant or the homogeneous discriminant of A.
If and are permitted to be zero, the polynomials A(x, 1) and A(1, y) may have a degree smaller than n. In this case, above formulas and definition remain valid, if the discriminants are computed as if all polynomials would have the degree n. This means that the discriminants must be computed with and indeterminate, the substitution for them of their actual values being done after this computation. Equivalently, the formulas of § Invariance under ring homomorphisms must be used.
Use in algebraic geometry
The typical use of discriminants in algebraic geometry is for studying plane algebraic curves, and more generally algebraic hypersurfaces. Let V be such a curve or hypersurface; V is defined as the zero set of a multivariate polynomial. This polynomial may be considered as a univariate polynomial in one of the indeterminates, with polynomials in the other indeterminates as coefficients. The discriminant with respect to the selected indeterminate defines a hypersurface W in the space of the other indeterminates. The points of W are exactly the projection of the points of V (including the points at infinity), which either are singular or have a tangent hyperplane that is parallel to the axis of the selected indeterminate.
For example, let f be a bivariate polynomial in X and Y with real coefficients, so that f = 0 is the implicit equation of a real plane algebraic curve. Viewing f as a univariate polynomial in Y with coefficients depending on X, then the discriminant is a polynomial in X whose roots are the X-coordinates of the singular points, of the points with a tangent parallel to the Y-axis and of some of the asymptotes parallel to the Y-axis. In other words, the computation of the roots of the Y-discriminant and the X-discriminant allows one to compute all of the remarkable points of the curve, except the inflection points.
Generalizations
There are two classes of the concept of discriminant. The first class is the discriminant of an algebraic number field, which, in some cases including quadratic fields, is the discriminant of a polynomial defining the field.
Discriminants of the second class arise for problems depending on coefficients, when degenerate instances or singularities of the problem are characterized by the vanishing of a single polynomial in the coefficients. This is the case for the discriminant of a polynomial, which is zero when two roots collapse. Most of the cases, where such a generalized discriminant is defined, are instances of the following.
Let A be a homogeneous polynomial in n indeterminates over a field of characteristic 0, or of a prime characteristic that does not divide the degree of the polynomial. The polynomial A defines a projective hypersurface, which has singular points if and only the n partial derivatives of A have a nontrivial common zero. This is the case if and only if the multivariate resultant of these partial derivatives is zero, and this resultant may be considered as the discriminant of A. However, because of the integer coefficients resulting of the derivation, this multivariate resultant may be divisible by a power of n, and it is better to take, as a discriminant, the primitive part of the resultant, computed with generic coefficients. The restriction on the characteristic is needed because otherwise a common zero of the partial derivative is not necessarily a zero of the polynomial (see Euler's identity for homogeneous polynomials).
In the case of a homogeneous bivariate polynomial of degree d, this general discriminant is times the discriminant defined in § Homogeneous bivariate polynomial. Several other classical types of discriminants, that are instances of the general definition are described in next sections.
Quadratic forms
A quadratic form is a function over a vector space, which is defined over some basis by a homogeneous polynomial of degree 2:
or, in matrix form,
for the symmetric matrix , the row vector , and the column vector . In characteristic different from 2,[8] the discriminant or determinant of Q is the determinant of A.[9]
The Hessian determinant of Q is times its discriminant. The multivariate resultant of the partial derivatives of Q is equal to its Hessian determinant. So, the discriminant of a quadratic form is a special case of the above general definition of a discriminant.
The discriminant of a quadratic form is invariant under linear changes of variables (that is a change of basis of the vector space on which the quadratic form is defined) in the following sense: a linear change of variables is defined by a nonsingular matrix S, changes the matrix A into and thus multiplies the discriminant by the square of the determinant of S. Thus the discriminant is well defined only up to the multiplication by a square. In other words, the discriminant of a quadratic form over a field K is an element of K/(K×)2, the quotient of the multiplicative monoid of K by the subgroup of the nonzero squares (that is, two elements of K are in the same equivalence class if one is the product of the other by a nonzero square). It follows that over the complex numbers, a discriminant is equivalent to 0 or 1. Over the real numbers, a discriminant is equivalent to −1, 0, or 1. Over the rational numbers, a discriminant is equivalent to a unique square-free integer.
By a theorem of Jacobi, a quadratic form over a field of characteristic different from 2 can be expressed, after a linear change of variables, in diagonal form as
More precisely, a quadratic forms on may be expressed as a sum
where the Li are independent linear forms and n is the number of the variables (some of the ai may be zero). Equivalently, for any symmetric matrix A, there is an elementary matrix S such that is a diagonal matrix. Then the discriminant is the product of the ai, which is well-defined as a class in K/(K×)2.
Geometrically, the discriminant of a quadratic form in three variables is the equation of a quadratic projective curve. The discriminant is zero if and only if the curve is decomposed in lines (possibly over an algebraically closed extension of the field).
A quadratic form in four variables is the equation of a projective surface. The surface has a singular point if and only its discriminant is zero. In this case, either the surface may be decomposed in planes, or it has a unique singular point, and is a cone or a cylinder. Over the reals, if the discriminant is positive, then the surface either has no real point or has everywhere a negative Gaussian curvature. If the discriminant is negative, the surface has real points, and has a negative Gaussian curvature.
Conic sections
A conic section is a plane curve defined by an implicit equation of the form
where a, b, c, d, e, f are real numbers.
Two quadratic forms, and thus two discriminants may be associated to a conic section.
The first quadratic form is
Its discriminant is the determinant
It is zero if the conic section degenerates into two lines, a double line or a single point.
The second discriminant, which is the only one that is considered in many elementary textbooks, is the discriminant of the homogeneous part of degree two of the equation. It is equal to[10]
and determines the shape of the conic section. If this discriminant is negative, the curve either has no real points, or is an ellipse or a circle, or, if degenerated, is reduced to a single point. If the discriminant is zero, the curve is a parabola, or, if degenerated, a double line or two parallel lines. If the discriminant is positive, the curve is a hyperbola, or, if degenerated, a pair of intersecting lines.
Real quadric surfaces
A real quadric surface in the Euclidean space of dimension three is a surface that may be defined as the zeros of a polynomial of degree two in three variables. As for the conic sections there are two discriminants that may be naturally defined. Both are useful for getting information on the nature of a quadric surface.
Let be a polynomial of degree two in three variables that defines a real quadric surface. The first associated quadratic form, depends on four variables, and is obtained by homogenizing P; that is
Let us denote its discriminant by
The second quadratic form, depends on three variables, and consists of the terms of degree two of P; that is
Let us denote its discriminant by
If and the surface has real points, it is either a hyperbolic paraboloid or a one-sheet hyperboloid. In both cases, this is a ruled surface that has a negative Gaussian curvature at every point.
If the surface is either an ellipsoid or a two-sheet hyperboloid or an elliptic paraboloid. In all cases, it has a positive Gaussian curvature at every point.
If the surface has a singular point, possibly at infinity. If there is only one singular point, the surface is a cylinder or a cone. If there are several singular points the surface consists of two planes, a double plane or a single line.
When the sign of if not 0, does not provide any useful information, as changing P into −P does not change the surface, but changes the sign of However, if and the surface is a paraboloid, which is elliptic or hyperbolic, depending on the sign of
References
- "Discriminant | mathematics". Encyclopedia Britannica. Retrieved 2020-08-09.
- Sylvester, J. J. (1851). "On a remarkable discovery in the theory of canonical forms and of hyperdeterminants". Philosophical Magazine. 4th series. 2: 391–410.
Sylvester coins the word "discriminant" on page 406. - Wang, Dongming (2004). Elimination practice: software tools and applications. Imperial College Press. ch. 10 p. 180. ISBN 1-86094-438-8.
- Gelfand, I. M.; Kapranov, M. M.; Zelevinsky, A. V. (1994). Discriminants, resultants and multidimensional determinants. Birkhäuser. p. 1. ISBN 3-7643-3660-9. Archived from the original on 2013-01-13.
- Dickenstein, Alicia; Emiris, Ioannis Z. (2005). Solving polynomial equations: foundations, algorithms, and applications. Springer. ch. 1 p. 26. ISBN 3-540-24326-7.
- Irving, Ronald S. (2004). Integers, polynomials, and rings. Springer-Verlag New York, Inc. ch. 10.3 pp. 153–154. ISBN 0-387-40397-3.
- Irving, Ronald S. (2004). Integers, polynomials, and rings. Springer-Verlag New York, Inc. ch. 10 ex. 10.14.4 & 10.17.4, pp. 154–156. ISBN 0-387-40397-3.
- In characteristic 2, the discriminant of a quadratic form is not defined, and is replaced by the Arf invariant.
- Cassels, J. W. S. (1978). Rational Quadratic Forms. London Mathematical Society Monographs. Vol. 13. Academic Press. p. 6. ISBN 0-12-163260-1. Zbl 0395.10029.
- Fanchi, John R. (2006). Math refresher for scientists and engineers. John Wiley and Sons. sec. 3.2, p. 45. ISBN 0-471-75715-2.