The author studies forms over finite fields obtained as the determinant of Hermitian matrices and uses these determinantal forms to define and study the base polynomial of a square matrix over a finite field.
The author gives full proofs of the new results, citing earlier work by other authors where it is used in the proofs. Related references are quoted in the introduction.
The author obtains a few theorems, mainly describing some monic polynomials that arise as the base polynomial of a square matrix.
As far as the author knows, all the results are new, and the approach is also new.
1. Introduction
For any field K, any positive integer m and variables $t_1, \dots, t_m$, we write $K[t_1, \dots, t_m]$ for the polynomial ring over K in the variables $t_1, \dots, t_m$, not for the ring of all polynomial functions $K^m \to K$. These two rings are isomorphic if and only if the field K is infinite. If K is a finite field, then the ring of polynomial functions is isomorphic to $K[t_1, \dots, t_m]/(t_1^{\#K} - t_1, \dots, t_m^{\#K} - t_m)$. In this paper, K is always finite, with either $\#K = q$ or $\#K = q^2$, where q is a fixed prime power.
Fix a prime p and a power q of p. For any $M = (a_{ij}) \in M_{n,n}(\mathbb{F}_{q^2})$, let $M^\dagger$ denote the matrix $(a_{ji}^q)$. M is said to be Hermitian if $M = M^\dagger$. Note that the diagonal elements of a Hermitian matrix are elements of $\mathbb{F}_q$ and that the set of all Hermitian n × n matrices forms an $\mathbb{F}_q$-vector space of dimension $n^2$. We briefly recall the notion of Hermitian geometry for the Galois degree 2 extension $\mathbb{F}_{q^2}$ of $\mathbb{F}_q$. The Frobenius map $\sigma\colon t \mapsto t^q$ is a generator of the Galois group of this degree 2 extension. The Hermitian form (i.e. σ-sesquilinear form) is defined by the formula
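Fix positive integers m, n and m Hermitian n × n matrices $M_1, \dots, M_m \in M_{n,n}(\mathbb{F}_{q^2})$. Consider the polynomial
$$\det(t_1M_1 + \cdots + t_mM_m) \in \mathbb{F}_{q^2}[t_1, \dots, t_m]$$
and call it the determinantal polynomial of the Hermitian matrices M1, …, Mm. For m ≥ 2 consider
$$\det(t_1M_1 + \cdots + t_{m-1}M_{m-1} + t_mI_{n\times n}),$$
where $I_{n\times n}$ is the identity matrix.
We say that this polynomial is the base polynomial of the Hermitian matrices M1, …, Mm−1.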
All these polynomials are homogeneous polynomials of degree n with coefficients in $\mathbb{F}_q$ (Lemma 1).
The motivation for this paper came from Kippenhahn’s paper on the numerical range, his definition of the base polynomial f(x, y, z) and his use of the dual curve of the plane curve {f(x, y, z) = 0} to characterize the numerical range ([1, 2]), which is even now a source of inspiration ([3, 4]). The numerical range is also defined for matrices over $\mathbb{F}_{q^2}$ ([5–8]), using the choice of a certain element $\beta \in \mathbb{F}_{q^2} \setminus \mathbb{F}_q$ ([5, 6]). With this choice, for any $M \in M_{n,n}(\mathbb{F}_{q^2})$ we get uniquely determined Hermitian matrices $M_+$ and $M_-$ such that $M = M_+ + \beta M_-$ (see References [1, 2] for more details). The field $\mathbb{F}_{q^2}$ is a degree 2 extension of $\mathbb{F}_q$. First assume q odd. There is $\alpha \in \mathbb{F}_q$ which is a square in $\mathbb{F}_{q^2}$, but not in $\mathbb{F}_q$. We take $\beta \in \mathbb{F}_{q^2}$ such that $\beta^2 = \alpha$ and set $M_+ := (M + M^\dagger)/2$ and $M_- := (M - M^\dagger)/(2\beta)$. Now assume q even. There is $\varepsilon \in \mathbb{F}_q$ such that the polynomial $t^2 + t + \varepsilon$ has no root in $\mathbb{F}_q$. We call β one of its roots in $\mathbb{F}_{q^2}$ (the other one is β + 1). We set $M_- := M + M^\dagger$ and $M_+ := (\beta + 1)M + \beta M^\dagger$.
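The decomposition into two Hermitian parts can be checked by brute force in the smallest odd case. The following is a minimal sketch, assuming q = 3 and the model F_9 = F_3[i]/(i^2 + 1), in which a pair (a, b) stands for a + bi; in this model α = −1 is a non-square in F_3 and β = i satisfies β^2 = α. The helper names (`dagger`, `combine`, `plus`, `minus`) are illustrative and are not taken from the paper.

```python
from itertools import product

Q = 3  # q = 3, so matrices have entries in F_9
ZERO, ONE, TWO, BETA = (0, 0), (1, 0), (2, 0), (0, 1)

def add(u, v):
    return ((u[0] + v[0]) % Q, (u[1] + v[1]) % Q)

def neg(u):
    return ((-u[0]) % Q, (-u[1]) % Q)

def mul(u, v):
    # (a + bi)(c + di) = (ac - bd) + (ad + bc)i, because i^2 = -1
    return ((u[0] * v[0] - u[1] * v[1]) % Q, (u[0] * v[1] + u[1] * v[0]) % Q)

def power(u, k):
    result = ONE
    for _ in range(k):
        result = mul(result, u)
    return result

def inv(u):
    # u^(q^2 - 2) is the inverse of a non-zero u, since u^(q^2 - 1) = 1
    return power(u, Q * Q - 2)

def dagger(M):
    # entry (i, j) of the result is the q-th power of entry (j, i) of M
    n = len(M)
    return [[power(M[j][i], Q) for j in range(n)] for i in range(n)]

def combine(s, M, t, N):
    # entrywise s*M + t*N for scalars s, t in F_9
    n = len(M)
    return [[add(mul(s, M[i][j]), mul(t, N[i][j])) for j in range(n)]
            for i in range(n)]

def plus(M):
    half = inv(TWO)
    return combine(half, M, half, dagger(M))

def minus(M):
    s = inv(mul(TWO, BETA))
    return combine(s, M, neg(s), dagger(M))

F9 = [(a, b) for a in range(Q) for b in range(Q)]

# Check every 2 x 2 matrix over F_9: both parts are Hermitian and
# plus(M) + beta * minus(M) gives back M.
all_ok = all(
    plus(M) == dagger(plus(M))
    and minus(M) == dagger(minus(M))
    and combine(ONE, plus(M), BETA, minus(M)) == M
    for e in product(F9, repeat=4)
    for M in [[[e[0], e[1]], [e[2], e[3]]]]
)
print(all_ok)
```

The flag `all_ok` records whether the decomposition holds for all 9^4 matrices of size 2 × 2; the even case would need a different model of the field and the formulas for q even.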
Using $M_+$ and $M_-$, one can apply Kippenhahn’s definition of the base polynomial of a square complex matrix and set
$$\mathrm{bp}(M) := \det(xM_+ + yM_- + zI_{n\times n}).$$
Note that bp(M) is a homogeneous polynomial of degree n, that $z^n$ is one of its monomials and that the coefficient of $z^n$ is 1. We call such degree n forms monic. A form f is said to be concise if there is no linear change of coordinates after which f depends on fewer coordinates. For degree 2 forms conciseness is equivalent to the smoothness of their zero-locus (Remark 10).
In Sections 4 and 5, we study the realizability problem (which monic forms are of the form bp(A) for some A) for 2 × 2 matrices. At the end of Section 4, we collect several questions concerning the base polynomials.
We get some negative results, i.e. many matrices have base polynomials that are uninteresting and unrelated to the numerical range of any non-zero matrix. We prove the following result.
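Fix $A \in M_{n,n}(\mathbb{F}_{q^2})$.
(i) Assume either $A = A_+$ or $A = \beta A_-$. Then $\mathrm{bp}(A) = z^n$ if and only if 0 is the unique eigenvalue of A over the algebraic closure of $\mathbb{F}_{q^2}$.
(ii) There are $q^2$ matrices A of size 2 × 2 such that $A = A_+$ (resp. $A = \beta A_-$) and $\mathrm{bp}(A) = z^2$.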
(iii) Assume n = 2. Then bp(A) = z2 if and only if there are , , , such that
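(iv) Assume q even. There are $(q-1)(q^2-1)$ matrices $A \in M_{2,2}(\mathbb{F}_{q^2})$ such that $\mathrm{bp}(A) = z^2$, $A_+ \ne 0$ and $A_- \ne 0$. Each such A is of the form $A = A_+ + \beta A_-$ with $A_+$ and $A_-$ as in (2) and (3). Each such matrix A is obtained by taking $c \in \mathbb{F}_{q^2} \setminus \{0\}$ and $t \in \mathbb{F}_q \setminus \{0\}$, setting d ≔ tc and taking as a and e the only elements of $\mathbb{F}_q$ such that $a^2 = c^{q+1}$ and $e^2 = d^{q+1}$.
(v) Take q odd. There are at least $q^2$ matrices $A \in M_{2,2}(\mathbb{F}_{q^2})$ such that $A_+ \ne 0$, $A_- \ne 0$ and $\mathrm{bp}(A) = z^2$. Some of them may be obtained by taking $A_+$ as in (2) and taking $A = A_+ + \beta A_+$.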
We get some positive results (obtaining a monic polynomial as the base polynomial of a square matrix). This is called the reconstruction problem for monic polynomials. We prove the case of 2 × 2 matrices, i.e. we prove the following result.
All monic degree 2 forms are realized as a base polynomial.
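Let K be a field. Take $f \in K[x_1, \dots, x_n]$. We say that f depends on n variables, or that it does not depend on fewer variables, or that it is concise, if there is no pair (g, M), where $M = (m_{ij}) \in M_{n-1,n}(K)$ and $g \in K[y_1, \dots, y_{n-1}]$, such that $f(x_1, \dots, x_n) = g(y_1, \dots, y_{n-1})$, where
$$y_i = \sum_{j=1}^{n} m_{ij}x_j, \qquad 1 \le i \le n-1.$$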
We say that the polynomial 0 depends on 0 variables. In Section 3, we study the conciseness of some determinantal polynomial and of some base polynomial, with the main results only for 2 × 2 matrices. We conclude Section 3 with several questions.
We found only a weak connection between the study of our determinantal polynomial and the (in principle) very similar problem of the description of a homogeneous form as a determinant of a matrix of linear forms. A. Beauville wrote the beautiful paper [9], which also contains realization as the determinant of a symmetric matrix of linear forms and as the Pfaffian of an anti-symmetric matrix. We discuss this topic in Section 5 which studies bp(A) for a matrix such that and . Of course, it depends on the choice of . Section 5 also contains the use of [9] for , mainly for m = 3.
We thank a referee for useful suggestions.
2. Preliminaries
For any matrix set . Thus, M is Hermitian if and only if Mt = M(q). Note that and that (tM)(q) = tqM(q) for all .
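Assume $q = p^e$ for some e > 0. The field $\mathbb{F}_q$ is the set of all z in an algebraic closure $\overline{\mathbb{F}}_q$ such that $z^q = z$ ([10, page 1], [11, Theorem 2.5]). Fix any $a \in \mathbb{F}_q \setminus \{0\}$. Since q + 1 is invertible in $\mathbb{F}_q$, the polynomial $t^{q+1} - a$ and its derivative $(q+1)t^q$ have no common zero. Hence, the polynomial $t^{q+1} - a$ has q + 1 distinct roots in $\overline{\mathbb{F}}_q$. Fix any one of them, b. Since $a^{q-1} = 1$, we get $b^{q^2-1} = (b^{q+1})^{q-1} = 1$. Thus, $b \in \mathbb{F}_{q^2}$. Thus, for any $a \in \mathbb{F}_q \setminus \{0\}$ there are exactly q + 1 elements $c \in \mathbb{F}_{q^2}$ such that $c^{q+1} = a$. Obviously, 0 is the only element t of $\mathbb{F}_{q^2}$ such that $t^{q+1} = 0$.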
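The count of solutions of c^(q+1) = a can be verified directly for a small field. The following is a minimal sketch, assuming q = 3 and the illustrative model F_9 = F_3[i]/(i^2 + 1), where a pair (a, b) stands for a + bi; the names `fibers` and `sizes` are hypothetical.

```python
Q = 3  # q = 3, so the extension field is F_9

def mul(u, v):
    # (a + bi)(c + di) = (ac - bd) + (ad + bc)i, because i^2 = -1
    a, b = u
    c, d = v
    return ((a * c - b * d) % Q, (a * d + b * c) % Q)

def power(u, k):
    result = (1, 0)
    for _ in range(k):
        result = mul(result, u)
    return result

F9 = [(a, b) for a in range(Q) for b in range(Q)]

# fibers[a] lists all c in F_9 with c^(q+1) = a, for each a in F_3.
fibers = {a: [c for c in F9 if power(c, Q + 1) == (a, 0)] for a in range(Q)}
sizes = {a: len(cs) for a, cs in fibers.items()}
print(sizes)  # {0: 1, 1: 4, 2: 4}
```

Each non-zero element of F_3 has q + 1 = 4 preimages and 0 has only one, so the nine elements of F_9 are all accounted for.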
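Note that $(-1)^q = -1$ in $\mathbb{F}_{q^2}$. Let $M^{(q)}$ denote the matrix obtained by raising each entry of M to the q-th power. Since $(u + v)^q = u^q + v^q$ and $(u - v)^q = u^q + (-1)^qv^q = u^q - v^q$ for all $u, v \in \mathbb{F}_{q^2}$, we have $\det(M^{(q)}) = \det(M)^q$ for all $M \in M_{n,n}(\mathbb{F}_{q^2})$. Now assume that M is Hermitian, i.e. assume $M = M^\dagger$. Thus, $\det(M) = \det(M^t) = \det(M^{(q)}) = \det(M)^q$. Hence, $\det(M) \in \mathbb{F}_q$ by Remark 2.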
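For any two Hermitian matrices $A, B \in M_{n,n}(\mathbb{F}_{q^2})$, there is a unique $M \in M_{n,n}(\mathbb{F}_{q^2})$ such that $A = M_+$ and $B = M_-$, namely the matrix $M = A + \beta B$.
Take $A, B \in M_{n,n}(\mathbb{F}_{q^2})$ and $a, b \in \mathbb{F}_q$. We have $(aA + bB)_+ = aA_+ + bB_+$ and $(aA + bB)_- = aA_- + bB_-$. Usually these equalities fail if a or b is not an element of $\mathbb{F}_q$. For instance, if A is Hermitian, A ≠ 0 and a = β, then $(aA)_+ = 0$, while $(aA)_- = A$.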
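For any square matrices $A = (a_{ij})$ of size n and $B = (b_{ij})$ with entries in $\mathbb{F}_{q^2}$, let A ⊕ B denote the block matrix $C = (c_{ij})$ such that $c_{ij} = a_{ij}$ if 1 ≤ i ≤ n and 1 ≤ j ≤ n, $c_{ij} = 0$ if either i > n and j ≤ n or i ≤ n and j > n, and $c_{ij} = b_{i-n,j-n}$ if i > n and j > n. The matrix A ⊕ B is called the unitary direct sum of A and B. Since $(A \oplus B)_+ = A_+ \oplus B_+$ and $(A \oplus B)_- = A_- \oplus B_-$, we have bp(A ⊕ B) = bp(A)bp(B).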
Fix positive integers m, n and take m n × n Hermitian matrices . Then
Proof. Since for all i, . Thus to prove that , it is sufficient to prove that all its coefficients are preserved by the Frobenius map x↦xq. Let be the coefficient of . Since the Frobenius map is additive, is a monomial of . Recall that (Remark 3). Since and , then αq = α. Hence, (Remark 2). □
Take such that M = M†. The matrix M has 0 as its unique eigenvalue in if and only if there are and such that
Moreover, there are exactly q2 such matrices.
Proof. A 2 × 2 matrix over a field K has 0 as its unique eigenvalue over the algebraic closure of K if and only if its trace and determinant are 0. Since $M = M^\dagger$, these are exactly the conditions on the entries of M stated in the lemma. For any $a \in \mathbb{F}_q \setminus \{0\}$, there are q + 1 elements $c \in \mathbb{F}_{q^2}$ such that $c^{q+1} = -a^2$ (Remark 2). 0 is the unique $c \in \mathbb{F}_{q^2}$ such that $c^{q+1} = 0$. Since $\#(\mathbb{F}_q \setminus \{0\}) = q - 1$, there are $1 + (q-1)(q+1) = q^2$ such matrices. □
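The count in this proof can be confirmed by enumeration in a small case. The following is a minimal sketch, assuming q = 3 and the illustrative model F_9 = F_3[i]/(i^2 + 1) with pairs (a, b) standing for a + bi. It assumes that a Hermitian 2 × 2 matrix has diagonal entries a, e in F_3 and off-diagonal entries c and c^q with c in F_9, and it keeps the matrices whose trace and determinant vanish; the list name `nilpotent` is hypothetical.

```python
Q = 3  # q = 3, so Hermitian matrices have entries in F_9

def mul(u, v):
    # (a + bi)(c + di) = (ac - bd) + (ad + bc)i, because i^2 = -1
    a, b = u
    c, d = v
    return ((a * c - b * d) % Q, (a * d + b * c) % Q)

def power(u, k):
    result = (1, 0)
    for _ in range(k):
        result = mul(result, u)
    return result

F9 = [(a, b) for a in range(Q) for b in range(Q)]

nilpotent = []
for a in range(Q):          # entry (1, 1), an element of F_3
    for e in range(Q):      # entry (2, 2), an element of F_3
        for c in F9:        # entry (1, 2); entry (2, 1) is c^q
            norm = power(c, Q + 1)[0]  # c^(q+1) lies in F_3
            trace = (a + e) % Q
            det = (a * e - norm) % Q
            if trace == 0 and det == 0:
                nilpotent.append(((a, 0), c, power(c, Q), (e, 0)))

print(len(nilpotent))  # 9
```

For q = 3 the enumeration finds 9 = q^2 matrices: the zero matrix, plus q + 1 = 4 choices of c for each of the q − 1 = 2 non-zero values of a.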
The definition of bp(A) depends on the definitions of A+ and A−, which depend on the choice of a suitable . We explore the dependency of A+, A− and bp(A) for different choices of β if q is odd. Assume q odd. Take a different choice and call it γ. We write A+(β), A−(β), bp(A)β, A+(γ), A−(γ) and bp(A)γ for the matrices and polynomials obtained from these two choices. Since q is odd, A+(β) = A+(γ) and . Thus, .
For all integers d ≥ 0 and any field K, let K[x,y,z]d denote the set of all homogeneous degree d polynomials in the variables x, y, z with coefficients in K. The set K[x,y,z]d is a K-vector space of dimension . Fix . We have for every (Lemma 1).
Take such that f(x, y, z) = (z + ax + by)n for some . Then .
Proof. Since is a perfect field, the plane {z + ax + by = 0} is defined over . Thus, there is , c ≠ 0, such that . Since c ≠ 0, we first get and then . □
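Proof of the result on matrices with $\mathrm{bp}(A) = z^n$ stated in the Introduction: Assume $A = A_+$, i.e. assume $A_- = 0$. Thus, $\mathrm{bp}(A) = \det(xA + zI_{n\times n})$. Since the eigenvalues of A are the roots of the polynomial $\det(tI_{n\times n} - A)$, we get that $\mathrm{bp}(A) = z^n$ if and only if all eigenvalues of A are 0, i.e. we get part (i) for $A = A_+$. If $A = \beta A_-$, then just note that $\mathrm{bp}(A) = \mathrm{bp}(A_-)$ up to changing the names of the variables.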
Now assume n = 2. Part (ii) follows from Lemma 2. Part (iii) follows from part (ii) and the explicit computation of the coefficient of xy in the base polynomial bp(A).
Now assume n = 2 and q even. Since q is a 2-power, −2ae = 0 in . Let denote the set of all such that cqd + cdq = 0. Since q is even, if and only if c, d are non-zero elements of and . By Remark 2, the set is the set of all such that tq−1 = 1. Thus for every , there are exactly q − 1 elements such that , the elements . Take . Since is a perfect field and q is even, for every there is a unique such that w2 = z. Thus for all , there are unique a, e such that c, d, a, e satisfy (1).
Now we prove part (v). Assume n = 2 and q odd. Take a, c satisfying the first equation of (1) and set e≔a and d ≔ c. Note that all equations in (1) are satisfied. □
3. Conciseness of determinantal polynomials
Fix a field K and . The form f is concise over if and only if the degree d hypersurface is not a cone. Note that this criterion gives the same answer if we take the irreducible components of the hypersurface f = 0 with their multiplicity or not.
Fix fields and , d ≥ 2, f ≠ 0. Assume that K is perfect. The form f is concise over L if and only if it is concise over K.
Proof. If f is concise over a field K′ ⊃ K, then f is concise over K. Thus, it is sufficient to prove that if f is not concise over , then it is not concise over K. Assume that f is not concise over , i.e. that the closed hypersurface of with f as its equation is a cone with, say, vertex ; in the definition of , we allow the multiplicities of the indecomposable factors of f (Remark 8). The set is a non-empty linear subspace of . The decomposition of f in its irreducible factors and the linear subspace are defined over a finite extension K′ of K. Since K[x1, …, xn] is a UFD, we reduce to the case in which f is irreducible over K. Since K is perfect, each indecomposable factor of f over has multiplicity 1 and hence, up to a non-zero multiplicative constant, f is uniquely determined by the set (no multiplicity is required). Since K is perfect, there is a finite extension L of K′ such that L is a Galois extension of K, say with Galois group G. The finite group G acts on . Set . Let v be the minimal dimension of a linear subspace of contained in and containing . Let be the set of all v-dimensional linear subspaces of contained in . Since is a cone with vertex , v > 0, and . Since the embedding of in is defined over K, G acts linearly on and hence it acts on , i.e. each g ∈ G induces a permutation of . Thus, for all g ∈ G. Since each g ∈ G induces a permutation of and , we get for all g ∈ G. Thus is defined over K. Since is defined over K, there are n − e linear forms such that . Since is defined over K, there are such that y0, …, yn is a new system of coordinates of and yn−e, …, yn are the homogeneous coordinates of . Set W≔{yn−e = ⋯ = yn = 0}. Note that W is a linear subspace of defined over K, , and y0, …, yn−e−1 are homogeneous coordinates of W. Call the linear subspace of associated to W. Set . Since and is a cone with vertex , , i.e. u ≠ 0. Since f and W are defined over K, . Since is a cone with vertex , u (as an element of ) is an equation of .
Thus, f is not concise over K. □
For each prime power q and each n ≥ 2, let m(q, n) be the maximal integer m such that there are m Hermitian matrices such that the degree n form is concise over . By Lemma 4, we get the same integer m(q, n) if we prescribe that is concise over .
Take Hermitian matrices which are linearly dependent over . Then is not concise over .
Proof. Suppose for instance that Mm = c1M1 + ⋯ + cm−1Mm−1 for some . Take the new variables xi = ti + citm, 1 ≤ i ≤ m − 1, and xm = tm. Note that . □
For any prime power q we have m(q, 2) = 4.
Proof. The set of all Hermitian n × n matrices is an $n^2$-dimensional vector space over $\mathbb{F}_q$. Thus, Lemma 5 gives m(q, 2) ≤ 4. Hence, it is sufficient to prove that m(q, 2) ≥ 4. If q is even fix any $c \in \mathbb{F}_{q^2} \setminus \mathbb{F}_q$. If q is odd fix any $c \in \mathbb{F}_{q^2}$ such that $c^{4q} - 2c^{2q+2} + c^4 \ne 0$. Set
First assume q even. Since $c \notin \mathbb{F}_q$, then $c^{q-1} \ne 1$ and c ≠ 0. Thus $c^q + c \ne 0$. Consider the degree 2 binary form . Since the coefficients of $t_3^2$ and $t_4^2$ in $h(t_3, t_4)$ are the same and the coefficient of $t_3t_4$ is non-zero, $h(t_3, t_4)$ is not a square. Thus, $h(t_3, t_4)$ is concise. The binary form $t_1t_2$ in the variables $t_1$ and $t_2$ is concise. The quaternary form is concise, because the binary forms $t_1t_2$ and $h(t_3, t_4)$ are concise.
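Now assume q odd. We show that we may take $c \in \mathbb{F}_{q^2}$ such that $c^{4q} - 2c^{2q+2} + c^4 \ne 0$, i.e. c ≠ 0 and $c^{4q-4} - 2c^{2q-2} + 1 \ne 0$. If q ≥ 5, it is sufficient to use that $\#(\mathbb{F}_{q^2} \setminus \{0\}) = q^2 - 1 > 4q - 4$, the degree of the last polynomial in c. Now assume q = 3. Each $c \in \mathbb{F}_9$, c ≠ 0, satisfies $c^8 = 1$, and for q = 3 the condition reads $c^8 - 2c^4 + 1 = (c^4 - 1)^2 \ne 0$; hence it is sufficient to take c such that $c^4 \ne 1$, i.e. $c^4 = -1$. The quaternary form is concise if and only if the binary form $u(t_3, t_4)$ in the variables $t_3$, $t_4$ is concise. The binary form $u(t_3, t_4)$ is concise, because it has degree 2, $-c^{q+1} \ne 0$, and the polynomial $-c^{q+1}t^2 + (c^2 + c^{2q})t - c^{q+1}$, whose discriminant is $c^{4q} - 2c^{2q+2} + c^4$, has 2 distinct roots over the algebraic closure of $\mathbb{F}_q$ by our assumptions on c. □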
We ask the following question.
Fix n ≥ 2 and a prime power q. Set m ≔ m(q, n). Is it possible to find m Hermitian matrices M1, …, Mm such that defines a smooth hypersurface (smooth at all points of )?
Recall that a form f in n variables is concise if and only if the hypersurface {f = 0} is not a cone (Remark 8 and Lemma 4). For n = 2, the answer to Question 1 is trivially affirmative, because for quadric hypersurfaces not being a cone is equivalent to smoothness ([10, Lemma 5.1.1]).
Obviously m(q, n + 1) ≥ m(q, n) for all q and n. We do not know the rate of growth of m(q, n) for a fixed q and n ≫ 0. We have m(q, n) ≤ n2 for all n (Lemma 5), but we do not know the values of and .
4. Realization of homogeneous polynomials
In this section, we consider the realization problem, i.e. we ask for which homogeneous polynomial there are Hermitian matrices such that . The interested reader should consider the problem of the descriptions of the m-ples (M1, …, Mm) such that .
We only consider the cases m = 1 and m = 2 and the case m = 3 with , i.e. the case of base polynomials, and prove Proposition 1.
4.1 Forms in m ≤ 2 variables
Since and for each there is a Hermitian M1 such that det(M1) = a (even with M1 diagonal), the realization problem is trivially satisfied for m = 1.
Here we observe that the set of all binary n-forms realized by some is invariant for the action of on the variables x, y. For instance, and for any . Use that these transformations generate the group of projective transformations acting on binary forms.
Now take m = 2. We look for realizations of binary n-forms; we call x and y the two variables and M1 and M2 the two Hermitian matrices.
Take . Then there are Hermitian 2 × 2 matrices M1, M2 such that .
Proof. By Remark 13, it is sufficient to realize at least one element for each orbit for the action of .
The binary form 0 is realized by M1 = M2 = 0. The binary form x2 is realized taking and M2 = 0. The binary form x(x + y) is (up to an linear transformation of ) the only one with 2 distinct roots over . This form is realized taking and M2 = (bij), where b11 = b12 = b21 = 0 and b22 = 1.
Now we consider binary forms which split over , but not over .
First assume q odd. Up to an linear transformation it is sufficient to realize the form x2 − ay2 with a not a square in . Take such that cq+1 = a (Remark 2). Take and B = (bij), where b11 = b22 = 0, b12 = c and b21 = cq.
Now assume q = 2e even. Since every element of is a square, the form x2 + cy2 splits and hence up to an linear transformation, it is sufficient to realize the form x2 + xy + δy2, where has non-zero absolute trace D(δ), where for any ([10, p. 3]). Fix any and take such that cq+1 = δ. Take and B = (bij), where b11 = 1, b12 = c, b21 = cq and b22 = 0. □
4.2 Base polynomials
Now we take m = 3, $M_3 = I_{n\times n}$, $M_1 = A_+$, $M_2 = A_-$ for some $A \in M_{n,n}(\mathbb{F}_{q^2})$. By Remark 4, it is not restrictive to assume the existence of a matrix A such that $M_1 = A_+$ and $M_2 = A_-$. We call x, y and z the variables. Every degree n base polynomial contains the monomial $z^n$ with coefficient 1. We call such forms monic.
Are there other restrictions?
Let denote the set of all polynomials bp(A) with . Take any and any . Since , we have and . Thus, bp(A + aA−)(x, y, z) = bp(A)(x + ay, y, z). Hence, is invariant for the linear transformations x↦x + ay, y↦y, z↦z. Since , we have and . Thus, bp(A + aβA+)(x, y, z) = bp(A)(x, ax + y, z). Hence, is invariant for the linear transformations x↦x, y↦ax + y, z↦z. Since , we have and . Thus, . Thus, is invariant for the linear transformations x↦x, y↦y, z↦z + ax + by. Thus, the set is invariant for all changes of coordinates such that g33 = 1.
Take a monic such that f = gh for some monic g, h and 0 < a≔ deg(g) < d. Assume g = bp(A) and h = bp(B) for some , . Then f = bp(A ⊕ B). In particular, if f splits over as a product of n monic linear forms (we allow multiple linear forms), then f = bp(M) for some . Now assume that f is the product of n linear forms over , say f = L1⋯Ln with Li = cizi + aix + biy, but we allow that some of the forms are not monic. We get , and hence f is the product of the n monic linear forms .
Proof of Proposition 1: By Remark 14, it is sufficient to realize at least one form for each orbit for the action of the subgroup of linear changes of coordinates described in Remark 15. The plane conics over $\mathbb{F}_q$ are classified in Ref. [10] in terms of their rank.
There is a unique rank 1 monic conic, $z^2$. The form $z^2$ is realized as a base polynomial taking $M_1 = M_2 = 0$.
Rank 2 monic conics form 2 orbits: the ones which are the union of 2 lines defined over $\mathbb{F}_q$ and the ones induced by a form indecomposable over $\mathbb{F}_q$, but decomposable over $\mathbb{F}_{q^2}$. We first check that all rank 2 monic conics which split over $\mathbb{F}_q$ are realized as a base polynomial. For any q, we realize the polynomial (z + x)(z + y) taking the matrix $A = A_+ + \beta A_- = (a_{ij})$ with $a_{12} = a_{21} = 0$, $a_{11} = 1$ and $a_{22} = \beta$.
There is, up to a projective transformation, another rank 2 conic ([10, Th. 5.1.6 for q odd, Th. 5.1.7 for q even]).
First assume q odd. We need to represent the equation dx2 + z2 with and d not a square. Take A = (aij) with a11 = d, a22 = 1 and a12 = a21 = 0 (so that A+ = A and A− = 0).
Now assume q even, say q = 2e for some e > 0. Since every element of is a square, the form z2 + cy2 splits and hence up to an linear transformation it is sufficient to realize as a base polynomial the form z2 + zy + δy2, where has non-zero absolute trace D(δ), where for any ([10, p. 3]). Fix any and take such that cq+1 = δ. Take A = 0 and B = (bij), where b11 = 1, b12 = c, b21 = cq and b22 = 0.
For any finite field, up to a projective transformation, there is a unique smooth projective conic ([10, Theorems 5.1.6 and 5.1.7]), and we may take $z(z + x) - y^2$ as its equation. Use the matrix $C = (c_{ij})$ with $c_{11} = 1$, $c_{12} = c_{21} = \beta$ and $c_{22} = 0$, which has $\mathrm{bp}(C) = z(z + x) - y^2$ (any q). □
Remark 15 and Proposition 1 give that every reducible monic is a base polynomial.
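The base polynomial of the matrix C can be checked numerically in the smallest odd case. The following is a minimal sketch under two stated assumptions: q = 3 with the illustrative model F_9 = F_3[i]/(i^2 + 1) and β = i, and the base polynomial taken as the determinant of x·C+ + y·C− + z·I, which is the form of Kippenhahn's definition. All helper names are hypothetical. Both sides have degree at most 2 = q − 1 in each variable, so agreement at every point of F_3^3 is a meaningful check.

```python
from itertools import product

Q = 3  # q = 3; beta = i in F_9 = F_3[i]/(i^2 + 1) (an assumed model)
ZERO, ONE, TWO, BETA = (0, 0), (1, 0), (2, 0), (0, 1)

def add(u, v):
    return ((u[0] + v[0]) % Q, (u[1] + v[1]) % Q)

def neg(u):
    return ((-u[0]) % Q, (-u[1]) % Q)

def mul(u, v):
    # (a + bi)(c + di) = (ac - bd) + (ad + bc)i, because i^2 = -1
    return ((u[0] * v[0] - u[1] * v[1]) % Q, (u[0] * v[1] + u[1] * v[0]) % Q)

def power(u, k):
    result = ONE
    for _ in range(k):
        result = mul(result, u)
    return result

def inv(u):
    return power(u, Q * Q - 2)  # inverse of a non-zero element of F_9

def dagger(M):
    return [[power(M[j][i], Q) for j in range(2)] for i in range(2)]

def combine(s, M, t, N):
    # entrywise s*M + t*N for scalars s, t in F_9
    return [[add(mul(s, M[i][j]), mul(t, N[i][j])) for j in range(2)]
            for i in range(2)]

def det2(M):
    return add(mul(M[0][0], M[1][1]), neg(mul(M[0][1], M[1][0])))

C = [[ONE, BETA], [BETA, ZERO]]
I2 = [[ONE, ZERO], [ZERO, ONE]]

half = inv(TWO)
C_plus = combine(half, C, half, dagger(C))    # (C + C^dagger)/2
s = inv(mul(TWO, BETA))
C_minus = combine(s, C, neg(s), dagger(C))    # (C - C^dagger)/(2*beta)

def bp_value(x, y, z):
    # value of det(x*C_plus + y*C_minus + z*I) at a point of F_3^3
    M = combine((x, 0), C_plus, (y, 0), C_minus)
    return det2(combine(ONE, M, (z, 0), I2))

matches = all(
    bp_value(x, y, z) == ((z * (z + x) - y * y) % Q, 0)
    for x, y, z in product(range(Q), repeat=3)
)
print(C_plus, C_minus, matches)
```

In this model C+ is the diagonal matrix with entries 1, 0 and C− is the matrix with zero diagonal and off-diagonal entries 1, so the determinant is (x + z)z − y^2.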
5.
A. Beauville studied the realization over a finite field of a form as the determinant of a matrix with entries linear forms ([9]). In this section, we use [9] for matrices such that and . Obviously this very strong assumption depends on the choice of . For any q, it requires that , but it is stronger.
Take symmetric matrices . Set M ≔ A + βB. Since A, B are symmetric and with coefficients in , they are Hermitian. Thus, M+ = A and M− = B. The matrix is symmetric, hence in this case bp(M) is the determinant of a symmetric matrix of linear forms. Conversely, any symmetric matrix of linear forms over with z appearing only in the diagonal and with all coefficients 1 is obtained in this way for some symmetric matrices.
Assume . Then every smooth plane curve of degree n defined over is of the form for some .
Proof. Let X be a smooth plane curve of degree n defined over . The curve X has genus g≔(n − 1)(n − 2)/2. To get a determinantal equation of X over , it is necessary and sufficient to find a degree g − 1 line bundle L on X defined over and such that $h^0(L) = 0$ ([9, Proposition 3.1]). Assume . Any smooth projective curve C of genus g defined over satisfies by the Hasse–Weil theorem ([12, Theorem 9.18]). A theorem proved in Refs. [13, 14] and quoted in [15, Proposition 2.2] says that any smooth genus γ curve C such that has a degree γ − 1 line bundle L defined over and with $h^0(L) = h^1(L) = 0$. □
The lower bound on q in Proposition 4 is not sharp. The existence of a line bundle L as in the proof of Proposition 4 is related to the computational complexity of the multiplication in finite extensions of a finite field ([13–17]).
The paper [18] and its references give better information on the number of points of smooth plane curves with a fixed degree and large q. The Hasse–Weil bound and related tools may also be used for singular plane curves ([19–21]). See Ref. [22] for results on .
Note that given any , f ≠ 0, it is computationally easy to check (a system with the coefficients of f and its partial derivatives) if the plane curve {f = 0} is smooth (smooth at all points, not only at its points). It is also very easy to check when a trivariate polynomial is monic with respect to z. We do not have an always working (or always working for large q) criterion to realize a monic polynomial as bp(A) for some , but Remark 17 is sufficient if the monic polynomial is the determinant of a symmetric matrix. If q is odd, this is the content of [9, Proposition 4.2].
Funding: The author was partially supported by MIUR and GNSAGA of INdAM (Italy).
References
Competing Interests
Conflict of interests and funding declaration: The author has no conflict of interest and this research received no funding.
