In mathematics, the conjugate transpose, also known as the Hermitian transpose, of an m × n {\displaystyle m\times n} complex matrix A {\displaystyle \mathbf {A} } is an n × m {\displaystyle n\times m} matrix obtained by transposing A {\displaystyle \mathbf {A} } and applying complex conjugation to each entry (the complex conjugate of a + i b {\displaystyle a+ib} being a − i b {\displaystyle a-ib} , for real numbers a {\displaystyle a} and b {\displaystyle b} ). There are several notations, such as A H {\displaystyle \mathbf {A} ^{\mathrm {H} }} or A ∗ {\displaystyle \mathbf {A} ^{*}} ,[1] A ′ {\displaystyle \mathbf {A} '} ,[2] or (often in physics) A † {\displaystyle \mathbf {A} ^{\dagger }} .
For real matrices, the conjugate transpose is just the transpose, A H = A T {\displaystyle \mathbf {A} ^{\mathrm {H} }=\mathbf {A} ^{\operatorname {T} }} .
The conjugate transpose of an m × n {\displaystyle m\times n} matrix A {\displaystyle \mathbf {A} } is formally defined by
where the subscript i j {\displaystyle ij} denotes the ( i , j ) {\displaystyle (i,j)} -th entry (matrix element), for 1 ≤ i ≤ n {\displaystyle 1\leq i\leq n} and 1 ≤ j ≤ m {\displaystyle 1\leq j\leq m} , and the overbar denotes a scalar complex conjugate.
This definition can also be written as
where A T {\displaystyle \mathbf {A} ^{\operatorname {T} }} denotes the transpose and A ¯ {\displaystyle {\overline {\mathbf {A} }}} denotes the matrix with complex conjugated entries.
Other names for the conjugate transpose of a matrix are Hermitian transpose, Hermitian conjugate, adjoint matrix or transjugate. The conjugate transpose of a matrix A {\displaystyle \mathbf {A} } can be denoted by any of these symbols:
In some contexts, A ∗ {\displaystyle \mathbf {A} ^{*}} denotes the matrix with only complex conjugated entries and no transposition.
Suppose we want to calculate the conjugate transpose of the following matrix A {\displaystyle \mathbf {A} } .
We first transpose the matrix:
Then we conjugate every entry of the matrix:
A square matrix A {\displaystyle \mathbf {A} } with entries a i j {\displaystyle a_{ij}} is called
Even if A {\displaystyle \mathbf {A} } is not square, the two matrices A H A {\displaystyle \mathbf {A} ^{\mathrm {H} }\mathbf {A} } and A A H {\displaystyle \mathbf {A} \mathbf {A} ^{\mathrm {H} }} are both Hermitian and in fact positive semi-definite matrices.
The conjugate transpose "adjoint" matrix A H {\displaystyle \mathbf {A} ^{\mathrm {H} }} should not be confused with the adjugate, adj ( A ) {\displaystyle \operatorname {adj} (\mathbf {A} )} , which is also sometimes called adjoint.
The conjugate transpose can be motivated by noting that complex numbers can be usefully represented by 2 × 2 {\displaystyle 2\times 2} real matrices, obeying matrix addition and multiplication: a + i b ≡ [ a − b b a ] . {\displaystyle a+ib\equiv {\begin{bmatrix}a&-b\\b&a\end{bmatrix}}.}
That is, denoting each complex number z {\displaystyle z} by the real 2 × 2 {\displaystyle 2\times 2} matrix of the linear transformation on the Argand diagram (viewed as the real vector space R 2 {\displaystyle \mathbb {R} ^{2}} ), affected by complex z {\displaystyle z} -multiplication on C {\displaystyle \mathbb {C} } .
Thus, an m × n {\displaystyle m\times n} matrix of complex numbers could be well represented by a 2 m × 2 n {\displaystyle 2m\times 2n} matrix of real numbers. The conjugate transpose, therefore, arises very naturally as the result of simply transposing such a matrix—when viewed back again as an n × m {\displaystyle n\times m} matrix made up of complex numbers.
For an explanation of the notation used here, we begin by representing complex numbers e i θ {\displaystyle e^{i\theta }} as the rotation matrix, that is, e i θ = ( cos θ − sin θ sin θ cos θ ) = cos θ ( 1 0 0 1 ) + sin θ ( 0 − 1 1 0 ) . {\displaystyle e^{i\theta }={\begin{pmatrix}\cos \theta &-\sin \theta \\\sin \theta &\cos \theta \end{pmatrix}}=\cos \theta {\begin{pmatrix}1&0\\0&1\end{pmatrix}}+\sin \theta {\begin{pmatrix}0&-1\\1&0\end{pmatrix}}.} Since e i θ = cos θ + i sin θ {\displaystyle e^{i\theta }=\cos \theta +i\sin \theta } , we are led to the matrix representations of the unit numbers as 1 = ( 1 0 0 1 ) , i = ( 0 − 1 1 0 ) . {\displaystyle 1={\begin{pmatrix}1&0\\0&1\end{pmatrix}},\quad i={\begin{pmatrix}0&-1\\1&0\end{pmatrix}}.}
A general complex number z = x + i y {\displaystyle z=x+iy} is then represented as z = ( x − y y x ) . {\displaystyle z={\begin{pmatrix}x&-y\\y&x\end{pmatrix}}.} The complex conjugate operation (that sends a + b i {\displaystyle a+bi} to a − b i {\displaystyle a-bi} for real a , b {\displaystyle a,b} ) is encoded as the matrix transpose.[3]
The last property given above shows that if one views A {\displaystyle \mathbf {A} } as a linear transformation from Hilbert space C n {\displaystyle \mathbb {C} ^{n}} to C m , {\displaystyle \mathbb {C} ^{m},} then the matrix A H {\displaystyle \mathbf {A} ^{\mathrm {H} }} corresponds to the adjoint operator of A {\displaystyle \mathbf {A} } . The concept of adjoint operators between Hilbert spaces can thus be seen as a generalization of the conjugate transpose of matrices with respect to an orthonormal basis.
Another generalization is available: suppose A {\displaystyle A} is a linear map from a complex vector space V {\displaystyle V} to another, W {\displaystyle W} , then the complex conjugate linear map as well as the transposed linear map are defined, and we may thus take the conjugate transpose of A {\displaystyle A} to be the complex conjugate of the transpose of A {\displaystyle A} . It maps the conjugate dual of W {\displaystyle W} to the conjugate dual of V {\displaystyle V} .