Skip to Main Content

Topological Photonics: A Mathematical Perspective

Ross Parker
Alejandro Aceves

Communicated by Notices Associate Editor Reza Malek-Madani

1. Introduction

Topological photonics is a framework that follows both condensed matter physics and topology. It refers to designing the guiding properties of the propagating medium (e.g., a photonic crystal or a waveguide lattice) in such a way that the transport of electromagnetic energy is realized in unique, robust, and sometimes unexpected ways. Consider a simple thought experiment: imagine first the two-dimensional wave equation on a square domain, and assume homogeneous Dirichlet boundary conditions. We know that the accessible modes extend in periodic form throughout the whole domain and, in time, waves can propagate in all directions. This behavior is in response to the inherent symmetries of the medium. Imagine instead that we engineer the medium in such a way that all the energy concentrates in the boundary of the medium and propagates in only one direction. (In the language of optics, this would be seen as inhibiting back reflection and making the bulk medium act like an insulator).

Typically, in describing a photonic system, we refer to physical quantities such as frequency, wave vector, polarization, and dispersion. Instead, in the relatively new field of topological photonics, the term “topology” refers to a property of a photonic material that characterizes global behavior of the wavefunctions on their entire dispersion map; most importantly, this property takes quantized values. Think of this as characterizing the “genus” of an object, like a doughnut, with the “object” being described in wave vector space rather than in physical space. There are analogues in photonics to the topological fact that continuous deformations will not change the genus of an object. As an example, photonic topological insulators that are designed using artificial materials can support topologically nontrivial unidirectional states of light. These states are characterized by a particular “genus-like” number. Since this number is quantized, this unidirectional property will be robust to perturbations in the underlying photonic structure.

Photonics research often parallels or aims at explaining phenomena in other physical contexts. Bose-Einstein condensation in condensed matter physics is governed by the Gross-Pitaevskii equation, which is identical to the nonlinear Schrödinger equation that governs intense laser beam propagation in a dielectric medium such as air. In the quantum realm, nontrivial states of two-dimensional matter (e.g., a periodic lattice of atoms) with broken time-reversal symmetry can have the property that the bulk is an insulator while states (modes) exist that carry current along the sample edges without dissipation. The characteristic “genus-like” integer is called the Chern number (see section 3 for an example), which arises out of topological properties of the material’s band structure (see the discussion in section 2 and section 3 below).

In photonic crystals, a periodic variation of the dielectric properties of the medium affects photons in the same manner as solids modulate electrons (with the caveat that photons are bosons, while electrons are fermions). The question is whether the topological features are replicated in the analogous photonic system. In two foundational papers by Haldane and Raghu HR08RH08, the authors highlight the photonics analogue to quantum properties. They demonstrate the ingredients necessary to create a “one-way waveguide” which exhibits properties similar to the Quantum Hall Effect. While the model in HR08 has not been experimentally realized, it motivated further work by Wang, Chong, Joannopoulos, and Soljačić, in which they first predicted the existence of edge states in a magneto-optical crystal in the microwave regime WCJS08 and then demonstrated these experimentally WCJS09. Experiments by Rechtsman et al. RZP13 found topological edge states without the need for an external magnetic field by using a photonic crystal comprising helical waveguides.

Since then, the field of topological photonics has matured and continues to be very active, both in theory and experiments, as well as in the linear and nonlinear regimes. While we have briefly discussed its origins, it is not our purpose to give a detailed history of the field (for this purpose, we point the interested reader to the review articles JS14LCG22). Instead, we will focus our discussion on three prototypical examples: the one-dimensional Su-Schrieffer-Heeger model, the two-dimensional Haldane model, and the model of a photonic Floquet topological insulator from RZP13. We hope that this article highlights why this is a fertile area for mathematicians to explore and contribute to with their expertise.

2. SSH Model

The Su-Schrieffer-Heeger (SSH) model SSH79, is the simplest lattice model that exhibits topological features. It was devised to describe the electrical conductivity in a doped polyacetylene polymer chain. The lattice comprises repeating, two-node unit cells, where the couplings within and between unit cells are given by and , respectively (Figure 1, top). The optical analogue is a linear lattice of waveguides in which the nearest-neighbor couplings are staggered (this can be implemented, e.g., by altering the physical spacings between the fibers). Mathematically, the SSH model can be described by the discrete nonlinear Schrödinger equation

where is the th unit cell, and is the strength of the cubic nonlinearity. (A rigorous mathematical derivation can be found in AC22).

Figure 1.

Top: schematic of SSH model, unit cell in dotted box. Coupling constant is within unit cell, and between unit cells. Bottom: reciprocal lattice, first BZ shown in red.

Graphic without alt text

Our analysis follows that of AOP16, Chapter 1 and AC22, Section 8. The topological features of the optical SSH model can be understood by studying the linear model (). As a first step, we look for plane wave solutions of the form

where is the frequency and is the wavenumber. Equation (2) is periodic in with period , since for any integer . The points define another linear lattice, which is called the reciprocal lattice (Figure 1, bottom). The first Brillouin zone (BZ) is the set of points closer to the origin than any other point of the reciprocal lattice, which in this case is the interval . Due to the -periodicity, the BZ is topologically equivalent to the unit circle .

Substituting the ansatz (2) into (1) and simplifying, we obtain the -dependent eigenvalue problem , where , and is the Hermitian matrix

Since is -periodic, we only need to consider , i.e., in the first BZ. We note that since we are posing the problem on the full integer lattice, can take any value in . The eigenvalues of are

which is the dispersion relation relating the frequency and the wavenumber . Each eigenvalue is a continuous function of the wavenumber on the first BZ, and is called a band. Since is a matrix, the SSH model has two bands. All of the bands of the system form its band structure, which is illustrated in the left column of Figure 2. (These terms are borrowed from solid state physics, where the band structure describes the energy levels that electrons can occupy in a solid).

Figure 2.

Left: band structure of the SSH model for (top) and (bottom). Right: circle in the complex plane traced counterclockwise by for .

Graphic without alt text

When , there is a space between the upper and lower bands, which is known as a band gap. This band gap has size , where . The band gap closes when . Since the eigenvalues (3) are unchanged if and are exchanged, it appears at first glance that the cases and are identical, i.e., that the problem is symmetric about . Interestingly, this is not the case. For a complete picture, we need to look at the eigenvectors of as well.

The normalized eigenvectors corresponding to the eigenvalues are given by

Since is a complex number of unit modulus whose argument is the same as that of , we can write as

As the wavenumber varies from to over the BZ, the complex number traces a clockwise circle in the complex plane with center and radius . This circle encloses the origin when , but does not when (Figure 2, right column). The topological invariant is the winding number of , which is the number of times travels counterclockwise around the origin. We can see from Figure 2 that

where a winding number of represents a single clockwise trip around the origin. (The winding number is undefined if ).

The same topological information can be obtained in a different way by computing a quantity known as the Berry phase Ber84 (also known as the Zak phase Zak89 in 1D). Intuitively, the Berry phase is the phase angle accumulated by a complex vector, e.g., one of the eigenvectors , around a closed contour in -space. We will take a brief digression to discuss these concepts, following AOP16, Chapter 2 and Van18, Chapter 3, before computing them for the SSH model.

Let be a normalized eigenvector of . For any wavenumbers and , we define the relative phase between and by

where is the phase, or argument, of the complex number . (We are using the Hermitian inner product , where the dagger symbol denotes the conjugate transpose; the complex conjugation is placed on the first component to be consistent with the Dirac notation of quantum mechanics). The relative phase satisfies the equation

It is important to note that the eigenvector is not unique. In particular, since it is a unit vector, it is specified only up to multiplication by a constant unit complex number . The transformation is called a gauge transformation. The relative phase is not invariant under a gauge transformation, since if we take , transforms to

thus .

We wish to define the change of phase of in such a way as to be gauge invariant. To do this, we take a sequence of points in -space ordered in a loop. We then define the discrete Berry phase by

which is the phase accumulated by around the loop. Unlike the relative phases , the Berry phase is gauge invariant; if we take the gauge transformations , the Berry phase transforms to , which is equal to , since all of the cancel. We note that the Berry phase is only unique up to an integer multiple of unless we take the principal value of the argument, i.e., restrict to .

We now move from discrete to continuous. In particular, we wish to compute the phase accumulated by along a continuous, closed path. For small , let be the relative phase accumulated between and . Following (8), satisfies the equation

Since is small and is a unit vector, the denominator in (9) is approximately 1, thus

Expanding both sides in a Taylor series to first order in and and simplifying, we find that is approximately given by

We define the Berry connection by

which is the coefficient of on the RHS of (10). We then define the Berry phase to be the integral of Berry connection around a closed contour :

As in the discrete case, the Berry connection is not gauge invariant, while the Berry phase is invariant under gauge transformations (modulo integer multiples of ).

Returning to the SSH model, we first compute the Berry connection using the eigenvector :

We then compute the Berry phase by integrating the Berry connection from to . This is a closed contour in the BZ since the endpoints of correspond to the same point on the unit circle . The Berry phase is

which is the change in phase of the eigenvector over the BZ. Since it is a constant multiple of (6), it conveys the same information as the winding number.

Figure 3.

Eigenvalues of for (top left), (top right), and (bottom left). Edge mode eigenvectors for (bottom right); (solid blue line) and (dotted orange line) are either in-phase (top) or out-of-phase (bottom). unit cells with Dirichlet boundary conditions.

Graphic without alt text

The fundamental topological difference between the two asymmetric lattice configurations ( and ) becomes evident when we consider a finite lattice. Specifically, we take a lattice comprising waveguides ( unit cells) with Dirichlet boundary conditions at the two ends ( and ). For the linear system, solutions are standing waves of the form , where and represent the and sublattices of the system. Substituting this ansatz into (1) yields the eigenvalue problem , where and is the off-diagonal block matrix

An intuitive understanding of the difference between the two cases can be gained by considering the two extreme configurations, where one of the coupling constants is set to 0. If and , the lattice comprises independent dimers with internal coupling constant . The eigenvalues of are , each with multiplicity . On the other hand, if and , the lattice instead comprises independent dimers (staggered from the ones in the previous case) with internal coupling constant , as well as two unconnected nodes at the ends of the lattice. In addition to eigenvalues at , each with multiplicity , the matrix has two eigenvalues at . The eigenvectors corresponding to these zero eigenvalues are and . These are called edge modes, since they are localized at the ends of the lattice.

Since is a matrix, its spectrum is a discrete set of eigenvalues, as opposed to the two continuous bands of eigenvalues found from the dispersion relation in the infinite lattice case. The eigenvalues of are shown in Figure 3 for , , and . The two asymmetric configurations contain a “gap,” which closes when . This eigenvalue gap is analogous to the band gap in the infinite lattice case. When , there are no eigenvalues in this gap, and all of the eigenvectors are nonlocalized. When , however, there are two eigenvalues close to (but not exactly at) 0 which lie within this gap (these eigenvalues approach 0 in the limit ). As in the case where , these eigenvalues correspond to edge modes, since and are localized to the left and right edges of the lattice, respectively (Figure 3, bottom right). All of the remaining modes are nonlocalized.

Finally, we briefly comment on what occurs when a cubic nonlinearity ( is present (see MS21 for a more thorough treatment). Standing wave solutions of the form solve the equation , where . Numerical continuation experiments show that the edge modes from the linear model persist for small .

3. Haldane Model

We now turn to a two-dimensional model. We start with a honeycomb lattice (Figure 4), which is constructed from a two-site unit cell, with sites labeled and . These unit cells tile the plane periodically along the two primitive lattice vectors and to obtain a hexagonal lattice. We note that the -sites and -sites form two offset, triangular sublattices. This structure is similar to that of the material graphene, which is a hexagonal lattice constructed entirely from carbon atoms. The spatial location of a unit cell is specified by the vector , where . It is therefore natural to index the unit cells by the vector ; the locations of the lattice sites and in unit cell are and , respectively. Each node in the honeycomb lattice is connected to its three nearest neighbors with coupling strength , which is a coupling between sublattices. The directions of the nearest-neighbor (NN) couplings are given by the vectors , , and , which are depicted in Figure 4. The resulting linear model can be written as

where the angle brackets indicate that the sum is taken over nearest neighbors. The system (13) obeys time-reversal symmetry, i.e., is invariant under the transformation , .

The Haldane model Hal88 adds two more terms to the honeycomb model. We will see that this results in a band gap in the spectrum, similar to what occurs in the SSH model with unequal couplings. First, the Haldane model has an on-site energy term of magnitude , which takes opposite signs on the and sublattices. In addition, there is an imaginary, next-nearest neighbor (NNN) coupling term with strength . (In the original Haldane model, this coupling term has the complex strength ; we take here for simplicity). Each node is coupled to its six next-nearest neighbors, which is a coupling within sublattices. These couplings are staggered so that there is no net flux into or out of a single lattice site. The directions of the next-nearest neighbor couplings are given by the vectors , , and (Figure 4). Using this notation, the linear model for the Haldane lattice is

where the double angle brackets indicate that the sum is taken over next-nearest neighbors. The signs of the NNN couplings are indicated by the arrows in Figure 4, where outward and inward pointing arrows denote couplings of and , respectively. The arrangement of the arrows in two staggered, counterclockwise triangles ensures no net flux results from the NNN term. Time-reversal symmetry is broken when , but is unaffected by the on-site term .

Figure 4.

Schematic of the Haldane lattice. Rhombus is unit cell with sites and . Primitive lattice vectors and . Nearest neighbor coupling vectors , , and . Next-nearest neighbor coupling vectors , , and .

Graphic without alt text

As in the SSH model, the first step is to compute the band structure, which is found by looking for plane wave solutions to (14) of the form

where