The notion of an adjunction between categories is a fundamental idea in category theory whose utility is so great that it appears in virtually every field of mathematics. Roughly speaking, an adjunction is a weaker form of equivalence between two categories, often allowing one to translate problems in one category to problems in another category. In this post, we provide multiple equivalent interpretations of adjunctions, and demonstrate how they are all interrelated.
To start, we give the "standard" definition of adjunction.
Definition. Let F:C→D and G:D→C be a pair of functors. If there exists a natural isomorphism
HomD(F[−],−)≅HomC(−,G[−]),
between functors Cop×D→Set, then we say that F is a left adjoint of G and that G is a right adjoint of F, and write F⊣G.
Intuitively, this definition states that morphisms F(X)→Y are equivalent to morphisms X→G(Y). Typically, arguments leveraging the fact that two functors are adjoint will make use of this standard definition. For the remainder of the article, we discuss three equivalent alternative definitions of adjunction, each one providing a unique perspective on what it means for a pair of functors to be mutually adjoint.
While the primary definition given above defines adjunction as a binary relation, it turns out that if a functor has a right or left adjoint, then that adjoint is determined up to natural isomorphism. Thus, it is perfectly acceptable to define what it means for a functor to be a left (or right) adjoint, and later show how to recover the corresponding adjoint functor. Two of the interpretations we discuss are these so called "asymmetric" ways to formulate adjunction:
Representability. Roughly speaking, a functor has an adjoint if and only if a certain family of presheafs (or copresheafs) defined by the functor consists only of representable functors. The role of the adjoint in this viewpoint is to describe the representing object of each presheaf or copresheaf.
Approximations. In certain cases, one might describe a morphism in a category as a way of approximating one object by another. A functor has an adjoint if for every object in the target category, one can find a "universal" approximation of it by an object in the image of the functor. The role of the adjoint in this viewpoint is describe which objects in the source category produce these approximations.
For a given functor F:C→D, suppose G:D→C is the "solution" to the representability or approximation problem posed by F. As a consequence of the symmetry inherent the original definition, one can show that F must also be the solution to the "dual" representability or approximation problem posed by G. It turns out that this phenomenom implies a type of idempotence in the construction of optimal solutions, which may be expressed as a pair of equations referred to as the counit-unit equations. These equations yield a third equivalent formulation of adjunction.
Representability
For an object Y∈D the representable presheaf HomD(−,Y) describes all the ways objects in D can fit their structure into Y. Part of the Yoneda lemma implies that this presheaf "encodes" all the data of Y; in particular, an isomorphism of representable presheaves corresponds an to isomorphism of their representing objects. Precomposing by F:C→D, we obtain a presheaf HomD(F[−],Y) describing all the ways objects in C can fit their structure into Y via F. A natural question to ask is if this functor is also representable: does this presheaf encode the information of some object X∈C? Note that this representing object, if it exists, is determined up to isomorphism by the Yoneda lemma.
Similarly, one may also pose the "dual" representability problem: for X∈C, when is the copresheaf HomD(X,G[−]) representable? As we show shortly, the data of an adjunction is equivalent to the existence of solutions to either representability problem. First, note that an adjunction Ψ already provides natural isomorphisms
given by θY,X(ϕ)=ΨX,Y(ϕ) and ξY,X(ψ)=ΦX,Y(ψ), so an adjunction yields solutions to both representability problems. Before proving the converse implication, we establish some helpful notation that will be used throughout the rest of the article:
Suppose H:Cop→Set is a presheaf. Given s∈H(X) and morphism f:X′→X, we say that f∗s=def[H(f)](s) is the pullback of s along f.
Suppose H:D→Set is a copresheaf. Given s∈H(Y) and morphism g:Y→Y′, we say that g∗s=def[H(g)](s) is the pushforward of s along g.
Suppose H:Cop×D→Set be a profunctor. Given s∈H(X,Y) and morphisms f:X′→X and g:Y→Y′, we say that
f∗sg∗s=def[H(f,idY)](s),=def[H(idX,g)](s)
are the pullback and pushforward of s along f and g, respectively.
With this established, we now demonstrate how a solution to the representability problem for F gives rise to an adjunction.
Proposition. Let C and D be categories and F:C→D be a functor such that for all Y∈D, the presheaf HomD(F[−],Y) is representable. A collection of presheaf isomorphisms
Given a morphism g:Y→Y′, consider the square (to be shown commutative)
consisting of natural transformations. By the Yoneda lemma, a morphism of presheaves Hom(−,GY)⇒Hom(−,GY′) is determined by where the identity section idGY is mapped. But by definition,
so the above square commutes. Thus, if h:Y′→Y′′ is another morphism, we may paste together the corresponding squares to obtain the commutative diagram
Taking the component at idGY and using the fact that よ is functorial implies that GΘ(h∘g)=GΘ(h)∘GΘ(g). To show that F and GΘ are adjoints, consider the map
Ψ:HomD(F[−],−)→HomC(−,G[−]);ΨX,Y(ϕ)=θY,X(ϕ).
To show this is a natural isomorphism, it suffices to show that Ψ is natural in the second argument, since θY is already a natural isomorphism. Indeed,
for any morphism g:Y→Y′, we have
where the middle equalities follow from the commutativity of the above square. Finally, we show that any other collection Θ={θY}Y∈D yields a functor GΘ naturally isomorphic to GΘ. By the Yoneda lemma, the isomorphism
θY∘θY−1:HomC(−,GΘY)⟶≅HomC(−,GΘY)
is induced by a unique isomorphism τY:GΘ(Y)→GΘ(Y) via the Yoneda embedding. It follows that the diagram
commutes. Taking the identity component implies that τ:GΘ⇒GΘ is a natural isomorphism.
■
As one should expect, one may recover an adjunction between C and D from a solution to the "dual" representability problem for the copresheafs HomC(−,G[−]). We omit the proof as it is essentially the same as the previous argument.
Proposition. Let C and D be categories and G:D→C be a functor such that for all X∈C, the copresheaf HomD(X,G[−]) is representable. A collection of copresheaf isomorphisms
for which FΞ⊣G. For any other collection of isomorphisms Ξ, the functors FΞ and FΞ are naturally isomorphic.
Proof. Repeat the above argument, this time making use of the "contravariant" Yoneda lemma.
■
Approximations
Occasionally, morphisms in a category can be interpreted as approximations of the structure of the target by the source of the morphism, or vice versa. Given a functor F:C→D and an object Y∈D, a natural question to ask is if there exists "universal" approximation of Y by objects in the image of F.
To be precise, let us call a pair (X,ϕ:F(X)→Y) a morphism fromFtoY, or a left approximation ofYbyF(X). Since there is no confusion if F is understood, we may call this a left approximation of Y by X as well. Together, these approximations form the comma category(F↓Y); a morphism (X,ϕ)→(X′,ϕ′) is given by a morphism X→X′ which makes the triangle
commute. The existence of a morphism (X,ϕ)→(X′,ϕ′) essentially states that ϕ′ captures at least as much of the structure of Y as ϕ does: by precomposing with the map induced by X→X′, one may recover ϕ from ϕ′, but not vice versa necessarily. This suggests that the desired "universal" left approximation should be a terminal object of (X↓F) if one exists; this is called a universal morphism from F to Y. In more concrete terms, a universal morphism is an object (X0,ϕ0) satisfying the following property:
Given any morphism ϕ:F(X)→Y, there exists a unique morphism ψ:X→X^ such that ϕ=ϕ0∘F(ψ).
As a word of caution, a universal morphism might not coincide with what one might consider the "best" approximation of Y. For example, if Y=F(X) for some X∈C, it is usually not the case that (X,idY) is a universal morphism. Rather, one should think of a universal morphism as containing the data of all other approximations of Y and nothing else. In special cases, such as when C and D are posets, these notions do coincide, as illustrated in the following example.
Example. Let C and D be posets and let F:C→D be an order-preserving map. For instance, we may take C=Z and D=R, and let F be the inclusion map.
Given an element y∈D, a left approximation is given by any element x∈C such that F(x)≤Dy. If it exists, the universal approximation to y is x0=maxC{x∣F(x)≤Dy}, whose image is the "best" approximation of y from the left. In the example above, a universal left approximation to y is given by its floor⌊y⌋∈Z.
Instead of approximating objects from the left, we may try to solve the dual problem of approximating objects from the right, given a functor G:D→C and an object X∈C.
In this case, let us call a pair (Y,ψ:X→G(Y)) a morphism fromXtoG, or a right approximation ofXbyG(Y). Since there is no confusion if G is understood, we may call this a right approximation of X by Y as well. Together, these approximations form the comma category(X↓G); a morphism (Y,ψ)→(Y′,ψ′) is given by a morphism Y→Y′ which makes the triangle
commute. The existence of a morphism (Y,ψ)→(Y′,ψ′) says that ψ captures at least as much structure as ψ′ does: one may recover ψ′ from ψ but not necessarily vice versa. Ths suggests the desired "universal" right approximation to X should be an initial object of (X↓G); this is called a universal morphism fromXtoG. In more concrete terms, a universal morphism (Y0,ψ0) satisfies the following property:
Given any morphism ψ:X→G(Y), there exists a unique morphism ϕ:Y0→Y such that ψ=G(ϕ)∘ψ0.
These viewpoints offer another way to define adjunction equivalent to the previous formulation in terms of representability.
Proposition. Let C and D be categories, F:C→D be a functor, and let Y∈D. The presheaf HomD(F[−],Y) is representable if and and only if there exists a universal morphism from F to Y.
Proof. To begin, suppose that HomD(F[−],Y) is represented by some X0∈C. Then there exists a natural isomorphism
θ:HomD(F[−],Y)⟶≅HomC(−,X0)
which must map some morphism ϕ0:F(X0)→Y to the identity idX0. We show that (X0,ϕ0) is a universal morphism from F to Y. For this, it suffices to show that for every morphism ϕ:F(X)→Y, there exists a unique map ψ:X→X0 such that the diagram
commutes. The leftmost "degenerate" triangle in the diagram states that ψ∈HomC(X,X0) is the pullback of idX0 via ψ. Since θ is a presheaf morphism, θX−1(ψ) is the pullback of ϕ0 via ψ as well, and the rightmost triangle in the diagram states that this pullback is equal to ϕ. It follows that ψ=θX(ϕ) is uniquely determined.
Conversely, suppose that there exists a universal morphism ϕ0:F(X0)→Y. Then for every morphism ϕ:F(X)→Y, there exists a unique morphism (X,ϕ)→(X0,ϕ0) induced by a unique morphism θX(ϕ):X→X0. This defines a map
θ:HomD(F[−],Y)→HomC(−,X0)
which we show is a presheaf isomorphism. By definition, each component of θ is an isomorphism, since for every morphism ϕ:F(X)→Y, there exists a unique morphism ψ:X→X0 such that θX(ϕ)=ψ.
To show that θ is a presheaf morphism, let f:X′→X and ϕ∈HomD(FX,Y) and consider the diagram
It follows that θX′(f∗ϕ)=θX(ϕ)∘f=f∗θX(ϕ), so θ respects pullbacks. Thus, θ is a presheaf isomorphism.
■
The above argument shows that a representing object of the presheaf HomD(F[−],Y) is the same as a universal left approximation to Y. Thus, by an earlier proposition, the approximation problem can be solved for every Y∈D if and only if there is a right adjoint functor G:D→C. If so, the solution for Y will be given by G(Y).
Furthermore, as one should expect, one obtains an analogous result for the right approximation problem. As such, we omit the proof of the following proposition.
Proposition. Let C and D be categories, G:D→C be a functor, and let X∈C. The copresheaf HomC(X,G[−]) is representable if and and only if there exists a universal morphism from X to G.
Proof. Reverse all the arrows in the previous argument and replace "pullback" with "pushforward".
■
Similarly, the proof of this proposition implies that representing object of the copresheaf HomD(X,G[−]) is the same as a universal right approximation to X. Thus, by the same earlier proposition, the approximation problem can be solved for every X∈C if and only if there is a left adjoint functor F:C→D. If so, the solution for X will be given by F(X).
The counit-unit equations
As discussed before, for an adjunction F⊣G, there is a certain symmetry between the representability or approximation problems posed by F and G, which we paraphrase as follows:
If F is the solution to the problem posed by G, then G is the solution to the "dual" problem posed by F.
To illustrate this phenomenon, consider the case where both C and D are posets and F:C→D is a left adjoint. Then the best left approximation of each element y∈D exists, and is equal to the maximum of all left approximations to y. Packaging together all these approximations yields a map G:D→C defined by G(y)=maxC{x∣F(x)≤y}. Note that G is also order-preserving: if y≤Dy′, then
G(y)=Cmax{x∣F(x)≤Dy}≤Cmax{x∣F(x)≤Dy′}=G(y′).
Furthermore, G is a right adjoint. For x∈C, the set of all right approximations to x is {y∣x≤CG(y)}. However, G(y) was defined to be the best left approximation to y, which means anything less than that is just a left approximation to y. Explicitly, we have
This identifies the set of all right approximations to x with the set of all element greater than or equal to F(x). Thus, minx∈D{y∣x≤CG(y)}=F(x), so G is indeed a right adjoint, with F solving the approximation problem posed by G.
As a consequence of this duality, one obtains a kind of idempotence in taking best approximations. If x∈C, then
F(x) is a right approximation of x, which means that x≤CG(F(x)), and
G(F(x)) is a left approximation of F(x), which means that G(x)≤DF(G(F(x))).
Together, this implies that F≤DFGF≤DF, so FGF=F. Similarly, if y∈D,
G(y) is a left approximation of y, which means that F(G(y))≤Dy, and
F(G(y)) is a right approximation of G(y), which means that G(y)≤_CG(F(G(y))).
Together, this implies that G≤CGFG≤CG, so GFG=G. In other words, once one takes the best approximation of an object, taking a second approximation does not improve the result.
In the context of posets, the adjunction between C and D is called a Galois connection, and the maps GF:C→C and FG:D→D are referred to as the closure operator and kernel operator of the connection, respectively. The closure yields the best right approximation to an element in C by elements in the image of G, while the kernel yields the best left approximation to an element in D by elements in the image of F. Both operators are idempotent.
For general categories, one has a similar result, which can be used to give another equivalent formulation of adjunctions.
Proposition. Let C and D be categories and let F:C→D be a functor with right adjoint G:D→C. There exists a natural transformation ϵ:FG⇒idD and natural transformation η:idC⇒GF such that the diagrams
commute.
Proof. There exists a natural isomorphism
Ψ:HomD(F[−],−)⟹≅HomC(−,G[−])
defining the adjunction. Given Y∈D, the representing object of the presheaf HomD(F[−],Y) is G(Y); from this, we may obtain a universal morphism from F to Y by defining ϵY=Ψ−1(idG(Y)). These are the components of a natural transformation ϵ:FG⇒idD: for any morphism g:Y→Y′ we have
since Ψ−1 respects both pullbacks and pushforwards. Similarly, given X∈D, the representing object of the copresheaf HomC(X,G[−]) is F(X); from this, we may obtain a universal morphism from X to G by defining ηX=Ψ(idF(X)). These are the components of a natural transformation η:idC⇒GF: for any morphism f:X→X′ we have
Given a pair of functors (F,G):CFGD, a pair of natural transformations ϵ:FG⇒idD and η:idC⇒GF which make the diagrams in the previous proposition commute are called a counit-unit pair, and the corresponding equations ϵF∘Fη=idF and Gϵ∘ηG=idG are the counit-unit equations of the pair. If these equations are satisfied, then either one of these natural transformations may be thought of as an explicit solution to the representability or approximation problems described earlier, giving rise to an adjunction between F and G.
Proposition. Let C and D be categories and (F,G):CFGD be a pair of functors. If there exists a pair (ϵ,η) satisfying the counit-unit equations, then there exists an adjunction F⊣G.
Proof. We wish to construct a natural isomorphism HomD(F[−],−)≅HomC(−,G[−]) using the counit-unit pair. So, define
so Ψ also respects pushforwards. Thus, Ψ is a natural transformation, and by a similar argument, so is Φ. It remains to show that these natural transformations are inverses. Indeed, for each ϕ∈Hom(FX,Y), we have