Counting symmetric scales in equal-temperament

Jeffery MensahDecember 7, 2025∼1250 words

In this post, we give two puzzles related to a concept in music theory referred to as scale symmetry. Roughly speaking, a symmetric scale is a musical scale which does not give a strong impression of tonality. Before stating the two problems, we give a brief overview of the relevant theory and establish some terminology.

Equal-temperament

A musical note is an abstract label that represents a sound with a particular pitch. The current labeling system for pitch classes in Western music involves two components:

  1. a letter name from A\mathrm{A} to G\mathrm{G}, and
  2. optional accidentals attached to the letter, represented by the flat symbol \flat and the sharp symbol \sharp.

Within an octave, the notes rise in pitch from A\mathrm{A} to G\mathrm{G}. The \flat symbol slightly lowers the pitch of a note, and the \sharp symbol slightly raises the pitch of a note. To distinguish notes in the same pitch class, a subscript may be used to distinguish the octave.

The actual mapping between note labels to frequencies is only determined by general guidelines (which we omit for brevity), and is otherwise unspecified. The two main degrees of freedom in this mapping are a choice of pitch standard, which specifies the frequency of a single note, and a choice of temperament, which specifies the distances between each pair of notes.

In modern music, the typical pitch standard is A4=440Hz\mathrm{A}_4 = 440 \, \text{Hz}, and the most common choice of temperament is twelve-tone equal temperament, which, as the name suggests, possesses only twelve pitch classes. The pitches in this system may each be labeled with a unique integer kZk \in \mathbb{Z}, and the pitch class that a pitch kk belongs to is represented by its residue class [k][k] modulo 1212.

NN-tone equal temperament

This can be generalized to tuning systems with NN pitch classes. Instead of the traditional labeling system for notes, one labels each pitch with a unique integer kZk \in \mathbb{Z}, and represents the pitch class of kk by its residue class [k][k] modulo NN.

Scales

A musical scale is an ordered collection of pitches spanning an octave, which defines the pitch classes in use for a piece or section of music.

The pitches in a scale may be obtained by starting at a initial note called the tonic and applying a sequence of ascending intervals that collectively span a single octave, until the tonic is reached again. For our purposes, the only intervals we consider are half steps (H\mathrm{H}) and whole steps (W\mathrm{W}), which increase the label of a pitch by 11 and 22, respectively (this excludes scales such as the pentatonic scale). We call the sequence used to construct the scale its type or pattern. For example, the following illustration depicts the pitch collection of a major scale with tonic [0][0], obtained from the interval sequence WWHWWWH\mathrm{WWHWWWH}.

When listening to tonal music, the brain subconsciously tries determines the tonic of the underlying scale currently used; this process forms the basis of solmization. This raises the following natural question:

  • When can one determine the tonic of a known scale type only by hearing the notes in its pitch collection?

Note that one can always aurally determine the tonic of a major scale just by hearing its pitch collection, since no two major scales have identical pitch collections. However, for the whole-tone scale, there is ambiguity in the choice of tonic: the C\mathrm{C} whole-tone scale shares all the same notes as the D\mathrm{D} whole-tone scale, so a listener cannot determine the tonic solely from its pitch collection.

Scales for which this perceptual problem does not have a unique solution are said to be tonally ambiguous. With this established, we finally state the two problems and give their solutions.

Problem 1

Are there any tonally ambiguous seven-note scales in 1212-tone equal temperament?

Solution to Problem 1

The answer is no.

Let P=Z/12Z\mathbf{P} = \mathbb{Z}/12\mathbb{Z} be the set of all pitch classes, which is a homogeneous space for the cyclic group Z/12Z\mathbb{Z}/12\mathbb{Z}. Note that a sequence of ascending intervals j1,,j7{1,2}j_1, \ldots, j_7 \in \{1, 2\} defining a scale pattern may be identified with a subset ΣZ/12Z\Sigma \subset \mathbb{Z}/12\mathbb{Z} of size 77 by considering the set of partial sums

Σ(j)={i=1kji0k7}.\Sigma(j) = \left\{ \sum_{i=1}^{k} j_i \mid 0 \leq k \leq 7 \right\}.

Then given a pitch class πP\pi \in \mathbf{P}, the set of pitches Σ+π\Sigma + \pi is a scale with type jj and tonic π\pi. If there exists another tonic π\pi' producing the same pitch collection, then

Σ+π=Σ+π    Σ+(ππ)=Σ    ππstab(Σ),\Sigma + \pi = \Sigma + \pi' \implies \Sigma + (\pi - \pi') = \Sigma \implies \pi - \pi' \in \mathrm{stab}(\Sigma),

so the interval between the two tonics is in the stabilizer of Σ\Sigma. Thus, the action of ππ\langle \pi - \pi' \rangle on Z/12Z\mathbb{Z}/12\mathbb{Z} restricts to an action of ππ\langle \pi - \pi' \rangle on Σ\Sigma. The orbits of this action are simply the orbits of the former action, which are the cosets of ππ\langle \pi - \pi' \rangle. Since these partition Σ\Sigma, it follows that the order of ππ\langle \pi - \pi' \rangle divides Σ|\Sigma|. In particular, gcd(12,Σ)>1\gcd(12, |\Sigma|) > 1, which implies that tonally ambiguous scale cannot consist of seven elements.

Problem 2

Provide a general formula for the number of tonally ambiguous scale types in mm-tone equal temperament.

Solution to Problem 2

Let Jn,m\mathcal{J}_{n, m} be the set of nn-note scale patterns in mm-tone equal temperament, that is,

Jn,m={(j1,,jn)ji{1,2} and i=1nji=m},\mathcal{J}_{n, m} = \left\{ (j_1, \ldots, j_n) \bigm| j_i \in \{1, 2\} \text{ and } \sum_{i=1}^{n} j_i = m \right\},

and let Jm=nJn,m\mathcal{J}_m = \bigcup_{n} \mathcal{J}_{n, m}. We may identify any sequence jJn,mj \in \mathcal{J}_{n, m} with a subset of Z/mZ\mathbb{Z}/m\mathbb{Z} by considering its partial sums

Σ(j)={i=1kji0kn}.\Sigma(j) = \left\{ \sum_{i=1}^{k} j_i \mid 0 \leq k \leq n \right\}.

As shown previously, a scale type jJmj \in \mathcal{J}_{m} is ambiguous if and only if Σ(j)\Sigma(j) has a nontrivial stabilizer or, equivalently, the orbit of Σ(j)\Sigma(j) has size less than mm. So, let Sk,m\mathcal{S}_{k, m} be the number of scale patterns ΣZ/mZ\Sigma \subseteq \mathbb{Z}/m\mathbb{Z} such that orb(Σ)=k|\mathrm{orb}(\Sigma)| = k. For an integer kk dividing mm, we show that there exists a bijection between scale patterns in kk-tone equal temperament and scale patterns in mm-tone equal temperament whose orbit has size dividing kk. To this end, consider the map

Φ ⁣:JkdkSd,m;jΦΣ(jjm/k times).\Phi \colon \mathcal{J}_{k} \to \bigcup_{d \mid k} \mathcal{S}_{d, m}; \quad \quad j\, \overset{\Phi}{\longmapsto} \Sigma\big(\underbrace{\,j \cdots j\,}_{m/k\text{ times}}\big).

This map is well-defined: if jJkj \in \mathcal{J}_{k}, then repeating it m/km/k times yields a sequence with a sum of mm, and by construction, k+Φ(j)=Φ(j)k + \Phi(j) = \Phi(j), implying the orbit of Φ(j)\Phi(j) has size dividing kk.

Since Φ(j)\Phi(j) is simply formed by repetition, it is injective. To show surjectivity, note that any pattern Σ(j)\Sigma(j) whose orbit has size dividing kk must be invariant under translation by kk. Since 00 is a partial sum of jj, we must also have kk as a partial sum. Hence, jj must contain a prefix jJkj' \in \mathcal{J}_k for which j=Φ(j)j = \Phi(j'). It follows from the Möbius inversion formula that

Sk,m=dkμ(d)Jk/d,|\mathcal{S}_{k, m}| = \sum_{d \mid k} \mu(d) |\mathcal{J}_{k/d}|,

where μ ⁣:NZ\mu \colon \mathbb{N} \to \mathbb{Z} is the Möbius function. The size of Jm\mathcal{J}_m is well-known: removing the last term of a sequence in Jm\mathcal{J}_m yields a sequence in either Jm1\mathcal{J}_{m-1} or Jm2\mathcal{J}_{m-2}. Since the initial values are J1=J2=1|\mathcal{J}_1| = |\mathcal{J}_2| = 1, this implies that Jm|\mathcal{J}_m| is the mmth Fibonacci number, FmF_m. It follows that the number of ambiguous scales in mm-tone equal temperament is

#ambiguous=JmSm,m=,.dmd1μ(d)Fm/d.\#\text{ambiguous} = \left| \mathcal{J}_m - \mathcal{S}_{m, m}\right| = - \vphantom{\sum^{.}_{,}}\sum_{\substack{d\hspace{0.5pt}\mid m \\ d \neq 1}} \mu(d) F_{m/d}\,.

For example, in ordinary 1212-tone equal temperament, there are 18+13+0211+01=101 \cdot 8 + 1 \cdot 3 + 0 \cdot 2 - 1 \cdot 1 + 0 \cdot 1 = 10 types of tonally ambiguous scales.