In their timely Perspective article (Kazanina, N. & Tavano, A. What neural oscillations can and cannot do for syntactic structure building. Nat. Rev. Neurosci. 24, 113–128 (2023))1, Kazanina and Tavano argue that neural oscillations cannot linearly chunk (or segment) speech into syntactic constituents because constituents are defined in terms of hierarchical relations. Instead, they propose that oscillations could support syntactic structure building (SSB) through ‘multi-scale integration’ of hierarchically organized constituents. We agree with their arguments against the utility of chunking for SSB. However, the dichotomy between ‘oscillations for chunking’ and ‘oscillations for integration’ does not accurately represent the literature: the integratory role of oscillations is well-accepted2,3, and chunking is not a candidate model of SSB. Here, we show that recent work on oscillations and syntax4,5 does not assume chunking and we identify principal challenges for the integration proposal put forward by Kazanina and Tavano.

Syntax builds hierarchical structures, so sentences cannot be described in terms of sequential properties6. As phrases are hierarchically embedded in one another, phrase length is highly variable, and no regular ‘phrase rhythm’ occurs in sentences. On the basis of these facts, Kazanina and Tavano convincingly argue that oscillations cannot support SSB via naive phrase chunking. However, they incorrectly attribute this view to authors who have used oscillations to investigate neural sensitivity to syntax4,5. Contrary to Kazanina’s and Tavano’s claims, neither ref. 4 nor ref. 5 argues that phrase durations in natural language fall within a narrow frequency band or that phrases follow one another linearly without overlap. Thus, these results can be informative about the hierarchical nature of SSB without a chunking perspective on neural oscillations7.

As Kazanina and Tavano point out, neural mechanisms for SSB must be able to encode structural relationships between hierarchically organized elements (for example, dominance, scope), but their proposed solution based on cross-frequency interaction faces critical challenges. First, it relies on a neurobiological model of sequence encoding, which represents ‘horizontal’ information about ordinal positions in a sequence8. However, what matters for hierarchical syntax is the ‘vertical’ relation between a constituent and its subordinate elements (for example, that ‘eat cookies’ is a verb phrase, not a noun phrase)6, which is not represented in the neural sequence code. Second, it is unclear how ‘relational’ information, as found in long-distance dependencies, could be extracted. In Kazanina’s and Tavano’s multi-scale integration view, high-band-frequency encodings of non-adjacent words occupy different phases of low-band-frequency oscillations. But the crucial question — that is, how the dependency between distant words is then resolved — remains unanswered. Third, their integration proposal relies on a one-to-one correspondence between embedding of constituents and embedding of high-frequency into low-frequency oscillations. Kazanina and Tavano correctly argue against a temporal correspondence between the lengths of phrases and oscillatory cycles, but their proposal assumes an equally transparent correspondence between hierarchy in syntax and hierarchy in oscillations. Such a strict isomorphism between cognitive and neural units, whether temporal or hierarchical, is, in our opinion, unlikely to be correct9.

To explain the potentially oscillatory underpinnings of SSB, Kazanina’s and Tavano’s multi-scale integration proposal remains underspecified, and would be strengthened if it explained how ‘vertical’ and ‘relational’ information, needed for truly hierarchical relationships, could be encoded. Only when this type of information is incorporated in a theory of structure building can we properly evaluate what oscillations can do for syntax10.

There is a reply to this letter by Kazanina, N. & Tavano, A. Nat. Rev. Neurosci. https://doi.org/10.1038/s41583-023-00735-4 (2023).