A causal role for frontal cortico-cortical coordination in social action monitoring

Ninomiya, Taihei; Noritake, Atsushi; Kobayashi, Kenta; Isoda, Masaki

doi:10.1038/s41467-020-19026-y

Download PDF

Article
Open access
Published: 16 October 2020

A causal role for frontal cortico-cortical coordination in social action monitoring

Nature Communications volume 11, Article number: 5233 (2020) Cite this article

4570 Accesses
31 Citations
55 Altmetric
Metrics details

Subjects

Abstract

Decision-making via monitoring others’ actions is a cornerstone of interpersonal exchanges. Although the ventral premotor cortex (PMv) and the medial prefrontal cortex (MPFC) are cortical nodes in social brain networks, the two areas are rarely concurrently active in neuroimaging, inviting the hypothesis that they are functionally independent. Here we show in macaques that the ability of the MPFC to monitor others’ actions depends on input from the PMv. We found that delta-band coherence between the two areas emerged during action execution and action observation. Information flow especially in the delta band increased from the PMv to the MPFC as the biological nature of observed actions increased. Furthermore, selective blockade of the PMv-to-MPFC pathway using a double viral vector infection technique impaired the processing of observed, but not executed, actions. These findings demonstrate that coordinated activity in the PMv-to-MPFC pathway has a causal role in social action monitoring.

Dynamic targeting enables domain-general inhibitory control over action and thought by the prefrontal cortex

Article Open access 12 January 2022

Domain-specific functional coupling between dorsal and ventral systems during action perception

Article Open access 03 December 2020

Temporal hierarchy of observed goal-directed actions

Article Open access 11 November 2023

Introduction

Primates, including humans, are social creatures and coordinate their behavior to maintain group cohesion. Successful social exchanges require monitoring others’ actions to optimally organize one’s own actions. This social action monitoring for response optimization has been thought to be mediated at the frontal level by brain regions centered in the ventral premotor cortex (PMv) and the medial prefrontal cortex (MPFC)^1,2,3. The PMv is a cortical area in which the so-called mirror neurons were discovered in the macaque⁴. This class of neurons discharge not only when executing specific actions, but also during the observation of similar actions performed by others. This shared coding of actions between the self and others has been associated with diverse roles, including intention reading via embodied simulation⁵, and is hypothesized to be deficient in autism spectrum disorder (ASD)^6,7,8. Later work has shown, however, that the MPFC also contains neurons with mirror properties^9,10,11. In a similar vein, it has been argued that the existence of self neurons and other neurons in the MPFC^9,10,11 may characterize its unique role in self-other distinction¹². Self neurons and other neurons are those that selectively or preferentially encode the self-action and others’ action, respectively, and the paucity of other neurons in the MPFC is associated with a spontaneous autistic phenotype in a macaque monkey¹³. However, such separate neuronal coding of self and other is not confined to the MPFC. Although studies on the PMv have usually emphasized the existence of motor neurons and mirror neurons, a subset of PMv neurons, albeit in the minority (~5%), are purely action observation neurons¹⁴. These findings suggest that the existence of a particular class of neurons per se cannot uniquely define a particular area in the brain. Despite a growing interest in cortical mechanisms of social action monitoring, the similarities and differences between the PMv and MPFC have yet to be determined. This is in part because direct comparisons of neuronal activities have not been performed between the two areas in the same monkeys performing the same task.

In relation to the issue raised above, there has been much debate on the functional organization of the PMv and MPFC. First, it is controversial whether the two cortical areas show a preference for actions of biological agents. Some studies show that the PMv is activated by actions performed by biological, but not nonbiological, entities^4,14,15,16, while others show the lack of such biological preference^17,18. There is even evidence that mirror neurons in the PMv can become active in the absence of any visible actions^19,20. In the MPFC, activation is selective for biological interactions as assessed using functional neuroimaging methods³; however, this issue has never been addressed systematically at the single-neuron level.

Second, it is highly controversial whether the PMv and MPFC coordinate with one another for social information processing. A large body of work in human neuroimaging has shown that the two areas are rarely concurrently active¹, leading to the hypothesis that their roles in social cognition are mutually independent or even divergent^21,22. This is obviously a puzzle, however, if one considers ample evidence showing rich anatomical connections between the PMv and MPFC^{23,24,25,26,27,28}. In particular, stronger tuning to the self and others’ actions in more caudal MPFC divisions is associated with stronger connections with the PMv²⁸. Although several neuroimaging and theoretical studies^{29,30,31,32,33,34,35,36} suggest possible interactions between the PMv and MPFC, or between neural systems to which these areas are thought to belong separately, this issue has never been addressed electrophysiologically in the macaque.

Clarifying these unsolved issues is fundamental to understanding the organizing principles of the primate social brain. To achieve this goal, we developed an experimental procedure in which monkeys alternated making choices with three different types of partners: a real monkey or a human experimenter [real agent (RA) condition, Fig. 1b, left)], a filmed monkey replayed on a monitor (FM condition, Fig. 1b, center), and a filmed object replayed on the monitor (FO condition, Fig. 1b, right). The RA condition is biologically more natural than the FM condition, because a partner in the RA condition really exists in front of the recorded monkey, allowing real-time social interactions, as opposed to a filmed partner in the FM condition that appears and disappears on the monitor abruptly every time a trial starts and ends. Likewise, the FM condition is biologically more natural than the FO condition, because the partner is a biological entity (monkey) in the FM condition as opposed to a nonbiological entity (a wooden stick) in the FO condition. Thus, we operationally defined the biological nature of these partners to be highest in the RA condition, intermediate in the FM condition, and lowest in the FO condition.

**Fig. 1: Behavioral task and performance.**

Under these conditions, we performed simultaneous, multisite neural recordings in the PMv and MPFC. Here, we report that a sizable number of PMv and MPFC neurons changed their activities depending on the biological nature of observed actions; however, only MPFC neurons exhibited a systematic response bias in favor of biological actions at the population level. By applying the Granger causality analysis to simultaneously recorded neural signals, we obtained evidence that information flow from the PMv to the MPFC increased as the biological nature of observed actions increased. Furthermore, we found by using a double viral vector infection technique^37,38 that the PMv-to-MPFC pathway was causally involved in monitoring observed, but not executed, actions, for determining optimal choices, particularly when the observed actions were performed by real social agents. Interestingly, the observed deficits in social action monitoring resembled peculiar behavioral disorders reported previously in a monkey with an autistic phenotype¹³.

Results

Evaluation of ability to monitor partner’s performance

We used two monkeys (macaca fuscata, monkeys A and B; both designated as M1) as experimental subjects for behavioral and neural data collections. M1 was trained to perform a role-reversal choice task (Fig. 1a) with partners (designated as M2) with different degrees of biological nature (Fig. 1b). In the RA condition, M1 and M2 alternated the roles of ‘actor’ and ‘observer’ every three trials (Supplementary Movie 1). In each trial, once both agents had pressed their start buttons, three target buttons on the actor’s side were illuminated simultaneously. The actor was then required to choose one of them by a reach. When the actor chose the target associated with a reward, both M1 and M2 received a water reward; however, when the actor’s choice was wrong, neither was rewarded (Fig. 1a, top). The correct target position remained the same during a block of 11–17 trials (nonswitch trials) and was then changed without prior notice (switch trials; red arrows in Fig. 1a, bottom). The task structure in the two filmed conditions was fundamentally the same (Supplementary Movie 2 for a video stimulus in the FO condition). Both M1s performed the task reasonably well, with the overall correct rate being higher than 75% in all the partner conditions (Fig. 1c). Interestingly, the overall performance of monkey A decreased as M2’s biological nature decreased (Fig. 1c, left; ρ = −0.40, P = 7.8 × 10⁻⁴, Spearman correlation test).

We examined whether M1 properly monitored M2’s choice information. To this end, we analyzed M1’s performance after M2’s choice ended in no-rewards. In our task, such no-rewards occurred in both nonswitch and switch trials. In nonswitch trials, no-rewards occurred when, despite no clear evidence that a block switch had occurred, M2 chose a button that was not associated with a reward in the current block (‘choice error’; Supplementary Fig. 1a). This no-reward was caused simply by M2’s erroneous choice. The optimal strategy for M1 following M2’s choice error was to keep choosing the same button that had been rewarded in preceding trials (Supplementary Fig. 1a). In switch trials, no-rewards occurred most typically when M2 chose a button that was associated with a reward in the preceding block (Supplementary Fig. 1b). Thus, this no-reward was caused by an unexpected block switch (‘switch error’). In this case, the optimal strategy was to shift to one of two target buttons that were not rewarded in the preceding block (Supplementary Fig. 1b). We found that the percent optimal choice was significantly above chance in both cases (all P < 0.05, Student’s t test; Fig. 1d, e), except for post-switch-error trials in the FO condition (monkey A, t₃₉ = 1.1, P = 0.30; monkey B, t₆₆ = 0.42, P = 0.67; Student’s t test). These findings indicate that M1 monitored M2’s choice information to determine their own choice. Note, again, that the percent optimal choice following M2’s switch error correlated positively with M2’s biological nature (Fig. 1e; monkey A, ρ = −0.16, P = 0.012; monkey B, ρ = −0.17, P = 0.038; Spearman correlation test).

Difference in actor coding and biological preference between two areas

We isolated 565 PMv and 480 MPFC single neurons by multisite recordings with no sampling bias (Fig. 2a). Among these, 277 PMv and 215 MPFC neurons exhibited significantly changed activity in the 600-ms period starting 400 ms before the target button press (peri-action period; see Methods). Because direct comparisons of neural activities have never been reported between the PMv and MPFC in the same monkeys performing the same tasks, we first examined actor coding in the RA condition. Most interesting in this respect was the existence of three types of neuronal populations referred to as self, mirror, and partner types (see Methods for the statistical definition of each neuronal type). These populations were found in both areas but with different frequencies (Fig. 2b).

**Fig. 2: Actor coding in PMv and MPFC.**

Self-type neurons responded preferentially to the self-action compared to the partner-action (Fig. 2c, g and Supplementary Fig. 2a). This type was significantly more prevalent in the PMv than in the MPFC (PMv, n = 92, 33%; MPFC, n = 47, 22%; df = 1, χ² = 7.7, P = 5.5 × 10⁻³, χ² test). The response was inhibitory in a subset of neurons (PMv, n = 12; MPFC, n = 22; Fig. 2b and Supplementary Fig. 3a). Mirror-type neurons responded nondifferentially to the self-action and partner-action (Fig. 2d, h and Supplementary Fig. 2b). The proportion of this type did not differ between the two areas, albeit were more numerous in the PMv than in the MPFC (PMv, n = 105, 38%; MPFC, n = 68, 32%; df = 1, χ² = 2.1, P = 0.15). About half of them were negatively modulated (PMv, n = 53; MPFC, n = 28; Fig. 2b and Supplementary Fig. 3b). Finally, partner-type neurons responded preferentially to the partner-action compared to the self-action (Fig. 2e, i and Supplementary Fig. 2c). In contrast to the self-type, partner-type neurons were significantly more frequent in the MPFC than in the PMv (PMv, n = 80, 29%; MPFC, n = 100, 47%; df = 1, χ² = 16.2, P = 5.6 × 10⁻⁵) and exhibited inhibitory modulations only occasionally (PMv, n = 6; MPFC, n = 8; Fig. 2b). Partner-type neurons were the most prevalent population in the MPFC (Fig. 2b). The activity of a subset of partner-type neurons was significantly higher during the partner’s incorrect than correct actions (Fig. 2f, j and Supplementary Fig. 2d). This subtype, referred to as the partner-error-type³⁹, was identified in both areas but was significantly more common in the MPFC than in the PMv (Fig. 2b; PMv, n = 18, 7%; MPFC, n = 42, 20%; df = 1, χ² = 19.2, P = 1.2 × 10⁻⁵). Across the three types of neurons, response selectivity for the self-action and partner-action varied along a continuum between the two extremes of being almost purely self-selective and partner-selective (Supplementary Fig. 4). Population-averaged activity for each neuronal type aligned with start button onset (i.e., trial start) revealed that such actor selectivity emerged after target onset (Supplementary Fig. 5).

We next examined neuronal preference for specific partner types by comparing the peri-action-period activities during the RA and FM conditions. We found that in both areas the activity of many neurons differed significantly between the two conditions, with some showing a preference for the RA condition and others for the FM condition (Fig. 3a, b). This bidirectional bias was observed in all types at the single-neuron level (Fig. 3c, d). An analysis of covariance using partner’s response time as a covariate revealed that the difference in this kinematic movement parameter did not account for the observed difference in neuronal activity for the great majority of neurons (partner-correct trials, 94%; partner-error trials, 88%). When analyzed at the population level, however, only MPFC neurons exhibited systematic bias in favor of the RA condition (Fig. 3d, yellow background). Specifically, activities of partner-type neurons in the MPFC were significantly greater in the RA condition than in the FM condition during self-correct actions (z = 2.4, P = 0.015, Wilcoxon signed-rank test), partner-correct actions (z = 3.1, P = 1.7 × 10⁻³), and partner-incorrect actions (z = 3.4, P = 8.0 × 10⁻⁴). Qualitatively similar results were obtained when the partner type was tested separately for the partner-error and non-partner-error types (Supplementary Fig. 6a, b).

**Fig. 3: Biological preference in PMv and MPFC.**

A comparison between the FM and FO conditions revealed similar findings. First, the activities of many neurons in both areas changed depending on these two conditions (Fig. 3e, f). Second, at the population level, systematic bias was observed only in the MPFC: activities of partner-type neurons were significantly larger in the FM condition than in the FO condition during partner-correct actions (Fig. 3f; z = 3.5, P = 4.0 × 10⁻³, Wilcoxon signed-rank test; see also Supplementary Fig. 6c, d). Thus, the results from the two comparisons indicate that both PMv and MPFC neurons are sensitive to the partner’s biological nature on a cell-by-cell basis, but basically only MPFC partner-type neurons show a preferential bias toward the partner of higher biological nature at the population level. Note, however, that a time course analysis using a population-averaged spike-density function revealed that mirror-type and partner-type PMv neurons also exhibited significantly higher activity around the time of target choice in the RA condition than in the FM condition, albeit for a short period of time (Supplementary Fig. 7). The activity of self-type MPFC neurons during the self-action was significantly larger in the FO than in the FM condition (z = −3.6, P = 2.7 × 10⁻⁴, Wilcoxon signed-rank test; Fig. 3f), suggesting that even self-action-related activity could be influenced by social contexts.

Inter-areal coordination between two areas

We next asked whether, and if so, how the PMv and MPFC might coordinate during social action monitoring. To this end, local field potentials (LFPs) were simultaneously recorded in the two areas using 16-channel electrodes. Here, we focused on LFPs recorded when the self-action and partner-action were made correctly, because these trial conditions occurred most frequently and equally often, and thus allowed us to make a more reliable estimate of inter-areal coordination.

We first performed a time-frequency domain analysis. It has been documented that scalp potentials over the human sensorimotor cortex are suppressed in the alpha band during action execution and observation, a phenomenon known as mu suppression⁴⁰. This suppression has been hypothesized to reflect mirror neuron activity^7,40 and can be weakened or absent in people with ASD^7,41. In macaques, similar phenomena, but in higher frequency bands, were consistently reported in the PMv^42,43,44,45, whereas possible contributions of the MPFC to mu suppression have not been reported. Here we confirmed in the RA condition that LFP power in the high-beta band (23–30 Hz; black arrows in Fig. 4a, left) was suppressed in the PMv during both self-action and partner-action. Notably, similar suppression also occurred in the MPFC (black arrows in Fig. 4a, right). Critically, the suppression in the MPFC was significant only in the RA condition (Fig. 4a–c, right; asterisks in Fig. 4d, right), which was in marked contrast to the PMv where the suppression was significant in all the partner conditions (Fig. 4a–c, left; asterisks in Fig. 4d, left). In the MPFC, the suppression magnitude for observed actions correlated positively with the biological nature of the partner (Fig. 4d, right; partner-action, ρ = 2.2, P = 1.8 × 10⁻²; self-action, ρ = 0.044, P = 0.70; Spearman correlation test). Such systematic modulation was not observed in the PMv either for the self-action (ρ = −0.26, P = 0.082) or partner-action (ρ = −0.052, P = 0.40). These findings indicate that mu suppression in the high-beta band co-exists in the PMv and MPFC, but only that for observed actions in the MPFC changes systematically depending on the partner’s biological nature. Interestingly, when the two filmed conditions were combined together [filmed agent (FA) condition], a significant interaction was found between the factor of actors (self vs. partner) and the factor of partner conditions (RA vs. FA) in the PMv [P = 0.025, two-way analysis of variance (ANOVA)]. This effect was likely to be caused by enhanced mu suppression in the FA condition during the self-action (P = 5.1 × 10⁻³, Welch’s t test; Fig. 4d, left, inset).

**Fig. 4: LFP analysis at time-frequency domain in PMv and MPFC.**

We also analyzed the dependency of two other LFP components on the partner’s biological nature. First, LFP power below 10 Hz was increased especially prior to the target choice, but this increase was not significantly correlated with the partner’s biological nature in either the PMv (self-action, ρ = −0.17, P = 0.15; partner-action, ρ = −0.15, P = 0.21) or the MPFC (self-action, ρ = −0.15, P = 0.19; partner-action, ρ = 0.076, P = 0.51; Supplementary Fig. 8). In the PMv, however, the power during the self-action was significantly larger in the RA condition than in the two filmed conditions combined together (i.e., FA condition, as defined above) (P = 0.012, actor × partner interaction, two-way ANOVA; P = 0.037, post-hoc Welch’s t test; Supplementary Fig. 8, inset). Second, LFP power at 18–30 Hz increased after target choice, particularly in the PMv during the partner-action. This increased power was, again, not significantly correlated with the partner’s biological nature (PMv, ρ = −0.0034, P = 0.98; MPFC, ρ = 0.13, P = 0.26; Supplementary Fig. 9). However, this LFP component in the MPFC during the partner-action was significantly larger in the FA condition than in the RA condition (P = 0.036, actor × partner interaction, two-way ANOVA; P = 0.019, post-hoc Welch’s t test; Supplementary Fig. 9, right, inset). These findings suggest the existence of frequency-specific, task-phase-dependent processing of social information.

The above findings raise the possibility that the PMv and MPFC might coordinate in a frequency-specific manner. To test this more directly, we computed field-field coherence between the two areas using the first derivatives of the LFP signals⁴⁶. We found that inter-areal coherence was prominent in the frequency band especially below 3 Hz (Fig. 5a), consistent with the increased low-frequency power in the time-frequency domain analysis (Fig. 4a–c). This coherence increase occurred during both self-action and partner-action in all the partner conditions (Fig. 5a). It is therefore likely that the PMv and MPFC coordinate during social action monitoring mainly via the delta-frequency band. The coherence power in the delta band was not significantly associated with the partner’s biological nature (ρ = −6.5 × 10⁻³, P = 0.38 for self-correct, ρ = 0.086, P = 0.49 for partner-correct; Spearman correlation test; Supplementary Fig. 10) or was not significantly different between the RA and FA conditions (self-correct, P = 0.37; partner-correct, P = 0.52; Welch’s t test; Supplementary Fig. 10, inset).

**Fig. 5: Coherence and Granger causality between PMv and MPFC.**

Coherence is silent on the directionality of information flow. Overlapping response latencies between the PMv and MPFC made it difficult to clearly determine which one of the two areas might affect the other (Supplementary Fig. 11). To clarify this point and potential dependence of information flow on the partner’s biological nature, we performed a multivariate Granger causality analysis using the first derivatives of LFPs. We found that Granger causality from the PMv to the MPFC (Fig. 5b, top), but not from the MPFC to the PMv (Fig. 5b, bottom), was greater when the partner’s biological nature was higher for both executed and observed actions (PMv-to-MPFC, ρ = −0.27, P = 0.021 for self-correct, ρ = −0.30, P = 0.010 for partner-correct; MPFC-to-PMv, ρ = −0.040, P = 0.73 for self-correct; ρ = −0.0054, P = 0.96 for partner-correct; Spearman correlation test). Furthermore, consistent with increased coherence, Granger causality was most pronounced in the delta band (Fig. 5c). In this frequency band, Granger causality in the PMv-to-MPFC direction also correlated positively with biological nature during the partner-action (ρ = −0.27, P = 0.021; Spearman correlation test; Fig. 5d, bottom-left). These findings suggest that the pathway from the PMv to the MPFC carries signals for social action monitoring, particularly when interacting partners are real animate beings.

Causal role for PMv-to-MPFC pathway in social action monitoring

If signals carried by the PMv-to-MPFC pathway are crucial, then selective blockade of this pathway should impair the monitoring of partner’s action. We tested this possibility by employing a viral vector-mediated, pathway-selective intervention in monkey B. In this intervention procedure (Fig. 6a), a retrograde gene transfer vector, AAV2-retro-CMV-rtTAV16, was injected bilaterally into the MPFC (1.1 × 10¹³ viral genomes/ml, 0.25–0.3 μL at 12 sites), while another vector, AAV-DJ-TRE-EGFP-eTeNT-PEST, was injected bilaterally into the PMv (2.1 × 10¹³ viral genomes/ml, 0.25–0.3 μL at 12 sites). Seven weeks later, oral administration of doxycycline (Dox, 0.25–0.3 mg/kg/day) was initiated. With this double viral vector technique^37,38, PMv neurons whose axons terminate in the MPFC are double infected (Fig. 6a). Dox administration then induces expression of enhanced tetanus neurotoxin (eTeNT), which in turn blocks synaptic transmission of double-infected neurons while other inputs to the MPFC are spared. Using anti-GFP immunohistochemistry, we confirmed that double-infected neurons were localized ventrally to the arcuate spur corresponding to the PMv (Fig. 6b). This PMv sector has been shown to connect abundantly with the MPFC^27,28.

**Fig. 6: Task performance after selective blockade of PMv-to-MPFC pathway.**

We performed four courses of Dox administration with an intervening period of at least 4 weeks. In the first experiment, the overall performance was not affected throughout a 14-day test period compared to a 7-day pre-administration control period (Fig. 6c). However, the percent optimal choice decreased gradually, yet steadily and specifically, in trials following partner choice errors (Fig. 6d). This decline in performance was not caused by a failure of no-reward detection or a decrease in general motivation, because the percent optimal choice following M1’s own choice error was not at all affected (Fig. 6e). These observations were consistent across all the Dox experiments: performance was severely impaired after partner choice errors (RA, t₆₄ = 8.1, P = 6.0 × 10⁻¹²; FM, t₅₇ = 1.9, P = 0.024; FO, t₅₇ = 2.8, P = 7.3 × 10⁻³; Welch’s t test; Fig. 6g), whereas neither overall performance (all P > 0.05; Fig. 6f) nor performance after M1’s own choice errors was deteriorated (all P > 0.05; Fig. 6h). Critically, with the pathway-selective intervention, the performance levels after partner choice error were no longer different from chance level in the RA and FM conditions, while those in the FO condition were still significantly higher than chance (RA, 34%, t₆₅ = 0.28, P = 0.78; FM, 44%, t₅₈ = 1.7, P = 0.10; FO, 50%, t₅₈ = 2.8, P = 8.5 × 10⁻³; Student’s t test; Fig. 6g). Furthermore, the performance levels after partner choice error were negatively correlated with the partner’s biological nature after the blockade of the PMv-to-MPFC pathway (ρ = 0.19, P = 0.044; Spearman correlation test; Fig. 6g). These findings indicate that the PMv-to-MPFC pathway plays a causal role in the monitoring of observed, but not executed, actions for determining one’s own optimal choices. The gaze duration at the partner’s target position was not affected by the intervention (RA, t₄₅ = −1.0, P = 0.31; FM, t₄₅ = −0.76, P = 0.50; FO, t₄₅ = 1.8, P = 0.085; Welch’s t test; Supplementary Fig. 12), which ruled out the possibility that the observed impairment was merely due to a loss of overt attention to the partner-action. Interestingly, as opposed to the decline in performance after partner choice errors, the performance levels after partner switch errors were significantly increased in the RA and FM conditions during intervention (RA, t₆₄ = −2.9, P = 5.3 × 10⁻³; FM, t₅₈ = −2.5, P = 0.014; FO, t₅₈ = −1.2, P = 0.22; Welch’s t test; Fig. 6i). These opposing intervention effects could be accounted for by deficits in monitoring the partner-action (see Discussion).

Discussion

In the present study, we directly compared neural activities between the PMv and MPFC in the same monkeys performing the same task, as well as to determine the way in which the two areas might coordinate. We have shown that mirror-type neurons and partner-type neurons are the prevalent population in the PMv and MPFC, respectively, although all three types of actor-coding exist in both areas. We have also shown that, although the classification of neuronal types was made on the basis of statistical criteria, the selectivity for the self-action and partner-action in individual neurons varied along a continuum (Supplementary Fig. 4), like reports in other functional domains, such as visual⁴⁷, oculomotor⁴⁸, and reward⁴⁹ processing. While the continuous distribution of self-coding and partner-coding raises the possibility that the exact boundaries between the three classes of neurons may be somewhat arbitrary, it might also reflect a perceived overlap between the self and others that can be dynamically changed depending on social contexts^50,51.

We found that at the population level, only partner-type neurons in the MPFC, but not mirror-type or partner-type neurons in the PMv, exhibited a significantly increased response as the biological nature of the partner’s action was increased. The biological preference in the PMv has long been a debate due to mixed evidence in both macaque single-neuron studies^4,14,15,18 and human neuroimaging studies¹⁶. Although the activities of a sizable number of PMv neurons did change depending on the partner’s biological nature, our findings suggest that PMv neurons, including the mirror-type, do not exhibit systematic bias in favor of biologically performed actions. Instead, such consistent biological preference is evident in partner-type neurons in the MFPC. This observation accords with neuroimaging findings that the macaque MPFC becomes active during observation of social, but not physical, interactions³. The possibility remains, however, that biological preference in the MPFC is dependent, at least partly, on the inputs from the PMv, considering the existence of PMv neurons sensitive to the partner’s biological nature at the single-cell level (Fig. 3c, e), a brief but significant enhancement of PMv activity in the RA condition (Supplementary Fig. 7a), and the partner-dependent information flow from the PMv to the MPFC (Fig. 5b, d).

The time-frequency domain analysis of the LFP signals further supports our conclusion that the PMv and MPFC have different biological preferences. We confirmed that the magnitude of mu suppression in the MPFC, but not in the PMv, changed systematically depending on the biological nature of the partner (Fig. 4d). Mu suppression has been thought to be a neural signature of mirror neuron activity in human studies^7,40. Our findings now suggest that activity in the MPFC, a midline cortical structure, also contributes to mu suppression in humans. In support of this view, it has been documented that mu suppression can be recorded at Cz, in addition to C3 and C4, in the international 10–20 method of electrode placement⁷. Moreover, the magnitude of mu suppression decreases when observing nonbiological movements compared to biological actions⁷, as found in MPFC LFPs in the present study. These findings suggest that electroencephalogram-based mu suppression contains MPFC-derived response components to a significant degree. The functional significance and clinical relevance of mu suppression in each area are unclear. While the mu frequency band seems to carry information regarding social actions in each of the PMv and MPFC, interactions between the two areas, as indexed by coherent activity, take place in much lower frequency bands (<3 Hz). It has been proposed that slow oscillations serve to synchronize neural activities in large-scale networks^52,53. These results indicate that social action information is processed in specific frequency channels within (mu frequency band) and between (delta-frequency band) the PMv and MPFC.

We have shown that the distributions of response latencies in the PMv and MPFC overlap substantially (Supplementary Fig. 11). Given a conduction time as fast as a few milliseconds between frontal cortical areas^54,55, it is still possible that one of the two areas leads the other despite a substantial latency overlap, or separate neural signals are conveyed concurrently in both directions. The Granger causality analysis applied to simultaneously recorded LFPs has revealed the existence of information flow in the PMv-to-MPFC direction that is increased as the partner’s biological nature is increased. Determining what social information might be conveyed in the opposite direction, i.e., from the MPFC to the PMv, is an important future direction.

Consistent with the change in information flow depending on the partner’s biological nature, selective blockade of the PMv-to-MPFC pathway impaired performance after partner choice errors particularly in the RA condition. This impairment cannot be explained by failures to detect no-rewards, lack of attention to other-actions, or decreases in motivation to work. Moreover, further analysis revealed that performance after partner switch errors was significantly better in the RA and FM conditions after intervention (Fig. 6i). This is surprising at first sight, because optimal performance after partner switch errors also requires monitoring other-actions (Supplementary Fig. 1b). How can this apparent puzzle be solved? We hypothesize that blockade of the PMv-to-MPFC pathway causes selective impairments of other-action monitoring, thereby leading to a difficulty extracting task-relevant information from observed actions. We consider that when information about others’ actions becomes virtually unavailable, the monkeys would take a choice strategy that is optimal in the nonsocial condition, that is, action selection based on self-action monitoring and reward feedback, like in the performance of a reversal-learning task in a solo condition. This choice strategy has been described as a ‘win-stay, lose-switch’ (continuation of the same response as long as it is rewarded but changes of the response once it is not rewarded). Under this condition, the monkeys would switch the target whenever no-rewards happens in the partner’s turns regardless of the actual causes of no-rewards (choice error or block switch; Supplementary Fig. 13). Then, choice performance after partner switch errors would be unaffected, or could become even better, because switching of the target is optimal in this case (Supplementary Fig. 1b). By contrast, lower performance levels would be expected after partner choice errors, because switching of the target is nonoptimal in this case (Supplementary Fig. 1a). These are exactly what we observed when the PMv-to-MPFC pathway was selectively blocked. Notably, this pattern of behavioral deficits closely resembles that observed in an autistic monkey with mutations in genes (HTR2C and ABCA13) linked to human psychiatric and neurodevelopmental disorders¹³. Specifically, the monkey with a spontaneous autistic phenotype in that study exhibited severe performance deficits after partner choice errors, but maintained high performance levels after partner switch errors, during a similar role-reversal choice task. Remarkably, partner-type neurons and mirror-type neurons were almost nonexistent in the MPFC of the autistic monkey¹³.

One of the major questions in social neuroscience is whether, and if so, how the PMv and MPFC coordinate with one another for processing social actions. While a meta-analysis of human neuroimaging has shown that the two areas are rarely concurrently active during social task performance¹, leading to the proposal that their roles in social cognition are mutually independent or even divergent^21,22, there have been alternative views that the MPFC may receive social information from the PMv to mediate social cognition^30,35,56. The present findings indicate that the ability of the MPFC to monitor the other’s action depends on input from the PMv, supporting a coordinative relationship between the two areas. Our findings can also extend the so-called broken mirror hypothesis, which postulates that people with ASD have dysfunctional mirror neurons in the premotor cortex^6,7,8. We now suggest that, rather than a deficit in single-neuron types in single brain regions, discoordination between the PMv and the MPFC underlies maladaptive social information processing in ASD. Our intervention procedure selectively targeting the PMv-to-MPFC pathway may provide a useful tool for making a reversible, nonhuman primate model of ASD.

Methods

Animals

Two male macaque monkeys [Macaca fuscata, monkey A (age 6, 5.1 kg), monkey B (age 6, 5.0 kg)] were used as experimental animals for neural recordings. Other monkeys C (age 8, 8.1 kg) and Q (age 5, 6.3 kg) also participated in this study solely as non-recorded subjects. All animal care and experimentation protocols were approved by the Institutional Animal Care and Use Committee of National Institutes of Natural Sciences and were conducted in accordance with the guidelines described in the US National Institutes of Health Guide for the Care and Use of Laboratory Animals.

Behavioral procedures

Role-reversal choice task [real agent (RA condition)]: The monkeys were trained to perform the role-reversal choice task (Fig. 1a) with an RA partner (Fig. 1b, left). During data collection, monkey A was paired with monkey B or monkey C, while monkey B was paired with monkey A, monkey C, monkey Q, or a human experimenter. Hereafter, the recorded monkey will be referred to as M1 (or self) and a non-recorded partner will be referred to as M2. In each experimental session, M1 and M2 sat in individual primate chairs facing each other. A square panel was attached horizontally to the front of each chair. Four buttons were placed on the panel: a circular one on the near side toward each participant, which served as a start button, and three rectangular ones on the far side, which served as target buttons. The chair panels for M1 and M2 were closely positioned (distance = ~1 cm; Fig. 1a), such that their target buttons looked continuous from the subjects’ viewpoints.

In each trial, one participant was assigned as the actor and the other the observer. These roles alternated every three trials. Each trial started when the start buttons of both participants turned on. After the participants pressed the start buttons with their right hands for 0.7–1.3 s, the target buttons on the actor’s side turned on. The actor was required to choose one of the three buttons within 3 s, only one of which was associated with a reward. The position-reward association remained the same for a block of 11–17 trials and then was changed unpredictably. A high-pitched tone (1 kHz) occurred together with each target button press regardless of its correctness. The observer was required to hold its start button throughout the trial. When the actor made a correct choice, both participants were rewarded with a drop of water 1300 ms after the target button press. Neither participant was rewarded when the actor made a wrong choice.

Role-reversal choice task [filmed monkey (FM) and filmed object (FO) conditions]: Two filmed conditions were also introduced. In these conditions, a large LCD monitor (W67.41 × H99.56 cm) was placed at the far end of M1’s chair panel; under this setting, the target buttons for M1 and those in the monitor were aligned in the same direction and looked continuous from M1’s viewpoint. In the FM condition, the partner replayed on the monitor was a monkey sitting in a primate chair. In the FO condition, the partner replayed on the monitor was a wooden stick. When the filmed partner was the actor, the monkey or stick on the monitor reached and pressed one of the three targets. When the filmed partner was the observer, the monkey or stick on the monitor kept pressing the start button. A blank screen was presented during intertrial intervals. During the whole experimental period, the overall correct rates for the performance of the filmed partners were set comparable to those of the RA partner (RA, 80%; FM, 78%; FO, 77%; P = 0.31, one-way analysis of variance).

To generate visual stimuli for the filmed conditions, actions performed by the filmed partners were recorded beforehand using a video camera (HDR-CX470, Sony, Tokyo, Japan) at 30 frames per second with a resolution of 1920 × 1080 pixels in an uncompressed format. The video sequences were then edited to obtain video clips replayed during actual data collection. The duration and resolution of video clips were 7–8 s and 800 × 860 pixels, respectively. Each video clip started about 1 s prior to the start button onset and ended about 3 s after target button press. Eight to ten different video clips were prepared for each target button choice.

The filmed condition to be tested first in each day was determined randomly, and the two filmed conditions alternated every nine blocks (1 block = 11–17 trials). Where possible, neural recording in the RA condition was also performed before or after the filmed conditions.

Surgical procedures

A plastic headpost and a plastic chamber were implanted on the skull under aseptic conditions for neuronal recording. The monkeys were initially anesthetized by intramuscular injections of ketamine HCl (10 mg/kg) and xylazine (1–2 mg/kg) and maintained under general anesthesia with isoflurane (1–2%). After the skull was exposed, acrylic screws were installed to fasten a dental acrylic head implant to the skull. A plastic headpost and two recording chambers were placed stereotaxically and secured with dental acrylic. The coordinates of the chambers were determined and confirmed to allow access to the PMv and MPFC bilaterally using magnetic resonance images taken pre- and post-surgery. Antibiotics and analgesics were administered after surgery.

Behavioral recording procedures

Stimulus presentation, behavioral monitoring, and reward delivery were controlled by a personal computer running the MonkeyLogic Matlab toolbox⁵⁷. Eye position data were streamed to the computer through an infrared video tracking system at a sampling rate of 1 kHz and a spatial resolution of 0.1° (EyeLink II; SR Research, Ontario, Canada). The water reward was delivered through a spout under the control of a solenoid valve placed outside the sound-attenuated room. Overt movements were continuously monitored using a video-capturing system.

Neuronal recording procedures

Electrophysiological experiments were performed in M1 (monkeys A and B). All sessions for neural recording in the RA condition were performed with a conspecific partner. Single-unit activity and LFPs were recorded simultaneously in the PMv and MPFC using two 16-channel electrodes (U- or S-probe, Plexon Inc., TX, USA), with an interelectrode spacing of 200 μm. The impedance for each channel was 0.3–0.5 MΩ at 1 KHz. Signals were initially amplified and bandpass-filtered (150 Hz to 8 kHz; OmniPlex system; Plexon Inc., TX, USA), and then single-unit activity was isolated using an online template-matching spike discriminator (SortClient; Plexon Inc., TX, USA). All well-isolated neurons were sampled. An oil-driven micromanipulator (MO-97A or MO-971A; Narishige, Tokyo, Japan) was used to advance each probe through a stainless-steel guide tube that was held in place by a grid. The grid allowed recordings every 0.5 mm between penetrations. LFP signals were bandpass-filtered (0.2–500 Hz) and digitized at 1 kHz (OmniPlex system; Plexon Inc., TX, USA) for offline analysis.

Identification of recording sites

PMv: The rostral bank of the arcuate sulcus was initially explored using intracortical microstimulation (ICMS; cathodal pulses of 0.2-ms duration at 333 Hz) to identify the posterior end of the frontal eye field (FEF). Briefly, any penetration was considered to be in the FEF if saccades were evoked by ICMS with low thresholds (typically 11 pulses at a current intensity of <50 μA)^48,58. During this mapping, the location of the spur of the arcuate sulcus was also noted on the basis of the complete lack of neuronal activity. The premotor cortex was located just caudal to the FEF, and the PMv was identified based on its location relative to the arcuate spur and on distal movements evoked by ICMS^59,60. During this functional mapping, we ‘clinically’ examined functional properties of well-isolated neurons^60,61. Briefly, the monkeys were presented with a food item and allowed to grasp it when at reaching distance. To test mirror properties, an experimenter also performed a series of hand actions in front of the monkeys, such as putting a food item on a surface and grasping it.

MPFC: The MPFC mainly included the presupplementary motor area (pre-SMA) and its rostrally adjacent prefrontal area 9. We first identified the border between the SMA and pre-SMA according to physiological criteria⁵⁴. Briefly, the face region of the SMA was carefully mapped on the basis of motor effects evoked by ICMS (typically 11–22 pulses at a current intensity of up to 40 μA) and neuronal responses to somatosensory stimuli. The pre-SMA was identified just rostral to the face region in the SMA, and was characterized by (1) a higher threshold for evoking forelimb movements (≥40 pulses and ≥40 μA), (2) complex motor effects involving multiple joints following ICMS, and (3) the prevalence of neurons responsive to visual rather than somatosensory stimuli. The rostral-most portion of the recording site was 12 mm anterior to the border between the SMA and pre-SMA.

Statistics

No statistical methods were used to predetermine sample sizes, but our sample sizes were similar to those reported in previous publications (for example, see refs. ^37,39,44). Data were assumed to be normally distributed, but this was not formally tested. We did not select the type of neurons during neural data acquisition; all well-isolated neurons were recorded. Data collection and analysis were not performed blinded to the conditions of the experiments. Correct target positions were pseudorandomly determined. Behavioral analysis was performed using data collected after the initiation of neuronal recordings. No data were excluded, except for LFP time series data with problems of colinearity, nonstationarity, and/or heteroscedasticity for the Granger causality analysis (see below)^62,63. All the statistical procedures for behavioral and neural data analysis were assessed by two-tailed tests unless otherwise noted and performed in Matlab 2016a and 2017a (MathWorks Inc., Natick, MA, USA) using Statistics and Machine Learning Toolbox, Signal Processing Toolbox, Parallel Computing Toolbox, Control System Toolbox, and Multivariate Granger Causality Toolbox.

Data analysis

Trial selection and calculation of chance level for M1’s performance following M2’s choice error: To calculate the optimal choice performance (percent optimal choice) following M2’s choice error (e.g., Fig. 1d), we selected M1-actor trials that met the following two conditions: (1) M1-actor trials were immediately preceded by M2’s choice error (i.e., M2 committed a choice error in the third of three consecutive M2-actor trials; Supplementary Fig. 1a, red cross), and (2) the correct target had been revealed, either by the choice of M1 or M2, at least once in the current block before the commission of M2’s choice error. Here, M1’s optimal performance was to choose the correct target in the current block (e.g., if B3 was associated with a reward, then the choice of only B3 was optimal; Supplementary Fig. 1a, gray square). Thus, the percent optimal choice by chance was 33.3%.

Trial selection and calculation of chance level for M1’s performance following M2’s switch error: To calculate the percent optimal choice following M2’s switch error (e.g., Fig. 1e), we selected M1-actor trials that met the following two conditions: (1) M1-actor trials were immediately preceded by M2’s switch error that occurred in the first trial in the current block, and (2) M2’s switch error occurred by choosing the target that was associated with a reward in the preceding block (Supplementary Fig. 1b). In this situation, M1’s optimal performance was to choose one of the two targets that were not associated with a reward in the preceding block (e.g., if B3 was associated with a reward in the preceding block, then the choice of B1 or B2 was optimal; Supplementary Fig. 1b, gray square). Thus, the percent optimal choice by chance was 66.7%.

Neuronal activity: We recorded spike activities of 565 PMv neurons and 480 MPFC neurons. The firing rates of individual neurons were measured during a control period (600–0 ms before target onset) and a peri-action period (from 400 ms before to 200 ms after target button press). We then performed a series of analyses for individual neurons, as follows. First, a two-way analysis of variance (P < 0.05) was performed for activity in the peri-action period to test the effects of two factors: agent (self or partner) and performance outcome (correct or incorrect). A neuron was then defined to be agent-selective if it showed a significant main effect of agent and was further classified into one of three types. Specifically, a neuron was classified as a self type (excitatory) if its peri-action-period activity was significantly higher in self-action trials than in partner-action trials (P < 0.05, Turkey-Kramer post-hoc test) and, additionally, its peri-action-period activity was significantly higher than the control-period activity in self-action trials (P < 0.05, paired t test). Alternatively, a neuron was classified as the self type (inhibitory) if its peri-action-period activity was significantly lower in self-action trials than in partner-action trials (P < 0.05, Turkey-Kramer post-hoc test) and, additionally, its peri-action-period activity was significantly lower than the control-period activity in self-action trials (P < 0.05, paired t test).

Likewise, a neuron was classified as a partner type (excitatory) if its peri-action-period activity was significantly higher in partner-action trials than in self-action trials (P < 0.05, Turkey-Kramer post-hoc test) and, additionally, its peri-action-period activity was significantly higher than the control-period activity in partner-action trials (P < 0.05, paired t test). Alternatively, a neuron was classified as the partner type (inhibitory) if its peri-action-period activity was significantly lower in partner-action trials than in self-action trials (P < 0.05, Turkey-Kramer post-hoc test) and, additionally, its peri-action-period activity was significantly lower than the control-period activity in partner-action trials (P < 0.05, paired t test). A partner-type neuron could be further classified as a partner-error type if it also showed a significant main effect of performance outcome. Specifically, an excitatory partner-type neuron was defined as an excitatory partner-error-type neuron if it exhibited peri-action-period activity that was significantly higher in partner-error trials than in partner-correct trials (P < 0.05, Turkey-Kramer post-hoc test). Conversely, an inhibitory partner-type neuron was defined as an inhibitory partner-error-type neuron if it exhibited peri-action-period activity that was significantly lower in partner-error trials than in partner-correct trials (P < 0.05, Turkey-Kramer post-hoc test). Finally, a neuron was classified as a mirror type if it lacked a significant main effect of agent, but its peri-action-period activity was significantly higher (excitatory) or lower (inhibitory) than the control-period activity in both self-action and partner-action trials (P < 0.05, paired t test).

In constructing continuous spike-density functions for populations of neurons, each spike was convolved with a Gaussian kernel (SD = 30 ms) for each neuron. The resulting spike densities for individual neurons were normalized from zero to maximum and were averaged for each neuron type.

To examine a biological preference of individual neurons, activity in the peri-action period was compared between the RA and FM conditions, and between the FM and FO conditions (P < 0.01, permutation test with 1000 iterations). Also, to examine a biological preference of the population of PMv neurons and MFC neurons, we constructed a histogram illustrating the distribution of differential firing rates. The differential firing rate was defined for each excitatory neuron as [(peri-action-period activity in the RA condition) minus (peri-action-period activity in the FM condition)] for the RA–FM comparison and [(peri-action-period activity in the FM condition) minus (peri-action-period activity in the FO condition)] for the FM–FO comparison. For each inhibitory neuron, the sign of this value was reversed. We then tested whether the differential firing rate at the population level was significantly different from zero (P < 0.05/3, Wilcoxon signed-rank test with Bonferroni correction). This comparison was made separately for each neuron type.

To compute actor selectivity for individual neurons (Supplementary Fig. 4b), we calculated another differential firing rate, which was defined as [(peri-action-period activity during partner-action) minus (peri-action-period activity during self-action)]. Here, self-action was defined as the action in self-correct trials. Partner-action was defined as the action in partner-error trials for partner-error-type neurons and the action in partner-correct trials for the other types of neurons. Again, the sign of the value was reversed for each inhibitory neuron.

Time-frequency domain analysis: We calculated time-varying power spectra of LFPs⁴⁶. Briefly, power in each frequency band was computed in 1-ms steps and in 1-Hz steps from 1 to 50 Hz. Each time-varying power spectra was normalized per frequency by the activity 500–0 ms before the target onset using a z-score normalization procedure and averaged across sessions. The strength of LFP power was quantified by averaging the z-scored time-varying power spectra (1–10 Hz and 23–30 Hz, −600 to 0 ms from target button press; 18–30 Hz, 200–1000 ms from target button press; Fig. 4d and Supplementary Figs. 8 and 9).

Field-field coherence: For each 16-channel electrode in the PMv and MPFC, the first derivative of the LFPs from adjacent channels were computed in the superficial direction to generate 15 bipolar LFPs. This procedure is known to effectively attenuate potential artifacts caused by electric volume conduction and the common reference, resulting in more spatially precise evaluation of signal interactions^46,64. The procedures for calculating coherence using bipolar LFPs were described in detail elsewhere⁶². Briefly, the bipolar LFPs from 2.5 s before to 2.5 s after target button press were concatenated for each recording session into one long time series for self-correct and partner-correct trials. The concatenated signals were then convolved with a complex Morlet wavelet function and divided into the original 5-s LFP segments. Coherence was calculated for all LFP pairs between the PMv and MPFC ranging from 1 to 50 Hz in a logarithmic step. Each coherence profile was normalized in the same way as the time-varying power spectra and averaged across sessions. To quantify the strength of delta-band coherence, z-scored coherent data were averaged between 1 and 3 Hz during a 300-ms period immediately before target button press (Supplementary Fig. 10).

Granger causality: To evaluate the direction of information flow between the PMv and MPFC, a Granger causality analysis⁶⁵ was applied to the bipolar LFPs simultaneously recorded in the two areas. This analysis was implemented using a multivariate linear vector autoregressive (MVAR) model provided by the Multivariate Granger Causality Toolbox⁶³ and described in detail elsewhere⁶⁶. Briefly, the bipolar LFP segments (300–0 ms before target button press) were analyzed. Akaike information criteria was used to estimate the best model order up to 50 ms. The MVAR model parameters for the selected model order were estimated using ordinary least-squares regression. The autocovariance sequence from the MVAR parameters was calculated for the LFP time series data without problems of colinearity, nonstationarity, and/or heteroscedasticity. Finally, the time-domain pairwise conditional Granger causality was estimated using F-testing with false discovery rate (Q < 0.05). The numbers of channel pairs with significant Granger causality for the PMv-to-MPFC direction and the MPFC-to-PMv direction were counted for quantitative comparisons. The Granger information flow was also estimated in the frequency domain between 1 and 120 Hz with a 1.67-Hz resolution, again using F-testing with false discovery rate (Q < 0.05). For quantitative comparisons, the number of channel pairs with significant Granger causality was counted in the delta-frequency band (1–3 Hz) where high coherent activities were observed.

Pathway-selective intervention

Injection of viral vectors: To selectively block the synaptic transmission of PMv neurons whose axons terminate in the MPFC, the genetic dissection method using two viral vectors^37,38 was applied to monkey B. A retrograde gene transfer adeno-associated viral vector serotype 2, which carried an enhanced reverse tetracycline trans-activator (AAV2-retro-CMV-rtTAV16, 1.1 × 10¹³ viral genomes/mL), was injected into the MPFC. A different serotype of the AAV vector, AAV-DJ, which carried enhanced tetanus neurotoxin light chain (eTeNT) and enhanced green fluorescent protein (eGFP) downstream of the tetracycline-responsive element (TRE) (AAV-DJ-TRE-EGFP-eTeNT-PEST, 2.1 × 10¹³ viral genomes/mL), was injected into the PMv. With these injections, PMv neurons whose axons terminate in the MPFC were double infected by the vectors (Fig. 6a, b).

Vector injections were made bilaterally by pressure through a 10-μL Hamilton microsyringe. For the MPFC, the AAV2-retro injections were made into three sites centered 6.5 mm rostral from the SMA/pre-SMA border in each hemisphere. The viral vector was deposited at two different depths for each site, aiming at 2.0 and 3.5 mm from the cortical surface (0.25–0.3 μL at each depth). For the PMv, the AAV-DJ injections were made into three sites centered 1.5 mm caudal from the posterior end of the arcuate sulcus and 3 mm ventral from the arcuate spur in each hemisphere. The viral vector was deposited at two different depths for each site aiming at 1.5 and 3.0 mm from the cortical surface (0.25–0.3 μL at each depth). The injection sites in the PMv and MPFC were determined on the basis of the finding that task-related neurons were abundantly found at those sites. Note that these sites in the two areas correspond to cortical locations with known mutual anatomical connections^23,26.

Dox administration: We performed four courses of Dox administration. The first course was initiated 7 weeks after the vector injections. Dox was administered orally at a dose of 25–30 mg/kg/day. The first to the fourth experiments were continued for 14, 18, 20, and 50 days, respectively, with an interval of at least 4 weeks. Control data were collected for 7 days before Dox administration. Behavioral data during Dox administration were collected every day except the fourth course [n = 18 (RA), 7 (FM), and 7 (FO) days for the fourth course]. The Dox test period was divided into an early and a late half for statistical analysis. During the Dox administration experiments, monkey B performed the task with a conspecific or a human experimenter. The overall correct rates were comparable with a monkey partner (80%, n = 54 sessions) and with a human partner (78%, n = 47 sessions; P = 0.22, Welch’s t test); data with two RA conditions were thus combined.

Gaze behavior: Possible changes in gaze behavior due to Dox administration were evaluated. Specifically, the proportion of the subject’s gaze at the partner’s correct target button was compared before (control period) and after (late test period) Dox administration (P < 0.05, Welch’s t test). For this comparison, M1’s gaze positions were quantified during a 300-ms period before the target button press. Eye positions in the fourth course were not available due to a technical error during data collection.

Histology

After the Dox administration experiments, monkey B was anesthetized deeply with sodium pentobarbital (70 mg/kg, i.v.) and transcardially perfused with 0.1 M phosphate-buffered saline (PBS; pH 7.4) and then 10% formalin in 0.1 M phosphate buffer (pH 7.4). The brain was removed from the skull and postfixed overnight. After saturated with 30% sucrose for 2 weeks at 4 °C, the brain was sectioned coronally at 50 μm thickness. A series of every tenth section was initially treated with 0.3% hydrogen peroxide in 0.1 M PBS for 30 min at room temperature to inhibit endogenous peroxidase. Subsequently, the sections were immersed in 1% skim milk for 1 h and incubated overnight at 4 °C with rabbit anti-GFP antibody (1:3000; Thermo Fisher Scientific, MA, USA) in 0.1 M PBS containing 0.1% Triton X-100 and 1% horse serum. The sections were incubated for 2 h in the same medium containing biotinylated horse anti-rabbit IgG antibody (1:200; Vector Laboratories, Peterborough, UK) and reacted with the ABC Elite kit (Vector Laboratories) for 1.5 h. For visualization of the antigen, the sections were reacted in 0.1 M PBS containing 0.04% 3,3′-diaminobenzidine and 0.002% hydrogen peroxide. An adjacent series of sections was Nissl-stained with 5% Cresyl Violet.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The authors declare that the main data supporting the findings of this study are available within the paper and its supplementary information files. Extra data are available from the corresponding author upon reasonable request. Source data are provided with this paper.

Code availability

Analyses were performed by using Matlab toolboxes (see Reporting summary for details). Matlab scripts employed are available from the corresponding author upon reasonable request.

References

Van Overwalle, F. & Baetens, K. Understanding others’ actions and goals by mirror and mentalizing systems: a meta-analysis. Neuroimage 48, 564–584 (2009).
Article PubMed Google Scholar
Ninomiya, T., Noritake, A., Ullsperger, M. & Isoda, M. Performance monitoring in the medial frontal cortex and related neural networks: from monitoring self actions to understanding others’ actions. Neurosci. Res. 137, 1–10 (2018).
Article PubMed Google Scholar
Sliwa, J. & Freiwald, W. A. A dedicated network for social interaction processing in the primate brain. Science 356, 745–749 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
di Pellegrino, G., Fadiga, L., Fogassi, L., Gallese, V. & Rizzolatti, G. Understanding motor events: a neurophysiological study. Exp. Brain Res. 91, 176–180 (1992).
Article PubMed Google Scholar
Gallese, V. Before and below ‘theory of mind’: embodied simulation and the neural correlates of social cognition. Philos. Trans. R. Soc. Lond. B Biol. Sci. 362, 659–669 (2007).
Article PubMed PubMed Central Google Scholar
Williams, J. H., Whiten, A., Suddendorf, T. & Perrett, D. I. Imitation, mirror neurons and autism. Neurosci. Biobehav. Rev. 25, 287–295 (2001).
Article CAS PubMed Google Scholar
Oberman, L. M. et al. EEG evidence for mirror neuron dysfunction in autism spectrum disorders. brain Res. Cogn. brain Res. 24, 190–198 (2005).
Article PubMed Google Scholar
Ramachandran, V. S. & Oberman, L. M. Broken mirrors: a theory of autism. Sci. Am. 295, 62–69 (2006).
Article ADS PubMed Google Scholar
Yoshida, K., Saito, N., Iriki, A. & Isoda, M. Representation of others’ action by neurons in monkey medial frontal cortex. Curr. Biol. 21, 249–253 (2011).
Article CAS PubMed Google Scholar
Falcone, R., Cirillo, R., Ferraina, S. & Genovesio, A. Neural activity in macaque medial frontal cortex represents others’ choices. Sci. Rep. 7, 12663 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Livi, A. et al. Agent-based representations of objects and actions in the monkey pre-supplementary motor area. Proc. Natl Acad. Sci. USA 116, 2691–2700 (2019).
Article CAS PubMed PubMed Central Google Scholar
Isoda, M. & Noritake, A. What makes the dorsomedial frontal cortex active during reading the mental states of others? Front Neurosci. 7, 232 (2013).
Article PubMed PubMed Central Google Scholar
Yoshida, K. et al. Single-neuron and genetic correlates of autistic behavior in macaque. Sci. Adv. 2, e1600558 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Gallese, V., Fadiga, L., Fogassi, L. & Rizzolatti, G. Action recognition in the premotor cortex. Brain 119(Pt 2), 593–609 (1996).
Article PubMed Google Scholar
Rizzolatti, G., Fadiga, L., Gallese, V. & Fogassi, L. Premotor cortex and the recognition of motor actions. Brain Res. Cogn. Brain Res. 3, 131–141 (1996).
Article CAS PubMed Google Scholar
Tai, Y. F., Scherfler, C., Brooks, D. J., Sawamoto, N. & Castiello, U. The human premotor cortex is ‘mirror’ only for biological actions. Curr. Biol. 14, 117–120 (2004).
Article CAS PubMed Google Scholar
Oberman, L. M., McCleery, J. P., Ramachandran, V. S. & Pineda, J. A. EEG evidence for mirror neuron activity during the observation of human and robot actions: toward an analysis of the human qualities of interactive robots. Neurocomputing 70, 2194–2203 (2007).
Article Google Scholar
Caggiano, V., Fleischer, F., Pomper, J. K., Giese, M. A. & Thier, P. Mirror neurons in monkey premotor area F5 show tuning for critical features of visual causality perception. Curr. Biol. 26, 3077–3082 (2016).
Article CAS PubMed Google Scholar
Kohler, E. et al. Hearing sounds, understanding actions: action representation in mirror neurons. Science 297, 846–848 (2002).
Article ADS CAS PubMed Google Scholar
Bonini, L., Maranesi, M., Livi, A., Fogassi, L. & Rizzolatti, G. Ventral premotor neurons encoding representations of action during self and others’ inaction. Curr. Biol. 24, 1611–1614 (2014).
Article CAS PubMed Google Scholar
Jacob, P. & Jeannerod, M. The motor theory of social cognition: a critique. Trends Cogn. Sci. 9, 21–25 (2005).
Article PubMed Google Scholar
Saxe, R. Against simulation: the argument from error. Trends Cogn. Sci. 9, 174–179 (2005).
Article PubMed Google Scholar
Luppino, G., Matelli, M., Camarda, R. & Rizzolatti, G. Corticocortical connections of area F3 (SMA-proper) and area F6 (pre-SMA) in the macaque monkey. J. Comp. Neurol. 338, 114–140 (1993).
Article CAS PubMed Google Scholar
Luppino, G., Calzavara, R., Rozzi, S. & Matelli, M. Projections from the superior temporal sulcus to the agranular frontal cortex in the macaque. Eur. J. Neurosci. 14, 1035–1040 (2001).
Article CAS PubMed Google Scholar
Rizzolatti, G. & Luppino, G. The cortical motor system. Neuron 31, 889–901 (2001).
Article CAS PubMed Google Scholar
Gerbella, M., Belmalih, A., Borra, E., Rozzi, S. & Luppino, G. Cortical connections of the anterior (F5a) subdivision of the macaque ventral premotor area F5. Brain Struct. Funct. 216, 43–65 (2011).
Article PubMed Google Scholar
Bruni, S. et al. Cortical and subcortical connections of parietal and premotor nodes of the monkey hand mirror neuron network. Brain Struct. Funct. 223, 1713–1729 (2018).
PubMed Google Scholar
Albertini, D. et al. Connectional gradients underlie functional transitions in monkey pre-supplementary motor area. Prog. Neurobiol. 184, 101699 (2020).
Article CAS PubMed Google Scholar
Uddin, L. Q., Iacoboni, M., Lange, C. & Keenan, J. P. The self and social cognition: the role of cortical midline structures and mirror neurons. Trends Cogn. Sci. 11, 153–157 (2007).
Article PubMed Google Scholar
Keysers, C. & Gazzola, V. Integrating simulation and theory of mind: from self to social cognition. Trends Cogn. Sci. 11, 194–196 (2007).
Article PubMed Google Scholar
de Lange, F. P., Spronk, M., Willems, R. M., Toni, I. & Bekkering, H. Complementary systems for understanding action intentions. Curr. Biol. 18, 454–457 (2008).
Article PubMed CAS Google Scholar
Schippers, M. B., Gazzola, V., Goebel, R. & Keysers, C. Playing charades in the fMRI: are mirror and/or mentalizing areas involved in gestural communication? PLoS One 4, e6801 (2009).
Article ADS PubMed PubMed Central CAS Google Scholar
Centelles, L., Assaiante, C., Nazarian, B., Anton, J. L. & Schmitz, C. Recruitment of both the mirror and the mentalizing networks when observing social interactions depicted by point-lights: a neuroimaging study. PLoS One 6, e15749 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Becchio, C. et al. Social grasping: from mirroring to mentalizing. Neuroimage 61, 240–248 (2012).
Article PubMed Google Scholar
Sperduti, M., Guionnet, S. & Fossati, P. & Nadel, J. Mirror Neuron System and Mentalizing System connect during online social interaction. Cogn. Process 15, 307–316 (2014).
Article PubMed Google Scholar
Vogeley K. Two social brains: neural mechanisms of intersubjectivity. Philos. Trans. R Soc. Lond B Biol. Sci. 372, 20160245 (2017).
Kinoshita, M. et al. Genetic dissection of the circuit for hand dexterity in primates. Nature 487, 235–238 (2012).
Article ADS CAS PubMed Google Scholar
Kinoshita, M. et al. Dissecting the circuit for blindsight to reveal the critical role of pulvinar and superior colliculus. Nat. Commun. 10, 135 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Yoshida, K., Saito, N., Iriki, A. & Isoda, M. Social error monitoring in macaque frontal cortex. Nat. Neurosci. 15, 1307–1312 (2012).
Article CAS PubMed Google Scholar
Muthukumaraswamy, S. D., Johnson, B. W. & McNair, N. A. Mu rhythm modulation during observation of an object-directed grasp. Brain Res. Cogn. Brain Res. 19, 195–201 (2004).
Article PubMed Google Scholar
Bernier, R., Dawson, G., Webb, S. & Murias, M. EEG mu rhythm and imitation impairments in individuals with autism spectrum disorder. Brain Cogn. 64, 228–237 (2007).
Article CAS PubMed PubMed Central Google Scholar
Kilner, J. M., Kraskov, A. & Lemon, R. N. Do monkey F5 mirror neurons show changes in firing rate during repeated observation of natural actions? J. Neurophysiol. 111, 1214–1226 (2014).
Article CAS PubMed Google Scholar
Caggiano, V., Giese, M., Thier, P. & Casile, A. Encoding of point of view during action observation in the local field potentials of macaque area F5. Eur. J. Neurosci. 41, 466–476 (2015).
Article PubMed Google Scholar
Waldert, S., Vigneswaran, G., Philipp, R., Lemon, R. N. & Kraskov, A. Modulation of the intracortical LFP during action execution and observation. J. Neurosci. 35, 8451–8461 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bimbi, M. et al. Simultaneous scalp recorded EEG and local field potentials from monkey ventral premotor cortex during action observation and execution reveals the contribution of mirror and motor neurons to the mu-rhythm. Neuroimage 175, 22–31 (2018).
Article PubMed Google Scholar
Ninomiya, T., Dougherty, K., Godlove, D. C., Schall, J. D. & Maier, A. Microcircuitry of agranular frontal cortex: contrasting laminar connectivity between occipital and frontal areas. J. Neurophysiol. 113, 3242–3255 (2015).
Article PubMed PubMed Central Google Scholar
Mechler, F. & Ringach, D. L. On the classification of simple and complex cells. Vis. Res. 42, 1017–1033 (2002).
Article PubMed Google Scholar
Bruce, C. J. & Goldberg, M. E. Primate frontal eye fields. I. Single neurons discharging before saccades. J. Neurophysiol. 53, 603–635 (1985).
Article CAS PubMed Google Scholar
Matsumoto, M. & Hikosaka, O. Two types of dopamine neuron distinctly convey positive and negative motivational signals. Nature 459, 837–841 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Mitchell, J. P., Macrae, C. N. & Banaji, M. R. Dissociable medial prefrontal contributions to judgments of similar and dissimilar others. Neuron 50, 655–663 (2006).
Article CAS PubMed Google Scholar
Kang, S. K., Hirsh, J. B. & Chasteen, A. L. Your mistakes are mine: self-other overlap predicts neural response to observed errors. J. Exp. Soc. Psychol. 46, 229–232 (2010).
Article Google Scholar
Buzsaki, G. & Draguhn, A. Neuronal oscillations in cortical networks. Science 304, 1926–1929 (2004).
Article ADS CAS PubMed Google Scholar
Jensen, O. & Colgin, L. L. Cross-frequency coupling between neuronal oscillations. Trends Cogn. Sci. 11, 267–269 (2007).
Article PubMed Google Scholar
Matsuzaka, Y., Aizawa, H. & Tanji, J. A motor area rostral to the supplementary motor area (presupplementary motor area) in the monkey: neuronal activity during a learned motor task. J. Neurophysiol. 68, 653–662 (1992).
Article CAS PubMed Google Scholar
Aizawa, H. & Tanji, J. Corticocortical and thalamocortical responses of neurons in the monkey primary motor cortex and their relation to a trained motor task. J. Neurophysiol. 71, 550–560 (1994).
Article CAS PubMed Google Scholar
Isoda, M. Understanding intentional actions from observers’ viewpoints: a social neuroscience perspective. Neurosci. Res. 112, 1–9 (2016).
Article PubMed Google Scholar
Asaad, W. F. & Eskandar, E. N. A flexible software tool for temporally-precise behavioral control in Matlab. J. Neurosci. Methods 174, 245–258 (2008).
Article PubMed PubMed Central Google Scholar
Schall, J. D. Neuronal activity related to visually guided saccades in the frontal eye fields of rhesus monkeys: comparison with supplementary eye fields. J. Neurophysiol. 66, 559–579 (1991).
Article CAS PubMed Google Scholar
Rizzolatti, G. et al. Functional organization of inferior area 6 in the macaque monkey. II. Area F5 and the control of distal movements. Exp. Brain Res 71, 491–507 (1988).
Article CAS PubMed Google Scholar
Umilta, M. A. et al. I know what you are doing. a neurophysiological study. Neuron 31, 155–165 (2001).
Article CAS PubMed Google Scholar
Fogassi, L. et al. Parietal lobe: from action organization to intention understanding. Science 308, 662–667 (2005).
Article ADS CAS PubMed Google Scholar
Cohen M. X. Analyzing neural time series data: Theory and practice. (The MIT Press, 2014).
Barnett, L. & Seth, A. K. The MVGC multivariate Granger causality toolbox: a new approach to Granger-causal inference. J. Neurosci. Methods 223, 50–68 (2014).
Article PubMed Google Scholar
Bastos, A. M. & Schoffelen, J. M. A tutorial review of functional connectivity analysis methods and their interpretational pitfalls. Front Syst. Neurosci. 9, 175 (2015).
PubMed Google Scholar
Granger, C. W. J. Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37, 424–438 (1969).
Article MATH Google Scholar
Noritake, A., Ninomiya, T. & Isoda, M. Social reward monitoring and valuation in the macaque brain. Nat. Neurosci. 21, 1452–1462 (2018).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank M. Yoshida, S. Tomatsu, I. Yokoi, N. Goda, and A. Uematsu for helpful discussions; and M. Togawa, Y. Yamanishi, T. Jochi, K. Takada, R. Kageyama, and A. Shibata for technical assistance. Japanese monkeys used in this study were provided by the National Bio-Resource Project ‘Japanese Macaques’ of Japan Agency for Medical Research and Development, AMED. This research was supported by AMED under Grant Numbers JP20dm0107145 (to M.I.) and JP20dm0307005 (to M.I.).

Author information

Authors and Affiliations

Division of Behavioral Development, Department of System Neuroscience, National Institute for Physiological Sciences, National Institutes of Natural Sciences, 38 Myodaiji, Okazaki, Aichi, 444-8585, Japan
Taihei Ninomiya, Atsushi Noritake & Masaki Isoda
Department of Physiological Sciences, School of Life Science, The Graduate University for Advanced Studies (SOKENDAI), Hayama, Kanagawa, 240-0193, Japan
Taihei Ninomiya, Atsushi Noritake, Kenta Kobayashi & Masaki Isoda
Section of Viral Vector Development, National Institute for Physiological Sciences, National Institutes of Natural Sciences, 38 Myodaiji, Okazaki, Aichi, 444-8585, Japan
Kenta Kobayashi

Authors

Taihei Ninomiya
View author publications
You can also search for this author in PubMed Google Scholar
Atsushi Noritake
View author publications
You can also search for this author in PubMed Google Scholar
Kenta Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Masaki Isoda
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.N. and M.I. designed the experiments. T.N. and A.N. performed the experiments. T.N. analyzed data. K.K. designed and made viral vectors. T.N. and M.I. wrote the paper. A.N. and K.K. assisted with data analysis and paper preparation. M.I. supervised the project.

Corresponding author

Correspondence to Masaki Isoda.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Olga Dal Monte and other, anonymous, reviewers for their contributions to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Movie 1

Supplementary Movie 2

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ninomiya, T., Noritake, A., Kobayashi, K. et al. A causal role for frontal cortico-cortical coordination in social action monitoring. Nat Commun 11, 5233 (2020). https://doi.org/10.1038/s41467-020-19026-y

Download citation

Received: 14 April 2020
Accepted: 25 September 2020
Published: 16 October 2020
DOI: https://doi.org/10.1038/s41467-020-19026-y

This article is cited by

The role of dorsal premotor cortex in joint action inhibition
- Elisa Dolfini
- Pasquale Cardellicchio
- Alessandro D’Ausilio
Scientific Reports (2024)
Chemogenetic dissection of a prefrontal-hypothalamic circuit for socially subjective reward valuation in macaques
- Atsushi Noritake
- Taihei Ninomiya
- Masaki Isoda
Nature Communications (2023)
Cognitive genomics of learning delay and low level of social performance monitoring in macaque
- Taihei Ninomiya
- Atsushi Noritake
- Masaki Isoda
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.