Introduction

Anxiety disorders rank among the most common mental disorders, with a 12-month prevalence of 14% in the EU [1]. Although exposure-based techniques are effective in treating anxiety disorders, relapse is frequently observed [2, 3]. Therefore, various lines of translational research test new behavioral and pharmacological augmentation strategies that could improve long-term retention and reduce relapse rates [4].

d-cycloserine (DCS), a partial N-methyl-d-aspartate (NMDA) receptor agonist, might show promise as a cognitive enhancer in humans, as glutamatergic activation of the NMDA receptor influences long-term potentiation-dependent forms of learning and memory [5]. NMDA receptors are widespread in amygdala, hippocampus and other brain regions critically involved in fear processing and associative learning [6,7,8,9].

While animal studies found DCS pre- or post-learning administration to facilitate extinction retention [10, 11], meta-analyses from clinical trials investigating DCS-augmented exposure therapy in anxiety disorders remain inconclusive, with results ranging from no evidence [12] to small [13] to medium effects [14]. Human fear conditioning and extinction as a laboratory model for the development and treatment of anxiety disorders [4, 15, 16] therefore provide an important tool to investigate the precise working mechanism of DCS in human extinction learning thought to underlie exposure therapy.

During conditioning, a neutral stimulus (CS+) elicits a fear response (CR) via repeated coupling with an aversive stimulus (US). During extinction, the CS+ is no longer followed by the US, resulting in a reduced CR. Extinction is thought to establish a new, inhibitory association while leaving the initial CS−US association intact [17], explaining why fear often returns after successful extinction learning [15]. Neuroimaging studies revealed a network involved in fear conditioning, comprising amygdala, hippocampus, anterior insula, and dorsal anterior cingulate cortex (dACC) as key structures [18, 19]. Accumulating evidence points towards a specific role of the ventromedial prefrontal cortex (vmPFC) in extinction retention [20], mediating the inhibition of conditioned responding [21,22,23]. Supporting the dual-model [17], ROF following reinstatement has been associated with increased blood-oxygen-level-dependent (BOLD) activation in the amygdala and hippocampus, while decreasing vmPFC involvement [24, 25].

Only few human laboratory studies experimentally investigated DCS-augmented fear extinction in healthy humans [26,27,28,29], yielding inconclusive results. While two studies [27, 28] observed no effect of either 50 or 250 mg of DCS administered 2–3 h prior to extinction on ROF after simple extinction recall or reinstatement—evaluating fear-potentiated startle and skin conductance responses (SCRs)—Kuriyama et al. [29] found 100 mg of DCS administered 1.5 h prior to extinction to attenuate differential SCRs after a reactivation procedure but not after simple extinction recall 24 h later. One imaging study used a 2-day fMRI design in a combined context conditioning and extinction paradigm and found post-learning administration of 500 mg of DCS to increase differential SCRs across contexts and neural activation in posterior hippocampus/collateral sulcus and medial prefrontal cortex/ACC in a 72-h delayed reinstatement test, suggesting enhanced fear memory consolidation [26].

Regarding DCS-augmentation of appetitive extinction [30, 31], 125 mg of DCS administered immediately post-learning reduced renewal of conditioned sexual responses and subjective CS valence ratings 24 h later in healthy females [30] and 50 mg of DCS ingested 1 h before appetitive extinction attenuated amygdala activation during 24-h delayed extinction recall following a reactivation procedure [31].

Heterogeneous dosages and timing of DCS administration, reliance on different readout measures or concomitant conditioning and extinction within one session limit comparability between studies and complicate interpretation of results. The inconclusive situation regarding the mechanisms of DCS-augmented human fear extinction motivated us to study the mechanisms of DCS on ROF in more depth, using a pharmacological fMRI study with multiple outcome measures in healthy participants. We administered 50 mg of DCS 1 h before extinction learning in a double-blind, placebo-controlled, 3-day fMRI design, to allow consolidation between learning and separate the effect of DCS during extinction from conditioning. We hypothesized that DCS would facilitate extinction learning consolidation, such that participants from the DCS group should exhibit reduced ROF (i.e. attenuated differential CRs from extinction learning to recall) on a subjective (valence and arousal ratings), psychophysiological (SCRs), and neural (BOLD) level. We anticipated that DCS would attenuate differential BOLD activation in fear-associated structures (amygdala, hippocampus, anterior insula, dACC), while increasing activation in inhibitory structures (vmPFC).

Materials and methods

Participants

Thirty-seven healthy participants (n = 20, DCS/n = 17, placebo), providing complete and quality-controlled fMRI data for all 3 days, were included in the study (Fig. S1). Participants were recruited at two study sites (Berlin: n = 11, DCS/n = 13, placebo; Dresden: n = 9, DCS/n = 4, placebo) from the general public and student population via community- and university-based advertisement as part of a registered, multicenter clinical trial of the national research initiative Panic-Net. Exclusion criteria comprised current or past psychiatric (confirmed via standardized clinical interview [32]), neurological or internal medical disorders, pregnancy, positive urinary drug screening, and color blindness or weakness [33]. Experimental groups did not differ in sociodemographic or neuropsychological measures nor subclinical levels of anxiety sensitivity (ASI [34]; Table S1). Participants provided written informed consent and received 75€ for participation. A subsample (n = 6, DCS/n = 9, placebo) also underwent an appetitive conditioning task on the same day, as previously published [31]. The study was approved by the ethics committees of Berlin (LAGeSo; EudraCT-Nr.: 2010-023044-32) and Technische Universität Dresden (EK 62022010).

Experimental protocol

Subjects underwent a 3-day fear conditioning and delayed extinction paradigm [35] to investigate the effect of DCS-augmented extinction training on ROF during extinction recall. They received either 50 mg of DCS or placebo 1 h before extinction training on day 2 in a randomized, double-blinded trial (Fig. 1a).

Fig. 1
figure 1

Experimental design and paradigm. a Healthy participants were enrolled in a double-blind, placebo-controlled design with three experimental sessions—Pavlovian conditioning, extinction, and extinction recall—spaced approximately 24 h apart to allow consolidation between sessions. An oral dose of 50 mg of DCS or placebo was administered 1 h before extinction training. b Exemplary trial: In each trial, a CS was presented for 5 s, followed by a jittered ITI (range: 7.7–16.2 s). In case of a CS+ trial during acquisition, the US (aversive panic scream, 2 s duration) appeared simultaneously with CS+ offset (100% reinforcement schedule)

Fear conditioning and delayed extinction paradigm

The paradigm comprised three sessions: habituation and fear conditioning (day 1), extinction learning (day 2), and extinction recall (day 3) [35]. Two male faces from the Ekman series [36] served as cues, with picture-cue assignment counterbalanced across subjects. Each session comprised three phases with eight CS+/CS− trials in pseudo-randomized order, resulting in 48 trials per session (day 1: habituation, early/late acquisition; day 2: early/middle/late extinction training; day 3: early/middle/late extinction recall). During both acquisition phases, the CS+ was followed by an auditory US (aversive panic scream, 100% reinforcement schedule) presented using MR-compatible headphones, while the CS− was never reinforced (Fig. 1b). In the remaining phases (habituation, extinction learning, extinction recall), only unreinforced CS+ were presented. US intensity (volume) was individually adjusted prior to conditioning using a 9-point Likert scale (1 = not aversive; 9 = very aversive) to meet a predefined criterion (US rating ≥ 8).

Stimulus presentation was controlled via Presentation 14 software (Neurobehavioral Systems; https://www.neurobs.com).

Drug administration (day 2) and blinding

Participants received an oral dose of 50 mg of DCS (reformulated from 250 mg capsules, Seromycin®, USA) or placebo 1 h before extinction, a dosage and timing shown to be effective in augmenting exposure therapy for anxiety [13, 14]. To ensure double-blinding, identical capsules in blocks of four, each containing two DCS/placebo pills in random order, were prepared by the Charité pharmacy, who also carried out unblinding after study completion. Although participants were not explicitly asked about their suspicions regarding group assignment, none of the participants experienced side effects or adverse events assessed on day 3, suggesting effective blinding.

Data acquisition and preprocessing

Contingency knowledge

Participants’ awareness of the CS−US contingency was assessed in a post-experimental interview outside the scanner on day 1 (for details see [35]). Thirty-four participants were classified as aware, while three (2 DCS/1 placebo) were classified as unaware.

Valence and arousal ratings

Subjective ratings of CS valence and arousal were obtained at six time points within the fMRI: immediately before (i.e. after the habituation trials)/after acquisition (day 1), before/after extinction learning (day 2), and before/after extinction recall (day 3) using an MR-compatible button box. Participants rated each cue on two 9-point Likert scales ranging from −4 = “negative” to 4 = “positive” and 1 = “not at all” to 9 = “very much”, respectively.

SCRs

Skin conductance was recorded continuously from the nondominant hand during all sessions. Two Ag/AgCl electrodes were attached to either the thenar and hypothenar eminence sampled at 50 Hz in Berlin (MP150, Biopac Systems, Goleta, USA) or the second phalanx of middle and index finger sampled at 1000 Hz in Dresden (MR-compatible BrainAmp ExG amplifier, Brain Products, Munich, Germany). Preprocessing and statistical analysis of single-subject data was performed within the PsPM toolbox, using the general linear model (GLM) approach (4.0.2; http://pspm.sourceforge.net; see Supplementary Materials). Data from two participants were lost due to technical failure, leaving 35 subjects for analysis.

fMRI

Structural and functional images were acquired on 3-Tesla scanners (Trio, Siemens AG, Erlangen, Germany; see Supplementary Materials for scanner comparability analyses). Functional images were acquired in an interleaved fashion using a standard EPI sequence (voxel size = 3 × 3 × 3 mm, 41 slices, TR = 2.5 s, TE = 25 ms, 64 × 64 matrix; 192 × 192-mm FOV; 368 volumes on day 1, 356 volumes on days 2 and 3). Images were angled 20° to the anterior−posterior commissure. The first five volumes per session were discarded to avoid T1 equilibrium effects. A T1-weighted structural scan (MPRage, voxel size = 1 × 1 × 1 mm) was acquired.

Imaging data were analyzed within SPM8 (https://www.fil.ion.ucl.ac.uk/spm/). Preprocessing included slice time correction, realignment to the mean EPI, coregistration and segmentation of the structural image, spatial normalization to MNI space (3-mm isotropic voxel resolution), and iterative smoothness equalization to a target smoothness of 10-mm full-width at half maximum Gaussian kernel to account for differences in intrinsic smoothness between scanners [37].

Statistical analyses

Behavioral and psychophysiological measures

All analyses included study site as a covariate and were performed using R software (v3.4.3; [38]). Conditioning effects in valence and arousal ratings were analyzed in separate repeated measures ANCOVAs (rmANCOVA) with within-subject factors cue (CS+/CS−) and time (pre-/post-acquisition on day 1). DCS effects on ROF, i.e. increased conditioned responding from extinction learning to extinction recall, were assessed by contrasting participants’ post-extinction (day 2) with pre-recall (day 3) ratings in two mixed ANCOVAs with within-subject factors cue (CS+/CS−) and time (post-extinction/pre-recall) and between-subject factor group (DCS/placebo). SCRs were analyzed analogously, whereby “time” refers to early/late acquisition to assess conditioned responses on day 1 and to late extinction/early recall for evaluation of ROF. Significant effects were followed up by planned FDR-corrected [39] post-hoc t tests.

fMRI

Individual subject data were modeled using a GLM including each day as separate session. Event onsets (CS+/CS−/US) for each of the nine phases were modeled as stick functions and convolved with the canonical HRF. Rating phases (modeled as box-car functions) and six movement parameters per session were entered as additional regressors. Baseline contrasts for CS+ and CS− were computed for each phase and entered into two random-effects flexible factorial models on the group level, both including study site as a covariate.

The neural signatures of fear conditioning (day 1) were investigated in a flexible factorial model including CS+ and CS− regressors for both acquisition phases. The main effect of conditioning was assessed with the differential contrast “CS+ > CS−” during acquisition. Possible time effects over the course of conditioning were investigated by assessing interactions with conditioning phase (early[CS+ > CS−] vs. late[CS+ > CS−]).

The hypothesis that DCS facilitates extinction recall through enhanced extinction learning consolidation was investigated in a flexible factorial model including CS+ and CS− regressors for the three extinction learning (day 2) and recall (day 3) phases and the experimental group factor (DCS/placebo). Paralleling the behavioral and psychophysiological analyses, ROF 24 h post-extinction was evaluated by comparing each individuals’ differential BOLD responses during extinction learning on day 2 with extinction recall on day 3, examining the contrast extinction recall[CS+ > CS−] > extinction[CS+ > CS−] over subjects and the critical interaction with experimental group. This analysis incorporated all trials per session to maximize statistical power. To specify the direction of effects, post-hoc analyses were conducted for each group separately. As an additional exploratory analysis, we also investigated group differences during the first recall phase only (first eight CS+/CS− presentations).

Our analyses focused on predefined regions of interest (ROIs) using small volume correction (SVC) at p < 0.05 FWE-corrected, specifically, insula, dACC, amygdala, hippocampus and vmPFC [18,19,20]. ROI masks for insula, amygdala and hippocampus were derived from the WFU PickAtlas (http://www.fmri.wfubmc.edu/download.htm). ROIs of dACC and vmPFC were created using a 10-mm sphere on the midline-centered coordinates [x = 0, y = 18, z = 34] and [x = 0, y = 34, z = −6] derived from a meta-analysis on fear extinction recall in healthy participants [20]. We also performed whole-brain analyses, using an initial cluster-forming threshold of p < 0.001 uncorrected and ten contiguous voxels; clusters surviving p < 0.05 FWE correction at the cluster level are reported. For plotting purposes, mean beta estimates within a 6-mm sphere surrounding peak activations were extracted.

Results

Fear conditioning

Behavioral and psychophysiological measures

Valence ratings showed a main effect of cue (F1,36 = 4.47, p = 0.042, η2p = 0.11) and a cue × time interaction (F1,36 = 16.30, p < 0.001, η2p = 0.31; Fig. 2a). Post-hoc t tests revealed that CS+ valence significantly decreased (t36 = 4.23, pFDR < 0.001, d = 0.70) and CS− valence increased (t36 = −2.47, pFDR = 0.024, d = 0.41) from pre- to post-conditioning, causing a significant differentiation between cues after acquisition (t36 = 3.65, pFDR = 0.002, d = 0.60) but not at baseline (t36 = −0.23, p = 0.817). Analyzing arousal ratings, significant main effects of cue (F1,36 = 13.92, p = 0.001, η2p = 0.28) and time (F1,36 = 4.69, p = 0.037, η2p = 0.12), as well as a significant cue × time interaction emerged (F1,36 = 33.36, p < 0.001, η2p = 0.48; Fig. 2b). Post-hoc t tests confirmed that arousal increases towards the CS+ (t36 = −4.27, pFDR < 0.001, d = 0.70) and decreases towards the CS− (t36 = 3.10, pFDR = 0.004, d = 0.51) led to differential ratings post-conditioning (t36 = −5.01, pFDR < 0.001, d = 0.82) in absence of baseline differences (t36 = 0.53, pFDR = 0.597).

Fig. 2
figure 2

Behavioral, psychophysiological, and BOLD responses during fear conditioning. ac Conditioning was associated with decreased valence ratings (9-point Likert scale: −4: “negative” to +4: “positive”) and increased arousal ratings (9-point Likert scale: 1: “not at all” to 9: “very much”) towards the CS+ compared to the CS− from pre- to post conditioning, and induced differential SCRs (CS+ > CS−) across both phases of conditioning. Bar graphs represent the mean ± within-subject SEM [68, 69]. d Differential BOLD responses during conditioning, displayed at p < 0.001 with k ≥ 10 cluster extent

SCRs showed a main effect of cue with higher SCRs towards the CS+ compared to the CS− (F1,34 = 7.20, p = 0.011, η2p = 0.17; Fig. 2c) and a main effect of time due to general declines in SCR amplitudes over phases (F 1,34 = 9.08, p = 0.005, η2p = 0.21), but no significant cue × time interaction (F1,34 = 3.99, p = 0.054, η2p = 0.11).

fMRI

Differential BOLD responses during acquisition were observed in bilateral insula (left: x = −33, y = −31, z = 19, Z = 3.83, pFWE ROI = 0.013; right: x = 48, y = 8, z = 1, Z = 3.57, pFWE ROI = 0.013) and dACC (left: x = −3, y = 14, z = 31, Z = 3.02, pFWE ROI = 0.041; right: x = 9, y = 14, z = 37, Z = 3.53, pFWE ROI = 0.009). Moreover, whole-brain analyses revealed increased BOLD responses towards CS+ compared to CS− in midbrain, bilateral supplementary motor area (SMA), transverse temporal gyrus (TTG), and precentral gyrus (Table 1, Fig. 2d). No significant amygdala or hippocampal activation was observed. Time-based analyses over acquisition phases revealed no significant time-dependent activation changes but only a trendwise increase in differential responding from early to late acquisition in the right amygdala (x = 24, y = 2, z = −22, Z = 2.55, pFWE ROI = 0.090).

Table 1 Whole-brain results during conditioning for the contrast CS+ > CS− across participants

DCS effects on return of fear

Behavioral and psychophysiological measures

ROF in valence and arousal ratings was investigated by comparing post-extinction with pre-recall ratings. With respect to valence (Fig. 3a), no cue × time interaction was present (p = 0.658), indicating no ROF in this measure across subjects. The cue × time × group interaction was only marginally significant (F1,35 = 2.96, p = 0.094, η2p = 0.08) due to numerically decreasing valence ratings from post-extinction to pre-recall in the placebo group (corresponding to ROF) but increasing valence ratings in the DCS group. We further observed a significant main effect of group due to overall lower valence ratings in the placebo compared to the DCS group (F1,35 = 5.70, p = 0.023, η2p = 0.14). No further effects were significant (all p ≥ 0.508). Analyzing ROF in arousal ratings revealed a main effect of cue due to higher arousal towards the CS+ compared to the CS− across phases (F1,35 = 7.43, p = 0.010, η2p = 0.18), indicating incomplete extinction of subjective arousal that persisted until extinction recall. No cue × time or cue × time × group effects were observed (p ≥ 0.169), indicating no differential ROF. However, a significant group × time interaction (F1,35 = 7.98, p = 0.008, η2p = 0.19) yielded evidence for a rather generalized ROF in the placebo group (Fig. 3b). Post-hoc t tests confirmed a significant increase in overall arousal ratings from post-extinction to pre-recall only in the placebo group (t16 = −2.73, pFDR = 0.030, d = 0.66; DCS: t19 = 1.17, pFDR = 0.259); an effect mainly driven by increases towards the CS+ rather than the CS−. No significant main or interaction effects were observed in the analysis of SCRs (p ≥ 0.259).

Fig. 3
figure 3

Group differences in behavioral and neural measures of return of fear. a, b While no significant differential ROF was observed across or between groups in rating measures of CS valence or arousal, subjects in the placebo but not DCS group showed nondifferential ROF with increased arousal ratings from extinction training to extinction recall. Bar graphs represent the mean ± within-subject SEM [68, 69]. c Participants in the placebo compared to the DCS group showed stronger ROF on a neural level, that is increased differential BOLD responses from extinction learning to extinction recall in left amygdala (MNI peak at [x: −24, y: 2, z: −23]; Z = 3.15; pFWE ROI = 0.019) and posterior hippocampus (MNI peak at [x: −33, y: −34, z: −8], Z = 3.39, pFWE ROI = 0.033). Bar graphs represent mean parameter estimates from a 6-mm sphere surrounding peak voxel activation ± within-subject SEM [68, 69]. For display purposes, T-maps are shown on a visualization threshold of p < 0.005 with k ≥ 5 cluster extent

fMRI

While we did not observe an overall ROF in differential BOLD responses when examining the contrast recall[CS+ > CS−] > extinction[CS+ > CS−], placebo compared to DCS subjects showed significant increases in differential BOLD responses in the left amygdala (x = −24, y = 2, z = −23, Z = 3.15, pFWE ROI = 0.019) and left posterior hippocampus from extinction learning to recall (x = −33, y = −34, z = −8, Z = 3.39, pFWE ROI = 0.033; Fig. 3c). Separate post-hoc analyses showed that the amygdala effect was driven by a marginally significant decrease towards the CS+ from extinction learning to recall in the DCS group (x = −24, y = 2, z = −23, Z = 2.63, pFWE ROI = 0.075). In contrast, the placebo group exhibited significant increases in differential BOLD responses in the posterior hippocampus from extinction learning to recall (x = −33, y = −31, z = −8, Z = 4.07, pFWE ROI = 0.003).

As an additional exploratory analysis, we also investigated spontaneous recovery effects by analyzing the first phase of the recall session only. Group comparisons revealed stronger BOLD responses towards CS+ compared to CS− in the placebo compared to DCS group in the right dACC (x = 6, y = 14, z = 40; Z = 3.14, pFWE ROI = 0.033) and left insula (x = −33, y = −16, z = 7; Z = 3.53, pFWE ROI = 0.037; x = −27, y = 17, z = 10; Z = 3.30, pFWE ROI = 0.074; Fig. S2). The reverse contrast (DCS > placebo) revealed no significant effects.

Discussion

The present study investigated the effect of DCS-augmented extinction learning on behavioral, psychophysiological, and neural indices of ROF by applying a 3-day fear conditioning and delayed extinction paradigm that allowed for consolidation between learning sessions. We found that 50 mg of DCS facilitated long-term extinction retention, as only participants in the placebo but not the DCS group experienced a generalized ROF in arousal ratings. This was accompanied by relative downregulation of amygdala activation in the DCS group from extinction training to recall, while placebo subjects displayed increased posterior hippocampus activation. Exploratory analyses showed a downregulation in dACC and insula following DCS administration during the early recall trials.

Fear acquisition

The data demonstrate successful fear acquisition across participants, as reflected by reduced subjective valence and increased arousal ratings towards the CS+, as well as increased differential SCRs and a pattern of activation within bilateral insula, dACC, SMA and midbrain including thalamus, key regions of the fear network [19, 35]. Activation within the TTG, known to be involved in auditory processing [40], likely reflects US anticipation, in line with previous findings using this paradigm [35]. Although animal studies demonstrate the central role of the amygdala in fear acquisition [41], we did not observe significant amygdala activation during conditioning, conforming to a recent meta-analysis of human fear conditioning [19]. Besides methodological difficulties when measuring amygdala activity [18], it has been argued that human fear conditioning experiments might not primarily engage the basic threat detection circuit, but instead recruit an extended “autonomic-interoceptive network” for threat appraisal [19]. The temporal sensitivity of this structure observed in some studies [42, 43] might further contribute to the mixed evidence.

DCS prevented the return of fear in arousal ratings

Participants receiving placebo but not DCS experienced a generalized ROF in arousal ratings, mainly driven by increases towards the CS+ from post-extinction to pre-recall. Generalization of subjective or psychophysiological CRs towards the CS− is a common phenomenon in studies investigating ROF [44,45,46], and may further represent a characteristic feature during conditioning in anxiety disorders [47]. Our result suggests that DCS facilitates extinction memory retention and thereby prevents generalized ROF in arousal ratings. This is in line with a study using an appetitive sexual conditioning paradigm [30], where post-learning DCS administration attenuated differential valence and arousal ratings as well as conditioned physiological responses but not US expectancy in a delayed combined renewal and reinstatement test. In contrast, the available laboratory studies on DCS-augmented fear extinction focusing on psychophysiological readout measures remained inconclusive: While DCS-augmented extinction learning had no effect on SCRs [27, 28] or startle responses [28] during delayed fear extinction recall, SCRs were attenuated after a reactivation procedure [29]. Moreover, DCS administration after fear acquisition and immediate extinction resulted in increased SCRs during fear recall 72 h later, in line with a facilitation of fear memory consolidation [26].

Although we observed conditioned SCRs during fear acquisition, there was no evidence for ROF in terms of increased SCRs from extinction learning to extinction recall across participants or within groups, thus obscuring the evaluation of a potential DCS effect in this measure. Floor effects, i.e. insufficient recovery of CRs during recall in the placebo group, also precluded the evaluation of DCS effects on fear extinction consolidation in the study by Kalisch et al. [26] and might have contributed to the reported null findings from studies probing simple extinction recall. The heterogeneous SCR findings might be due to the method itself, as SCRs represent a rather noisy measure and appear to habituate quickly [48]. Moreover, the scanner environment negatively affects the signal-to-noise ratio [49], making it more difficult to detect a rather transient psychophysiological ROF [46]. To clarify the effect of DCS on a psychophysiological level, future research might benefit from a multimodal approach to evaluate CRs [50] and the use of conditioning indices especially suited for neuroimaging, i.e. pupillary responses [51, 52].

DCS attenuated neural activation patterns of fear-associated brain regions

Our finding that DCS prevented the ROF in subjective arousal ratings was corroborated by significant group differences in BOLD response shifts in amygdala and posterior hippocampus from extinction learning to recall. Specifically, we observed a relative increase in differential amygdala activation in placebo compared to DCS that was mainly driven by a trendwise deactivation towards the CS+ in the DCS group. Several studies highlight the pivotal role of the amygdala in human fear conditioning, extinction learning and recall [18, 53, 54]. Higher amygdala activity has been associated with stronger fear memory reconsolidation and ROF [55, 56]. Increased amygdala activity in the placebo compared to DCS group therefore suggests a stronger recall of the original CS−US association in the placebo group, which was abolished in DCS-treated participants. Using a similar 3-day design, Ebrahimi et al. [31] previously found 50 mg of DCS during extinction learning to attenuate amygdala activity during appetitive extinction recall, while groups did not differ on a behavioral level (reaction times). Trendwise deactivation observed during extinction recall in the DCS group potentially suggests active inhibition of amygdala activity towards the CS+. While the vmPFC is a prime candidate to explain the observed group difference in amygdala activity—as vmPFC activity has been linked to the successful extinction recall and top-down control of the amygdala [23, 57,58,59]—we did not observe significant group differences in vmPFC BOLD response. Preliminary evidence for a mediating role of the vmPFC in DCS-augmented appetitive extinction learning comes from Ebrahimi et al. [31], who observed increased amygdala-vmPFC functional connectivity during CS+ compared to CS− presentations in DCS compared to placebo.

Our finding that only the placebo group showed increased activation in the posterior hippocampus is in line with studies associating ROF with increased BOLD responses in this area [60], often together with increased amygdala activity [23, 24]. In animal studies, inactivation of the hippocampus reduces the expression of the CR and prevents the ROF after extinction [61, 62]. Previous work indicates a differential role of the anterior and posterior part of the hippocampus in human fear conditioning, where activation of the anterior hippocampus has been mainly associated with extinction memory recall [22, 23], whereas the posterior hippocampus has been related to ROF phenomena [23, 26]. In the same vein, SCRs have been shown to correlate with posterior hippocampal activation during ROF after reinstatement [24]. Increased posterior hippocampal activation under placebo therefore likely reflects recall of the original fear memory, which was attenuated under DCS.

In an exploratory analysis, we focused on the first phase of extinction recall and observed relatively increased dACC and insula activation towards the CS+ in the placebo but not the DCS group. These brain regions are associated with fear acquisition, threat anticipation and CR expression [18, 63]. The dACC mediates fear responses [64], whereas the insula plays an important role in interoception and experience of subjective feelings [65, 66]. As such, robust activation of these areas during fear conditioning and recall has been ascribed the subjective experience of fearful states, possibly in terms of interoceptive awareness [20]. Our finding of heightened dACC and insula activation in placebo compared to DCS during the early recall phase might reflect stronger threat anticipation in the placebo group and therefore provides further evidence that DCS attenuates ROF during extinction recall.

Conclusions and future perspectives

Our results support the hypothesis that 50 mg of DCS administered 1 h prior to extinction learning enhances long-term extinction retention, thereby preventing ROF on subjective arousal ratings and attenuating differential BOLD responses in key structures of the fear network, including the amygdala. Our study provides evidence that fear extinction learning is an important target mechanism for the effect of DCS in healthy humans, which should be extended to patient populations with larger sample-sizes and multimodal readout measures. More experimental work is needed to elucidate potential moderating factors and boundary conditions of the effectiveness of DCS. For example, although most clinical studies used 50 mg of DCS, it remains to be shown whether this is in fact the ideal dose [13]. As anxiety disorders are characterized by amygdala hyperactivity and heightened fear generalization [67], the present findings suggest DCS could be even more effective in this population. A better understanding of the mechanisms of DCS during experimental variation of extinction learning and recall may help to identify potential moderators of its augmentation effect in patients, thus offering the possibility for patient stratification and personalized treatments.

Funding and disclosure

This study was supported by the German Federal Ministry of Education and Research (BMBF grant to AS: 01GV0612). CE and JG were supported by Elsa-Neumann scholarships and CE further received a Charité postgraduate thesis scholarship. FS was funded by the DFG (SCHL1969/4-1). AS has received research funding from the German Federal Ministry of Education and Research (BMBF), the German Research Foundation (DFG), the European Commission (FP6), the Robert-Enke-Stiftung and Lundbeck, and speaker honoraria from AstraZeneca, Boehringer Ingelheim, Bristol-Myers Squibb, Eli Lilly & Co, Lundbeck, Pfizer, Wyeth and UCB. He was a consultant for Actelion. Educational grants were given by the Stifterverband für die Deutsche Wissenschaft, the donation Seelen Bewegt, the Berlin Brandenburgische Akademie der Wissenschaften, the Boehringer Ingelheim Fonds, the Eli Lilly International Foundation, Janssen-Cilag, Pfizer and Eli Lilly & Co. He received honoraria for teaching from the Zentrum für Psychotherapie of the Humboldt University Berlin and the Zentrum für Psychologische Psychotherapie Bremen. The authors declare no competing interests.