Abstract
The hardware design of supervised learning (SL) in spiking neural network (SNN) prefers 3-terminal memristive synapses, where the third terminal is used to impose supervise signals. In this work we address this demand by fabricating graphene transistor gated through organic ferroelectrics of polyvinylidene fluoride. Through gate tuning not only is the nonvolatile and continuous change of graphene channel conductance demonstrated, but also the transition between electron-dominated and hole-dominated transport. By exploiting the adjustable bipolar characteristic, the graphene–ferroelectric transistor can be electrically reconfigured as potentiative or depressive synapse and in this way complementary synapses are realized. The complementary synapse and neuron circuit is then constructed to execute remote supervise method (ReSuMe) of SNN, and quick convergence to successful learning is found through network-level simulation when applying to a SL task of classifying 3 × 3-pixel images. The presented design of graphene–ferroelectric transistor-based complementary synapses and quantitative simulation may indicate a potential approach to hardware implementation of SL in SNN.
Similar content being viewed by others
Introduction
By mimicking the plasticity of brain, neuromorphic computing is capable of self-learning, while with revolutionary speed and energy efficiency, and is thus regarded as a promising candidate to next generation computing.1 At hardware level, it calls for materials and devices that emulate the nonvolatile modulation of synaptic strengths.2,3,4,5 Currently, two-terminal memristors made from various kinds of materials such as resistive random access memory,6 phase-change memory,7 and conducting bridge random access memory,8 etc. are intensely studied as electronic synapses for artificial neural networks.9,10,11 In a typical supervised learning (SL) task, the two-terminal memristors implement algorithms with iterative read-and-write operations: during the forward step the outputs are obtained through multiplying voltages from input neurons by the conductance of memristive synapses (read), while during the update step the conductance of memristors is delicately tuned in order to minimize the error between real outputs and the desired ones (write). In this way, the outputs are calibrated to the targets through the SL process. By further adopting structures such as 1-transistor-1-memristor12 and 1-selector-1-memristor,13,14 the network-level computing is substantially facilitated by removing the sneak paths in the synaptic arrays, and excellent performances on face recognition12 and handwritten digit classification6 have been reported.
On the other side, a spike-based computing paradigm namely spiking neural network (SNN) has emerged as the third generation neural network.15 Since it is more similar to the operation of biological brains, SNN shares the advantages of real brains, such as ultralow power consumption and larger processing capacity. From the viewpoint of hardware, 3-terminal nonvolatile transistors that accommodate direct feedback modulation to synapse weight are desired.16,17 As seen in Fig. 1a, spike signals transmitting from drain to source of the transistor mimics that from presynaptic neuron to postsynaptic one in biological systems. Once neurons in the output layer fire, the error message is generated by comparing the timings of actual outputs and desired ones. The required amounts of synapse strength modulation, represented by the channel conductance changes, are then calculated through those auxiliary modules in SNN circuits and the corresponding conductance tuning is implemented by the feedback gate voltage. In this regard, memristive transistors such as 3-terminal ferroelectric memristor17 and organic ferroelectric synapses18 have been proposed, whereas further development on the device materials, functions, and the related implementation of advanced algorithms are called for.
In order to address the above demands, we fabricate field effect transistors using graphene as the channels and P(VDF-TrFE) ferroelectric polymer as the gate dielectric (graphene–ferroelectric field effect transistor, abbreviated as GrFeFET) as seen in Fig. 1b. The graphene channel serves as the synapse connecting the pre- and postsynaptic neurons, while the gate terminal accepts supervise signals and modulates the channel conductance. Here the gate tuning to channel conductance is nonvolatile due to the remnant of ferroelectric polarization in the polyvinylidene fluoride (PVDF) layer (100 nm thick), which then emulates the plastic changes of synapse strengths.19 Figure 1c illustrates a 3-dimensional optical view of our device with false color of the source, drain, top electrodes, and graphene channel (5 μm long and 10 μm wide). Such GrFeFETs were previously explored as nonvolatile memory device,20,21 while in the current work we will demonstrate its unique potential as complementary synapses and usage in SL of SNN.
Results and discussion
Figure 2a shows the channel conductance modulated by back gate voltage G(Vbg) through 300 nm SiO2. The left and right branches denote the hole- and electron-dominated transport, respectively. The electron and hole mobility is extracted to be μ = 1.7 × 103 cm2/V s by taking the permittivity of back gate dielectric SiO2 as κSiO2 = 3.9 (estimation details are provided in the “Method” section).21 In a back-gate sweep loop from −40 to 40 V and reversely to −40 V, the hysteresis-free conductance change in graphene indicated well-suppressed interface traps in transistor. It should be noted that the back gate does not cause memorable modulation of the channel conductance therefore not suitable for artificial synapses. The physical mechanism is that the polarization states in PVDF could hardly be changed by the back-gate voltage as the applied electric field is mostly screened by the graphene channel. On the other hand, in Fig. 2b, an obvious hysteresis window appears during the sweep of top gate voltage (Vtg) between −20 and 20 V due to the tuning of polarization in PVDF. It is known that the dielectric polarization of PVDF ferroelectric by gate voltage exhibits both the linear response that is proportional to external polarization field and a nonlinear component known as residual polarization when the external field is removed.21 Depending on the upward or downward polarization in PVDF, the graphene channel was hole doped or electron doped, causing a nonvolatile shift of Dirac point. We further extract p(E) curve of PVDF from Fig. 2b by adopting the theoretical description of ferroelectric FET provided in the “Method” section, and compare it with the direct electric displacement measurement of PVDF film with the same thickness alone, D’(E). As shown in Fig. 2c, D and D’ have very similar coercive fields EC ~ 50 MV/m, which are consistent with values previously reported on PVDF.20 Moreover, the capacitance–voltage(C–V) relationship measured from Au/PVDF/Al structure also indicates coercive fields EC ~ ±50 MV/m as shown in Fig. 2d. These quantitative agreements strongly suggest that the hysteresis observed in the transport measurements is indeed caused by the hysteretic polarization of the ferroelectric gate dielectric. The residual polarization is estimated to be ~ ±1 μC/cm2. It is worth reminding that compared with other tuning approaches of graphene channel conductance,22,23 the nonvolatile nature of ferroelectric polarization causes persistent and memorable modulation of the graphene channel conductance, thus providing a feasible approach toward a 3-terminal synapse.
Figure 3a illustrates the basic principle of tuning graphene channel to be hole- or electron-conduction through the different polarization of ferroelectric layer. A negative/positive gate voltage with large amplitude over the coercive voltage will result in upward/downward polarization of the PVDF dielectric, which causes hole/electron doping of graphene channel, respectively. From the viewpoint of energy band filling, the graphene channel becomes p- or n-type conduction depending on the polarization direction of PVDF dielectric, as indicated by the right subfigures shown in Fig. 3a. Moreover, given the same positive voltage pulses, while with small magnitudes (+15 V in Fig. 3b and +8 V in Fig. 3c), the upward (downward) PVDF polarization will be decreased (increased), while the Fermi levels will shift upward in both p- and n-type graphene channels. However, the ascending of Fermi levels in p- and n-type graphene channel will lead to different results: for the former it is a reduction of the hole density, while for the latter an enhancement of electron density near the Fermi level. In other words, the conductance tuning of GrFeFET based synapses can be depressive or potentiative depending on the initial status of graphene energy band filling and this filling is adjustable through opposite polarization of gate ferroelectric. It explains why given similar small amplitudes of positive voltage pulses on the gate, the variation trends of the measured p- and n-type graphene channel conductance become opposite as shown in Fig. 3b, c. Similar analysis can be conducted on the measurement of negative voltage pulses imposed to the gates of p- and n-type GrFeFET (−10 V in Fig. 3b and −6 V in Fig. 3c). In this way, the analog weight update of GrFeFET synapse is successfully realized as plotted in Fig. 3b, c. Compared with previous reports,2,12 a distinct feature here is that in our GrFeFET synapse the synaptic weight update can be switched to be potentiative or depressive depending on the conduction type of channel given SET/RESET voltage pulses.
Here we point out that the synapses with positive/negative changes of weights (∆w > 0 or ∆w < 0) under ordinary SET pulses should be defined as potentiative/depressive rather than as excitatory/inhibitive,22 since in neuroscience excitatory/inhibitive synapses mean positive/negative weights (w > 0 or w < 0).24 We further stress that the realization of both potentiative and depressive, i.e., complementary synapses, with the same material and device structures is credited to the unique characteristic of zero bandgap of graphene. Although it is usually regarded as a negative factor when trying to manage power consumption in the related devices, here the zero-bandgap feature plays a constructive role in achieving the complementary synapses. Only with this ultrasmall bandgap could a practical gate voltage tune the transition between electron- and hole-dominated conduction through ferroelectric polarization. As further illustrated in Fig. 3b and c, the electron or hole filling of graphene channel will be enhanced or reduced oppositely given similar variation of PVDF polarization field caused by gate tuning. In this way, the analog weight update of the corresponding synapse can be potentiative or depressive when imposing similar programming gate voltages. Just as the importance of complementary metal-oxide-semiconductor field effect transistors in integrated circuit design, the demonstrated GrFeFET-based complementary synapses may find promising usage in the future hardware neural network design.
We further evaluate the nonideal factors of the demonstrated analog weight update of GrFeFET synapse by formulas provided in “Method” section and the obtained quantities are listed in Table 1. They are at the same levels with those recently reported in other kinds of ferroelectric synapses.25 A conventional convolution neural network (CNN) is then set up where both the convolutional kernels and the connections in the fully connected layers are implemented with the GrFeFET synapses. By taking these nonideal parameters into account, the simulation yields a recognition rate of 94%, when implementing MNIST tasks (details are provided in Section “GrFeFET synapse in CNN for MNIST recognition” of Supplementary Information).
In order to implement SL of SNN, we further measure the conductance tuning of GrFeFET under different amplitudes of gate voltages and initial channel conductance ∆G(G0, V), and results are plotted in Fig. 3d, e. The directions of conductance tuning become opposite in p- and n-type GrFeFET synapses, just as expected. Moreover, saturation behaviors are observed when trying to further increase G in the presence of already large G0 or to decrease G given small G0. It is largely ascribed to the saturation of ferroelectric polarization under the applied voltage, as seen in Fig. 2. Note that the widths of imposed voltage pulses in Fig. 3d, e are different from those already shown in Fig. 3b and c, since they are set for different computing schemes of SL. The latter has been applied to level-based computing as seen in Section “GrFeFET synapse in CNN for MNIST recognition” of Supplementary Information, while the former will be used for spike-based computing26 (detailed discussions are provided in Section “Selection of write pulse widths when training with different computing schemes” of Supplementary Information).
Besides, it is worth reminding that the potentiative and depressive behaviors of the device are not symmetric. The figure illustrates that in the hole-conduction dominated depressive region (Fig. 3b, d) the conductance and its tuning range are about two times larger than those in the electron dominated potentiative one (Fig. 3c, e). It is ascribed to the fact that graphene material is usually p-doped in the natural environment. For depressive region, it is straightforward to tune the hole conduction. On the other hand, for potentiative counterpart, first of all, a top gate voltage with quite large amplitude (~16 V) has to be imposed to induce sufficient change of PVDF ferroelectric polarization so that electrons are attracted to, while holes are repelled from the graphene channel. After the transition of graphene channel from hole conduction to electron one, additional voltage pulses are applied to further modulate the PVDF polarization and hence induce continuous conductance changes. However, saturation of PVDF ferroelectric polarization is easily met in this situation since it has already been changed significantly during the hole-to-electron transition. Such asymmetry between hole and electron conductance tuning has also been reported in other GrFeFET devices.22,27 As we will see, it poses challenge to the implementation of our complementary synapses in SNN tasks.
Figure 3 further demonstrates that a conductance ON/OFF ratio about 3.2 has been realized in GrFeFET device. When using as synapses, the corresponding relative weight change ∆w/w0 is about 220%, which is significantly larger than other graphene-channel based synaptic devices, where ∆w/w0 ≈ 12.5%22 or 35%.28 Although it benefits the learning efficiency of neural network greatly,26 the relatively large conductance ON/OFF ratio is at the expense of using write voltage pulses with widths about hundreds or tens of milliseconds. Physically, it is ascribed to the remarkable OFF state conductance gOFF caused by the zero bandgap and p-dope nature of graphene. Comparing with other memristive materials with quite small gOFF,18 here a large ON state conductance gON is required to obtain the target ratio gON/gOFF in the graphene-channel devices. In order to achieve the large conductance tuning Δg = gON − gOFF, the gate voltages with amplitudes or widths several orders larger are necessitated. It explains the 6-order slower operation speed comparing with those found in the fastest memristive synapses.12,29 We remind that this is a common problem met by graphene-based synaptic devices,18,22 rather than a specific issue raised by the design of complementary synapses in this work. In order to promote the operation speed, a compromise has to be made with the conductance ON/OFF ratio and further strategies are called for in the future research.
In the following, by exploiting the above conductance tuning properties of GrFeFET, an approach of using complementary GrFeFETs as synapses to implement the ReSuMe30 is proposed as illustrated in Fig. 4a. Here ReSuMe is a widely used strategy to realize SL in SNN, since it does not resort to the conventional stochastic gradient descent method as employed by Spike Back Propagation (SpikeProp)17,31 or the Widrow–Hoff rule as by Spike Pattern Association Neuron .32,33 Instead, it uses window function to drive the spike timings of output neuron to the desire ones as follows30
where \(S_x\left( t \right) = \mathop {\sum}\nolimits_f {\delta (t - t_x^{(f)})}\) is the spike sequence of the supervise (desire), output or input neuron with subscript x = d, o, or in. Here f characterizes the f th spike emitted by the x neuron, a is a bias term, and W(τ) is the window function to convolute with the input. Compared with other SL methods, ReSuMe has several prominent advantages including that it is capable of learning spike sequence rather than single spikes and it is applicable to various types of neuron models. Therefore in the work, we choose to realize ReSuMe based on our complementary GrFeFETs. Figure 4a shows that the source and drain terminals of the two parallel connected GrFeFETs are for receiving spikes from the presynaptic neuron and transmitting them to the postsynaptic neuron, respectively. The gate terminals are for imposing the supervise signals to adjust the channel conductance. As illustrated in Fig. 4a, an input spike triggers a decaying waveform by convoluting the input with window function in the supervise circuit (the \(\mathop {\int }\nolimits_0^\infty W(\tau )S_{{\mathrm{in}}}(t - \tau )d\tau\) term in the above equation, as represented by the decaying waveform inside the square in the lower left corner of the circuit). The teach and output pulse signals will sample the waveform respectively according to their different timings, as depicted in the lower middle part of the supervise circuit. The resulted voltages are then fed to the gate terminals of the complementary GrFeFET synapses, respectively. Here the upper GrFeFET is a potentiative synapse, of which the weight is modified through the voltage sampled by teach signal, while the lower device is a depressive one tuned through voltage sampled by the output signal. Mathematically, the former implements \(+ S_{\mathrm{d}}\left( t \right)\mathop {\int }\nolimits_0^\infty W(\tau )S_{{\mathrm{in}}}(t - \tau )d\tau\) term, while the latter does \(- S_{\mathrm{o}}\left( t \right)\mathop {\int }\nolimits_0^\infty W(\tau )S_{{\mathrm{in}}}(t - \tau )d\tau\) in the above ReSuMe expression. Note that the positive and negative signs before Sd(t) and So(t) are realized through the potentiative and depressive properties of the two complementary synapses: given the positive voltages sampled by teach and output signals, the conductance of the upper GrFeFET is enhanced, while that of the latter is reduced. The time chart of Fig. 4b shows an example, where the first-round output fires earlier than the desired (tout < td). In this case, the amplitude of sampled voltage by the output signal is greater than that by the teach signal (c1 > c2 > 0). As results, the magnitude of conductance decrease of the lower depressive GrFeFET is greater than that of increase of the upper potentiative one. Therefore, the summing conductance of the two devices in parallel gets decreased and consequently the second-round output fires later. In this way the output timing tout approaches the desired one td round-by-round. For the other case, where the initial output timing is later than the desired (tout > td), similar SL is implemented by using this circuit. The above demonstration of working principle indicates that the key requirement on device properties is the symmetry of conductance tuning between the potentiative and depressive GrFeFETs. Without this symmetry, the training cannot get convergent. The mechanism is that assuming the same timings of td and tout, two voltages with the same amplitudes while with opposite signs will be sampled as seen in Fig. 4b; however, different magnitudes of conductance tuning will then be induced by these two mirror voltages in the two asymmetric devices. In this situation, the summing conductance will continue to change while the ReSuMe algorithm demands no more modulation of the synaptic weight. In the real devices, as seen previously in Fig. 3b–e the symmetry between n-doped (potentiative) and p-doped (depressive) conduction of one device is quite difficult to obtain. Therefore, in this task two devices are employed and their conduction behaviors have been delicately tuned to be highly symmetric as illustrated in Fig. 4c. The blue and red curves represent the cycle-to-cycle conductance tuning of two GrFeFETs under a series of 50 positive top-gate voltage pulses followed by 50 negative ones. The stimulated conductance changes in the two devices are almost equal while opposite in directions, indicating nice symmetric electrical properties between the two GrFeFETs (The measured conductance tuning as functions of gate voltage amplitudes and the initial conductance for the two devices ΔG(Vtg, G0) are further demonstrated in Section “Conductance tuning as functions of gate voltage amplitudes and the initial conductance for two complementary GrFeFETs” of the Supplementary Information for interested readers). Figure 4d shows the converging processes of our ReSuMe circuit by using the electrical properties of the above two complementary devices where the first-round output timing tout = 14 ms, while that of desire td varies from 13 to 15 ms. Here the convergence is defined as that the relative difference between the final output timing and the desired one |tout − td|/|td| is <1%. The figure indicates that convergence will be achieved with about 50 iterations at most.
Comparing with conventional approaches, in which single devices are used as synapses for SL in SNN,17 the major advantages here are the greatly reduced auxiliary circuit and the simplified operations. As seen in ref. 17 for conventional approach not only were at circuit module level the design of neuron and synapse circuits quite complicated (e.g., three waveforms with different amplitudes and durations had to be designed as a set of output spikes of neurons), but also at network level quite a few other types of circuit modules such as error detectors and analog adders were needed. On the other side, here by using complementary GrFeFET synapses both the circuit and operation have been greatly simplified as seen in Fig. 4a, b. The chip area efficiency would be drastically promoted while the power consumption would get substantially reduced due to the simplification. This improvement can also be found by comparing the present approach with that using single GrFeFET as synapse to implement ReSuMe (the latter is presented in Section “The approach of using single GrFeFET as synapse (S-approach) to ReSuMe” of Supplementary Information for interested readers).
In order to check the network-level performance of the above GrFeFET synapse, we design a SNN to implement the standard classification task of 3 × 3-pixel z, v, and n images and test through simulation as shown in Fig. 4e, f. The black-and-white images are encoded by pulses of nine input neurons, while the different timings of the output neuron infer which images are inputted17 (design, simulation results, and comparison of different encoding approaches are discussed are provided in Section “ReSuMe to MNIST recognition with GrFeFET synapses” of Supplementary Information). SL is then conducted through the GrFeFET-based complementary synapses as designed in Fig. 4a, where the network parameters and simulation details are provided in the “Method” section. As indicated by Fig. 4f, with <15 epochs of training satisfactory convergence is achieved for the three input patterns. The demonstrated capability of quick and accurate learning is ascribed to both the power of ReSuMe algorithm and the hardware implementation by our complementary GrFeFET synapses.
Finally, the figures of merits of our GrFeFET synapses are listed as follows: the energy consumption of each synaptic weight update operation is about 50 nJ (estimation method is provided in the “Method” section), the time step is about 50 ms and the effective area per synapse is about 50 μm2. By analyzing these performance indexes of GrFeFET synapses and the network level simulation results, we conclude that the major advantages of using graphene as channel material are the high mobility and large conductance ON/OFF ratio that are gained through optimizing the fabrication process in our experiments. The high mobility facilitates the signal transmission through synapse and thus helps reduce the power-delay product, while the large ON/OFF ratio greatly promotes the learning capacity at the network level. However, the asymmetry between potentiative and depressive synapses as surveyed previously is outlined as one major restriction of using GrFeFET as complementary synapses. As analyzed before, this nonideal factor is a by-product of the p-dope nature of graphene material. By further improving the fabrication process the p-dope problem can be alleviated. Other strategies include trials with 2-dimensional (2D) transition metal dichalcogenide (TMD) ferroelectric devices as complementary synapses for SNN design, since several types of 2D TMD have both modest bandgaps and bipolar conduction properties.34,35
In summary, compared with conventional 2-terminal memristors we have found 3-terminal nonvolatile transistor appropriate to implement the synaptic plasticity required by the SL tasks in SNN, where the source/drain terminals are for transmitting spike signals from presynaptic neuron to the post one, while the gate terminal is used to impose the supervise signals. Based on the fabricated graphene–ferroelectric transistor and the measured nonvolatile and continuous tuning of channel conductance by gate voltages, we have realized complementary synapses. In these synapses, the analog weight update can be positive or negative depending on hole or electron dominance within the graphene channels. It is physically ascribed to the zero bandgap characteristic of graphene, while can be utilized to reconfigure the synapse to be potentiative or depressive. Interestingly, successful transition between these potentiative and depressive synapses have been achieved through large amplitude gate modulation of the ferroelectric polarization. The synapses have been further applied to implement remote supervise method in SNN. Through system-level simulation, we have further verified that the constructed synapses and SNN can perform classification of 3 × 3-pixel images after tens of iterations of training. In the future, concerning the proposed complementary synapses we will try to develop more complicated functions such as using the two complementary synapses to implement spike timing dependent plasticity (STDP) and anti-STDP, respectively, and dynamically interchange them, and hopefully a number of hardware architectures and the associated designs of neuromorphic computing will be accomplished.
Methods
GrFeFET fabrication
First, the source and drain electrode regions (with 5 μm between them) were patterned by UV lithography on SiO2/p-Si (300 nm/500 um) substrate. Cr/Au (10/50 nm) electrodes were deposited through e-beam evaporation followed by a lift-off process. Commercial single layer graphene grown on copper foil with PMMA support layer was wet transferred onto the as-prepared electrodes. After removing PMMA layer by acetone, a photoresist bar with 10 μm wide was defined on the graphene between source and drain electrodes with UV lithography. The graphene within the uncovered region was removed by reactive-ion etching. By removing the residue photoresist with acetone, graphene channel with 5 μm × 10 μm size was fabricated. Note that the channel fabrication process should be finished in <2 h in order to reduce photoresist residue contamination as much as possible. After that, the sample was transferred in a glove box with argon atmosphere for top gate dielectric layer fabrication. PVDF-Trfe solution (70/30 mol%, dissolved in dimethylformamide with 3 wt%) was spin coated on graphene with film thickness of ∼70 nm. Sample was annealed at 115 °C for 10 min to evaporate the solvent followed by 4 h of further annealing at 135 °C to enhance the crystallinity of the organic ferroelectric film. The aluminum electrode was then fabricated as top gate through UV lithography patterning, e-beam evaporation, and 4-h immersing in isopropanol as lift off. The fabricated device was characterized by optical profiler Olympus LEXT OLS5000 Industrial Laser Confocal Microscopes as demonstrated in Fig. 1c.
Measurement
The polarization versus electric field (P-E) curves of a ferroelectric capacitor with gold (Au) bottom electrode and chromium (Cr) top electrode, and thickness of ~100 nm was measured by using Radiant Inc circuit. The capacitance–voltage (C–V) relationship obtained from Au/PVDF/Al structure, where PVDF is 100 nm was measured by using a B1500A parameter analyzer at 10 kHz applied voltage frequency. The other measurements were performed using a Keithley 4200A-SCS parameter analyzer. All channel conductance was collected by compelling a DC bias (0.1 V) between source and drain.
Model
The carrier concentrations (electrons or holes) in the graphene channel ntotal is estimated by21
n0 is the residual carrier concentration characterizing the density of carriers at the minimum conductivity, i.e., at Dirac point. n(Vtg) is the carrier concentration (electrons or holes) induced by the top gate voltage, measuring the Fermi level modulated away from the Dirac point. The total device resistance Rtotal is:
where Rcontact is the metal/graphene contact resistance, L and W are the length and width of graphene channel, and μ is the charge carrier mobility. The continuity of electric displacement field D at the PVDF/graphene interface then gives rise to the following equation21:
where ε0 = 8.854 × 10−12 F/m is the vacuum dielectric constant, εr = 10 is the dielectric constant of PVDF, and EPVDF is the electric field within PVDF. The item ε0εrEPVDF represents the linear component in the dielectric response of the ferroelectrics, which is the common property in most dielectric, while P(Vtg) is the hysteretic component. Two additional equations concerning the capacitive effect of gate dielectric and the electrostatics are as below:
where C is the capacitance of PVDF gate dielectric and VDirac is the Dirac point of graphene material. By combining the above equations, the relation between the measured conductance G and the imposed top gate voltage Vtg is derived as
By fitting this model to the measured top-gate transfer curves shown in Fig. 2a, parameters such as Rcontact and μ are obtained as Rcontact ≈ 600 Ω and μ ≈ 1.7 × 103 cm2/V s. Here we remind that the values of μ fitted from the left and right branches in Fig. 2a are almost the same.
The electric boundary condition at the interface between the gate dielectric PVDF and the graphene channel is described by the following equation:
By solving the above equation, D(E) is then extracted from the measured G(Vtg) shown in Fig. 2b.
Estimation of nonideal factors
The analog weight update behaviors of GrFeFET synapses shown in Fig. 3c, d are usually measured with the following nonideal factors. One is the nonlinearity factor α of long-term potentiation (LTP) and depression (LTD) processes36:
where Gmax and Gmin are the maximum and minimum conductance, Pm is the maximum number of pulses required to tune the conductance from Gmin to Gmax, while A and B are fitting parameters.
Another is the asymmetry β between LTP and LTD36:
The third is the cycle-to-cycle variation σ37:
where σ is the standard deviation of the conductance at different cycles obtained from the experiment, N(σ) is the normal distribution of the variation, n is the number of pulses to be applied for each update, and Gideal is the conductance when no variation is introduced.
Simulation
The synapse, neuron, and supervise circuits to execute ReSuMe are built with MATLAB Simulink. The parameters are listed in the following Table 2:
Figure 5 demonstrates the flowchart for implementing 3 × 3 pixel z, v, and n classification is as follows:
Estimation of energy consumption
The energy consumed per update operation of synaptic weight is calculated by Eup ≈ VtgItgΔt, where Vtg and Δt are the amplitude and duration of the imposed top gate voltage pulses, and Itg is the measured leaky current through the gate terminal during the update operation. In our measurement, Itg is found to be 100 nA.
Data availability
The authors confirm that the data supporting the findings of this study are available within the article.
References
Mead, C. Neuromorphic electronic systems. Proc. IEEE 78, 1629–1636 (1990).
Jo, S. H. et al. Nanoscale memristor device as synapse in neuromorphic systems. Nano Lett. 10, 1297–1301 (2010).
Yu, S. et al. A low energy oxide-based electronic synaptic device for neuromorphic visual systems with tolerance to device variation. Adv. Mater. 25, 1774–1779 (2013).
Duygu, K., Shimeng, Y. & Wong, H. S. P. Synaptic electronics: materials, devices and applications. Nanotechnology 24, 382001 (2013).
Yang, J. J., Strukov, D. B. & Stewart, D. R. Memristive devices for computing. Nat. Nanotechnol. 8, 13 (2012).
Ambrogio, S. et al. Novel RRAM-enabled 1T1R synapse capable of low-power STDP via burst-mode communication and real-time unsupervised machine learning. In Proc. 2017 IEEE Symposium on VLSI Technology 1–2. https://doi.org/10.1109/VLSIT.2016.7573432 (2016).
Kuzum, D., Jeyasingh, R. G. D., Lee, B. & Wong, H. S. P. Nanoelectronic programmable synapses based on phase change materials for brain-inspired computing. Nano Lett. 12, 2179–2186 (2012).
Suri, M. et al. Bio-inspired stochastic computing using binary CBRAM synapses. IEEE Trans. Electron Devices 60, 2402–2409 (2013).
Park, S. et al. RRAM-based synapse for neuromorphic system with pattern recognition function. In Proc. 2012 International Electron Devices Meeting. 10.12.11–10.12.14. https://doi.org/10.1109/IEDM.2012.6479016 (2012).
Wang, Z. et al. Fully memristive neural networks for pattern classification with unsupervised learning. Nat. Electron. 1, 137–145 (2018).
Burr, G. W. et al. Experimental demonstration and tolerancing of a large-scale neural network (165,000 synapses), using phase-change memory as the synaptic weight element. In Proc. 2017 IEEE International Electron Devices Meeting. 29.25.21–29.25.24. https://doi.org/10.1109/IEDM.2014.7047135 (2014).
Yao, P. et al. Face classification using electronic synapses. Nat. Commun. 8, 15199 (2017).
Hu, M. et al. Memristor-based analog computation and neural network classification with a dot product engine. Adv. Mater. 30, 1705914 (2018).
Kim, T., Kim, H., Kim, J. & Kim, J. Input voltage mapping optimized for resistive memory-based deep neural network hardware. IEEE Electron Device Lett. 38, 1228–1231 (2017).
Maass, W. Networks of spiking neurons: the third generation of neural network models. Neural Netw. 10, 1659–1671 (1997).
Nishitani, Y., Kaneko, Y., Ueda, M., Morie, T. & Fujii, E. Three-terminal ferroelectric synapse device with concurrent learning function for artificial neural networks. J. Appl. Phys. 111, 124108 (2012).
Nishitani, Y., Kaneko, Y. & Ueda, M. Supervised learning using spike-timing-dependent plasticity of memristive synapses. IEEE Trans. Neural Netw. Learn. Syst. 26, 2999–3008 (2015).
Tian, B. et al. A robust artificial synapse based on organic ferroelectric olymer. Adv. Electron. Mater. 5, 1800600 (2019).
Yang, Y. et al. Multifunctional nanoionic devices enabling simultaneous heterosynaptic plasticity and efficient in-memory boolean logic. Adv. Electron. Mater. 3, 1700032 (2017).
Zheng, Y. et al. Gate-controlled nonvolatile graphene-ferroelectric memory. Appl. Phys. Lett. 94, 163505 (2009).
Zheng, Y. et al. Graphene field-effect transistors with ferroelectric gating. Phys. Rev. Lett. 105, 166602 (2010).
Tian, H. et al. Graphene dynamic synapse with modulatable plasticity. Nano Lett. 15, 8013–8019 (2015).
Tian, H. et al. A novel artificial synapse with dual modes using bilayer graphene as the bottom electrode. Nanoscale 9, 9275–9283 (2017).
Gerstner, W., Kistler, W. M., Naud, R. & Paninski, L. Neuronal Dynamics: From Single Neurons to Networks and Models of Cognition (Cambridge University Press, 2014).
Jerry, M. et al. Ferroelectric FET analog synapse for acceleration of deep neural network training. In Proc. 2017 IEEE International Electron Devices Meeting (IEDM). 6.2.1–6.2.4. https://doi.org/10.1109/IEDM.2017.8268338 (2017).
Yang, J. J. Memristor crossbar arrays for analog and neuromorphic computing. https://apps.dtic.mil/docs/citations/AD1061408 (2018).
Raghavan, S. et al. Long-term retention in organic ferroelectric-graphene memories. Appl. Phys. Lett. 100, 023507 (2012).
Yao, Y. et al. Reconfigurable artificial synapses between excitatory and inhibitory modes based on single-gate graphene transistors. Adv. Electron. Mater. 0, 1800887 (2019).
Boyn, S. et al. Learning through ferroelectric domain dynamics in solid-state synapses. Nat. Commun. 8, 14736 (2017).
Ponulak, F. & Kasinski, A. J. Supervised learning in spiking neural networks with ReSuMe: sequence learning, classification, and spike shifting. Neural Comput. 22, 467–510 (2010).
Bohte, S. M., Kok, J. N. & La Poutré, H. Error-backpropagation in temporally encoded networks of spiking neurons. Neurocomputing 48, 17–37 (2002).
Mohemmed, A., Schliebs, S., Matsuda, S. & Kasabov, N. Span: spike pattern association neuron for learning spatio-temporal spike patterns. Int. J. Neural Syst. 22, 1250012 (2012).
Mohemmed, A., Schliebs, S., Matsuda, S. & Kasabov, N. Training spiking neural networks to associate spatio-temporal input–output spike patterns. Neurocomputing 107, 3–10 (2013).
Agnihotri, P., Dhakras, P. & Lee, J. U. Bipolar junction transistors in two-dimensional WSe2 with large current and photocurrent gains. Nano Lett. 16, 4355–4360 (2016).
Rasmussen, F. A. & Thygesen, K. S. Computational 2D materials database: electronic structure of transition-metal dichalcogenides and oxides. J. Phys. Chem. C. 119, 13169–13183 (2015).
Chen, P. et al. Mitigating effects of non-ideal synaptic device characteristics for on-chip learning. In Proc. 2015 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) 194–199. https://doi.org/10.1109/ICCAD.2015.7372570 (2015).
Chen, P., Peng, X. & Yu, S. NeuroSim+: an integrated device-to-algorithm framework for benchmarking synaptic devices and array architectures. In Proc. 2017 IEEE International Electron Devices Meeting (IEDM) 6.1.1–6.1.4. https://doi.org/10.1109/IEDM.2017.8268337 (2017).
Acknowledgements
This work was supported by the National Natural Science Foundation of China under Grant Nos. 61841404, 51732003, and by Hubei Engineering Research Center on Microelectronics.
Author information
Authors and Affiliations
Contributions
Y.C., Y.Z., and F.Z. made the same contributions; Y.H. and F.Z. conceived the idea; Y.C. fabricated the devices; Y.Z. performed the electrical measurement, circuit design, and simulation; Y.H., F.Z., B.T., and Y.L. conducted the analysis; M.Y. helped with the measurement; Y.H. and F.Z. wrote and revised the paper; X.S.M. supervised and supported the whole work.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chen, Y., Zhou, Y., Zhuge, F. et al. Graphene–ferroelectric transistors as complementary synapses for supervised learning in spiking neural network. npj 2D Mater Appl 3, 31 (2019). https://doi.org/10.1038/s41699-019-0114-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41699-019-0114-6
This article is cited by
-
The Roadmap of 2D Materials and Devices Toward Chips
Nano-Micro Letters (2024)
-
Unsupervised character recognition with graphene memristive synapses
Neural Computing and Applications (2024)
-
Ferroelectric gating of two-dimensional semiconductors for the integration of steep-slope logic and neuromorphic devices
Nature Electronics (2023)
-
Recent progress in ferroelectric synapses and their applications
Science China Materials (2023)
-
Wind speed forecasting at wind farm locations with an unique hybrid PSO-ALO based modified spiking neural network
Energy Systems (2023)