1 Introduction

The recent transition of NAND Flash arrays from planar to 3-dimensional (3-D) [1, 2] has boosted interest towards deca-nanometer polysilicon-channel MOSFETs. The mainstream technological solution to vertically stack many layers of memory cells in 3-D NAND Flash arrays, in fact, is the so-called punch-and-plug process and allows to create strings of cylindrical gate-all-around (GAA) memory transistors in series connection along a silicon channel with polycrystalline structure [3,4,5,6]. The presence of grains separated by highly defective grain boundaries in the polysilicon channel of the strings introduces some relevant issues in the operation and in the reliability of the array. Among them, it is worth mentioning (i) a severe limitation to the string current, especially at low temperature [7]; (ii) transient instabilities in the string current, due to the dependence of the average occupancy of the traps at the grain boundaries on the bias history of the string [8,9,10]; (iii) variability in the cell threshold voltage (\(V_{\mathrm{T}}\)) and in its temperature dependence, due to the haphazardness in the configuration of the polysilicon grains [11, 12]; (iv) the worsening of the amplitude statistics of random telegraph noise (RTN) affecting cell \(V_{\mathrm{T}}\), due to the contribution to percolative channel conduction of the nonuniform inversion of the intra-grain and inter-grain regions [13,14,15,16,17,18,19]. The adoption of a thin annular channel instead of a full nanowire was demonstrated to be a successful solution to mitigate some of these issues [20, 21]. In that case, which is commonly referred to as macaroni channel, the inner part of the annular region is filled up with silicon oxide and the thickness of the polysilicon region is decreased down to the \(10 \div 20\) nm range, taking benefits from the thin-body device electrostatics.

In order to optimize the performance and the reliability of 3-D NAND Flash arrays from the standpoint of polysilicon-related issues, accurate models able to reproduce the polysilicon physics in deca-nanometer MOSFETs are mandatory. In this regard, it is worth mentioning that quite different approaches have been proposed in the literature to address polysilicon-channel transistors. In the simplest case, polysilicon is treated under the effective medium approximation, i.e., by neglecting its granular nature and introducing a spatially uniform distribution of traps in the volume of the material [22,23,24,25]. This approach is assumed to be valid when the nonuniformities in device electrostatics arising from the localized nature of the traps at the grain boundaries are negligible. In a more physics-based approach, the polysilicon channel is considered as the haphazard ensemble of monocrystalline grains separated by highly defective grain boundaries. In this latter case, pure drift-diffusion transport [26,27,28] or a mix of intra-grain drift-diffusion and inter-grain thermionic emission have been considered [12, 18, 29, 30]. In the presence of variety of approaches, a clear assessment of the impact of different modeling assumptions on the results for the electrical characteristics of the devices has not been provided so far. In this paper, which extends our preliminary work presented in [31], we provide such an assessment, focusing on both the average value and the variability in the electrical characteristics of polysilicon-channel GAA nanowire and macaroni MOSFETs.

Fig. 1
figure 1

Schematic cross section of the 3-D polysilicon-channel cylindrical GAA devices investigated in this work: a single-gate structure used in Sect. 3.1 to address the average polysilicon current and b three-gate structure used in Sect. 3.2 to address current variability. The color scale represents the electron concentration in the polysilicon channel at \(V_{\mathrm{G}}=1.5\) V

2 Device structures and models

Figure 1 shows a cross section of the two 3-D polysilicon-channel cylindrical GAA devices investigated in this work. Both devices feature a channel outer radius \(r_{\mathrm{c}}=25\) nm, a silicon dioxide gate insulator with thickness \(t_{\mathrm{ox}}=12.5\) nm and a metal gate (work function \(\phi _m=4.5\) eV). The inner radius of the channel, corresponding to the radius \(r_\mathrm{f}\) of the filler oxide, was varied between 0 (nanowire channel) and 20 nm. In (a), a single and very long gate region (length \(L_{\mathrm{G}}=950\) nm) was introduced to drive channel inversion. The long gate length allows not only to avoid any short-channel effect in device operation but also to average along the channel direction the nonuniformities coming from the haphazardness in the configuration of the polysilicon grains. Note that this structure can be considered representative of a 3-D NAND string with all of its word lines driven by the same voltage. In case (b) of Fig. 1, instead, channel inversion is driven by three gates with length equal to 100 nm, 30 nm and 100 nm, with spacing regions with length \(L_\mathrm{s}=30\) nm in between them. By biasing the lateral gates with a pass voltage \(V_\mathrm{pass}\) and sweeping the voltage applied to the central gate, this structure can be considered representative of a 3-D NAND string under read conditions and allows to investigate the variability in the read outcome due to the random configuration of polysilicon grains in the channel. Both in cases (a) and (b) of Fig. 1, the channel was doped with a donor concentration \(N_{\mathrm{D}}\) equal to \(10^{15}\) \(\hbox {cm}^{-3}\) in the inner regions of the string and \(10^{20}\) \(\hbox {cm}^{-3}\) beyond the upper and lower gate edges.

Fig. 2
figure 2

Volumetric acceptor trap density assumed when modeling polysilicon under the effective medium approximation [23]. The distribution features a double-exponential behavior in energy, with tail and deep states (see the formulas in the inset). \(E_{\mathrm{C}}\) is the conduction band edge, E is energy

The device structures shown in Fig. 1 were implemented in a commercial device simulator [32]. In order to reproduce the polycrystalline nature of the silicon channel, two different approaches were used. In the first, the channel was considered as monocrystalline silicon with the volumetric acceptor trap density adopted in [23] and is reported in Fig. 2. This latter density was uniformly introduced in the channel volume to account for the high defect density that is typically present in polysilicon, neglecting its localization at the grain boundaries of the material (effective medium approximation). In the second approach, the channel was divided into Voronoi volumes of average size \(\phi _g=40\) nm separated by surfaces representing the polysilicon grain boundaries [11, 12, 30]. At these surfaces, a planar acceptor trap density calibrated to give the same total number of traps in the channel resulting from the effective medium approximation was introduced.

The device drain current versus gate voltage (\(I_{\mathrm{D}}\)\(V_{\mathrm{G}}\)) transcharacteristic was simulated for a drain voltage \(V_{\mathrm{D}}=0.5\) V, assuming pure drift-diffusion transport (effective medium approximation and Voronoi grains) or mixed intra-grain drift-diffusion and inter-grain thermionic emission (Voronoi grains only, see [11, 12] for details). A constant electron mobility \(\mu _n=100\) \(\hbox {cm}^2\)/Vs [24, 25] and a Richardson velocity \(v_r=2\sqrt{kT/2\pi m_n}\) [33] were adopted, respectively, for the drift-diffusion process and for thermionic emission. As a final remark, note that quantum corrections to the electrostatics of the simulated devices were verified to have a negligible impact on the results presented in this work. For the sake of simplicity, then, the results presented here will not include these corrections.

Fig. 3
figure 3

Simulated \(I_{\mathrm{D}}\)\(V_{\mathrm{G}}\) curve of the device in Fig. 1a, as resulting from the different polysilicon models described in Sect. 2, at RT and for different filler oxide radii \(r_\mathrm{f}\). Current is shown on a a logarithmic and b linear scale

Fig. 4
figure 4

Effective electron mobility extracted from the simulated \(I_{\mathrm{D}}\)\(V_{\mathrm{G}}\) curve of the device in Fig. 1a in the on-state, for different polysilicon models, different temperatures and as a function of \(r_\mathrm{f}\)

3 Simulation results

3.1 Average polysilicon current

Figure 3 shows the simulated \(I_{\mathrm{D}}\)\(V_{\mathrm{G}}\) curve of the device in Fig. 1a as resulting from different polysilicon models described in the previous section, at room temperature (RT) and for different filler oxide radii \(r_\mathrm{f}\). Similarly to what shown in [31] for nanowire channels, relevant discrepancies appear among the predictions of different polysilicon models in the on-state regime, while negligible mismatches among the models are present in the subthreshold regime. This can be explained by considering that in the subthreshold regime the low free-electron concentration in the intra-grain regions and the low trapped-electron concentration at the grain boundaries give rise to weak nonuniformities in channel electrostatics. This makes the effective medium approximation a valid description of the polysilicon channel, minimizing the constraints of the localized grain boundaries on current transport. In the on-state regime, instead, strong nonuniformities in channel electrostatics arise from the high free-electron concentration in the intra-grain regions and the strong filling of the acceptor traps at the grain boundaries. In particular, the nonuniformities in channel electrostatics result in high energy barriers at the grain boundaries [11, 12, 31], strongly limiting current transport through the polysilicon channel. This makes the gradual channel approximation less accurate and the activation of thermionic emission at the grain boundaries more relevant for \(I_{\mathrm{D}}\).

It is worth noting that the differences among the predictions of the investigated models in the on-state can be summarized in a different device transconductance and, in turn, in a different effective mobility of electrons in the channel, as shown in Fig. 4. The effective medium approximation provides, in this regard, an effective mobility that is intermediate between the values resulting in the presence of Voronoi grains with pure drift-diffusion transport and with mixed intra-grain drift-diffusion and inter-grain thermionic emission. Referring, first, to the case of pure drift-diffusion simulations, this result can be explained by considering that moving from the effective medium approximation to the Voronoi grains corresponds to clustering the acceptor traps in the channel at localized grain boundaries. This reduces the average number of filled traps in the channel, as shown in Fig. 5, since trap filling is more affected by the electrostatic feedback reducing the local potential when negative charge builds up in the filled acceptor states. As a result of that, \(I_{\mathrm{D}}\) is higher and increases more rapidly in the presence of Voronoi grains than with the effective medium approximation. When thermionic emission is activated at the grain boundaries, then, the reduction in the current (Fig. 3) and of the effective electron mobility (Fig. 4) with respect to the effective medium approximation is the direct consequence of a stronger constraint to the electron flow coming from the relatively high energy barriers arising at the grain boundaries due to acceptor trap filling [11, 12, 31]. All these differences among the models must be carefully considered when investigating current transport through polysilicon-channel devices in the on-state regime.

Fig. 5
figure 5

Average number of filled acceptor traps in the channel of the device in Fig. 1a as a function of \(V_{\mathrm{G}}\), under the effective medium approximation and with Voronoi grains with pure drift-diffusion transport. Results for \(r_\mathrm{f}=0\) are reported

The results in Fig. 3a also reveal that, irrespective of the polysilicon model, the increase in \(r_\mathrm{f}\) gives rise to an improvement in the subthreshold slope of the device. This is expected since the first proposals of the macaroni channel [20] and can be attributed to the removal of all the acceptor traps in the region of the channel that is replaced with the filler oxide. The removal of these traps, in fact, reduces the amount of spurious fixed negative charge that is present in the region of the device at radii less than \(r_{\mathrm{c}}\) as the gate voltage is increased. This leads to a more rapid increase in the free electron density in the channel and to a steeper rise of \(I_{\mathrm{D}}\) in the subthreshold regime. On the other hand, a weak dependence of \(I_{\mathrm{D}}\) on \(r_\mathrm{f}\) is observed in the on-state regime. This is attributed to the fact that (i) for the adopted trap density in the polysilicon channel the change of device \(V_{\mathrm{T}}\) is rather small (see Fig. 3) and (ii) once strong inversion is reached, \(I_{\mathrm{D}}\) depends mainly on the electron concentration at the outer surface of the channel, whose growth is weakly affected by the removal of the acceptor traps at radii less than \(r_\mathrm{f}\).

Fig. 6
figure 6

Same as in Fig. 3b, but at a \(T=250\) K and b \(T=400\) K

As a final remark, note that the increase in temperature results in the increase in device current in the on-state, but qualitatively preserving all the previously discussed trends, as shown in Fig. 6. The current increase with temperature in the on-state is a typical signature of polysilicon conduction [7, 25, 26] and is due both to the reduction in trap occupancy and to the enhancement of thermionic emission at the grain boundaries [12]. This effect results in the increase in the electron effective mobility with temperature, with a stronger growth in the case inter-grain thermionic emission is activated, as shown in Fig. 4.

Fig. 7
figure 7

Simulated \(I_{\mathrm{D}}\)\(V_{\mathrm{G}}\) curves (gray lines) of the device in Fig. 1b for different randomly generated configurations of the polysilicon grains in the channel, at RT, with pure DD transport and for \(r_\mathrm{f}\) equal to a 0, b 10 nm and c 20 nm. The average curve resulting from about 100 Monte Carlo simulations is also highlighted (red lines) for each \(r_\mathrm{f}\), along with the curve obtained in the case of monocrystalline silicon (blue lines) (Color figure online)

3.2 Current variability

In order to investigate the variability in the channel current coming from the haphazardness in the configuration of the polysilicon grains, we calculated \(I_{\mathrm{D}}\) when sweeping the voltage applied to the central gate of the device structure in Fig. 1b, with lateral gates kept at a constant \(V_\mathrm{pass}=7\) V. This voltage scheme makes the lateral transistors operate as pass transistors. This means that \(I_{\mathrm{D}}\) is mainly limited by the portion of the channel under (or next to) the central gate up to currents making the voltage drop over the remaining part of the channel relevant with respect to \(V_{\mathrm{D}}\). This solution allows, on the one hand, to investigate the variability in the current through a short region of polysilicon comparable to the grain size and, on the other hand, to keep the focus on the read operation of a cell in a 3-D NAND string. In the variability analysis, the \(I_{\mathrm{D}}\)\(V_{\mathrm{G}}\) curve was simulated for about 100 randomly generated Voronoi configurations of the polysilicon channel.

Figure 7 shows the results obtained at RT in the case of \(r_\mathrm{f}\) equal to (a) 0, (b) 10 nm and (c) 20 nm, in the case of pure drift-diffusion transport through the channel. The variability in \(I_{\mathrm{D}}\) arising from the polycrystalline silicon channel is clearly evident in all cases. Along with that, the figures also reveal that the increase in \(r_\mathrm{f}\) gives rise to a rightward shift of the average \(I_{\mathrm{D}}\)\(V_{\mathrm{G}}\) curve of the devices (see the red lines) and to an enhancement of the subthreshold slope of this curve. These latter trends appear also in the case of monocrystalline silicon channel (blue lines) and are attributed to a stronger electrostatic control of the central gate over the silicon region under it when \(r_\mathrm{f}\) is increased. Note, in fact, that the lateral gates biased at \(V_\mathrm{pass}\) may significantly contribute to the increase in the electrostatic potential and of the electron concentration in the deep silicon region under the central gate when \(r_\mathrm{f}\) is small, creating a sort of short-channel effect impacting the electrostatics of the central transistor. This effect makes the impact of \(r_\mathrm{f}\) on the \(I_{\mathrm{D}}{-}V_{\mathrm{G}}\) curve much stronger than that observed in the previous section on the structure of Fig. 1a, resulting in a relevant degradation of the subthreshold slope of the devices and in a relevant reduction in their \(V_{\mathrm{T}}\) for small \(r_\mathrm{f}\). In the case of a polysilicon channel, this effect comes along with the change of the subthreshold slope with \(r_\mathrm{f}\) discussed with Fig. 3a.

Fig. 8
figure 8

Cumulative distributions of \(V_{\mathrm{T}}\) obtained at RT for different \(r_\mathrm{f}\), with pure drift-diffusion transport and with mixed intra-grain drift-diffusion and inter-grain thermionic emission

Fig. 9
figure 9

\(\langle V_{\mathrm{T}} \rangle \) and \(\sigma _{V_{\mathrm{T}}}\) as a function of \(r_\mathrm{f}\), at RT

In order to quantify the statistical spread in polysilicon conduction, we extracted a constant current \(V_{\mathrm{T}}\) at \(I_{\mathrm{D}}=100\) nA from the curves in Fig. 7 and from similar curves calculated with inter-grain thermionic emission on. The obtained statistical distributions of \(V_{\mathrm{T}}\) are reported in Fig. 8. Results reveal that not only the average value (\(\langle V_{\mathrm{T}} \rangle \)) but also the spread (\(\sigma _{V_{\mathrm{T}}}\)) of the \(V_{\mathrm{T}}\) distribution changes with \(r_\mathrm{f}\). These trends are highlighted in Fig. 9, where the benefits expected from the increase in \(r_\mathrm{f}\) in terms of variability [20] are clearly evident. Besides, the figure also reveals that the activation of thermionic emission at the grain boundaries gives rise to a higher \(\sigma _{V_{\mathrm{T}}}\). Since \(V_{\mathrm{T}}\) is extracted at an \(I_{\mathrm{D}}\) not far from the on-state regime, this is attributed to the more severe limitation to current transport introduced by the polysilicon grain boundaries when thermionic emission is on, making the haphazardness in their number and position more relevant for the current flow. Finally, Fig. 10 highlights that the increase in temperature results in a reduction in both \(\langle V_{\mathrm{T}} \rangle \) and \(\sigma _{V_{\mathrm{T}}}\). While the reduction in \(\langle V_{\mathrm{T}} \rangle \) with temperature is a well-known dependence affecting also monocrystalline MOSFETs, the reduction in \(\sigma _{V_{\mathrm{T}}}\) is strictly related to the dependence on temperature of the nonuniformities in channel inversion arising from the highly defective grain boundaries. In particular, the increase in temperature smooths the nonuniformities in channel inversion induced by the grain boundaries, making their configuration less relevant for current transport and leading, in turn, to the decrease in \(\sigma _{V_{\mathrm{T}}}\) highlighted in Fig. 10. The reduction in the nonuniformities in channel electrostatics with temperature arises both from the reduction in trap occupancy at the grain boundaries [12] and to a less severe constraint of thermionic emission. All of these points must be carefully considered when addressing the variety of phenomena affected by percolative channel conduction, such as RTN [7, 13, 19, 34,35,36] and charge detrapping [37,38,39,40], when performing spectroscopic analyses of oxide traps based on local tunneling currents [41,42,43,44] and when addressing the impact of localized electron storage in charge trap layers on nonuniform channel inversion [45].

Fig. 10
figure 10

\(\langle V_{\mathrm{T}} \rangle \) and \(\sigma _{V_{\mathrm{T}}}\) as a function of temperature, for \(r_\mathrm{f} = 10\) nm

4 Conclusions

In this paper, we have shown that the models and approximations typically adopted to reproduce the physics of polysilicon channels in a simulation framework may lead to non-negligible differences in the results, both in terms of average current and of current variability. This has to be carefully considered when investigating technology performance and reliability by means of TCAD tools. Besides, the analyses as a function of the filler oxide radius of cylindrical GAA transistors clearly confirmed the benefits coming from the adoption of a macaroni over a nanowire polysilicon channel, representing a cornerstone for the design of 3-D NAND Flash arrays.