On the potential of recurrent neural networks for modeling path dependent plasticity

doi:10.1016/j.jmps.2020.103972

Journal of the Mechanics and Physics of Solids

Volume 143, October 2020, 103972

https://doi.org/10.1016/j.jmps.2020.103972

Abstract

The mathematical description of elastoplasticity is a highly complex problem due to the possible change from elastic to elasto-plastic behavior (and vice versa) as a function of the loading path. Advanced physics-based plasticity models usually feature numerous internal variables (often of tensorial nature) along with a set of evolution equations and complementary conditions. In the present work, an attempt is made to develop a machine-learning based model that can replicate the predictions of the anisotropic Yld2000-2d model with homogeneous anisotropic hardening (HAH). For this, a series of modeling problems of increasing complexity is formulated and sequentially addressed using neural network models. It is demonstrated that basic fully-connected neural network models can capture the characteristic non-linearities in the uniaxial stress-strain response such as the Bauschinger effect, permanent softening or latent hardening. A neural network with gated recurrent units (GRUs) and a fully-connected layer is proposed for the modeling of plane stress plasticity for arbitrary loading paths. After training and testing the model through comparison with the Yld2000-2d/HAH model, the recurrent neural network model is also used to model the multi-axial stress-strain response of a two-dimensional foam. Here, the comparison with the results from unit cell simulations provides another validation of the proposed data-driven modeling approach.

Introduction

The availability of reliable computational models describing the large deformation behavior of solids is an essential element for the success of new material solutions. Most existing material models for finite element software have been developed using a physics-based approach: first, the governing mechanisms are identified and then described in an approximate manner using a set of algebraic and partial differential equations. Depending on the length scale at which the relevant mechanisms take place, multiple levels of homogenization are needed to come up with estimates of the stress-strain response at the macroscopic level.

In sheet metal plasticity, the main ingredients of macroscopic material models are the elastic constitutive equation, the yield function, the flow rule, and a set of evolution equations describing the hardening response. Aside from the basic isotropic Levy-von Mises model, anisotropic yield functions and flow potentials such as the Hill (1948, 1990) and Hershey families (e.g. Karafillis and Boyce, 1993; Barlat et al., 2003a, 2003b) have emerged over the last six decades. Furthermore, major developments were concerned with hardening laws including the description of the Bauschinger effect (e.g. Bauschinger, 1886; Prager, 1956; Armstrong et al., 1966; Mollica et al., 2001), kinematic hardening (e.g. Chaboche, 1989), work-hardening stagnation (e.g. Yoshida and Uemori, 2002), permanent softening (e.g. Rauch et al., 2007), latent hardening (e.g. Barlat et al., 2013), distortional hardening (e.g. Barlat et al., 2011, 2014), strain rate effects (e.g. Marsh and Campbell, 1963; Gurrutxaga-Lerma et al., 2015; Balasubramanian and Anand, 2002; Nguyen et al., 2017), and thermal softening (e.g. Stainier et al., 2002; Ling and Belytschko, 2009). For practical applications, the large number of parameters of modern constitutive models is usually calibrated so as to provide an optimal fit for a wide range of experiments (e.g. Abi-Akl and Mohr, 2017; Gorji and Mohr, 2018). In other words, the analysis of the governing physical mechanisms guides the selection of suitable mathematical models, while the model parameters are identified through optimization (minimization of the error between model predictions and experimental results). Given that modern physics-based material models require the use of optimization software for identifying their parameters from experiments (e.g. Gorji et al., 2018; Pack et al., 2018), the question arises whether the direct use of data-driven neural network-based models may provide a viable alternative to physics-based models.

Artificial neural networks (ANNs) are a class of machine learning algorithms that consist of many artificial neurons (often referred to as units) that are arranged in a layered structure. As demonstrated by Cybenko (1989), any smooth non-linear function may be approximated to arbitrary accuracy by a neural network function with one hidden layer and sigmoidal activation. In mechanics, deep feedforward networks composed of multiple Fully-Connected Neural Network (FCNN) layers have been used as a metamodeling technique to offer alternatives to traditional approaches for structural applications (e.g. Kohar et al., 2016, 2017), material parameter identification (e.g. Haj-Ali et al., 2007), constitutive modeling (e.g. Al-Haik et al., 2006), the control of manufactured part attributes such as springback (Cao et al., 1999), as well as the prediction of the mechanical response of polycrystalline metals (Frankel et al., 2019).
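
For orientation, the one-hidden-layer network referred to above can be written as a finite sum of sigmoidal units. The notation below ($N$, $\alpha_j$, $w_j$, $\theta_j$, $s$) is chosen here for illustration and is not taken from the paper:

$$G(x) = \sum_{j=1}^{N} \alpha_j \, s\left(w_j^{T} x + \theta_j\right), \qquad s(t) = \frac{1}{1 + e^{-t}}$$

Here $x$ is the input vector, $w_j$ and $\theta_j$ are the weights and bias of the $j$-th hidden unit, and $\alpha_j$ are the output weights. The approximation result states that, for a continuous target function on a compact domain, the error can be made arbitrarily small by choosing $N$ large enough; it does not say how large $N$ must be or how the parameters are to be found.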

The ANN approach provides a useful framework for describing unconventional hardening behavior observed in tensile experiments. For example, Jenab et al. (2016) employed a shallow neural network to describe the stress-strain response of anisotropic aluminum 5182-O for strain rates ranging from 10⁻³ to 10³/s as a function of the material orientation, the true strain, and the true strain rate. Li et al. (2019) trained an ANN as an integral part of a Johnson-Cook type of law to capture the effects of temperature and strain rate on the hardening of dual phase (DP) steels. Pandya et al. (2019) developed a neural network model to describe the temperature-dependent plasticity and fracture of aluminum 7075 in hot-stamping processes. Gorji and Mohr (2019a) showed the potential of FCNNs to characterize the hardening and softening behavior of metals for temperatures ranging from 25°C to 1250°C and strain rates ranging from 10⁻³/s to 10³/s. Jordan et al. (2019) used a fully connected network to describe the temperature-dependent viscoplastic hardening function of polypropylene. It is worth noting that they developed a robot-assisted automated testing system to generate the wealth of experimental data needed for machine-learning based constitutive modeling.

The complexity of the needed neural network models increases when other material model elements such as the yield function and flow rule need to be represented. In addition to the strain hardening, Ali et al. (2019) trained an ANN with 80 neurons per hidden layer to describe the behavior of single crystals subjected to uniaxial tension or shear loading. They chose the current stress and three Eulerian angles as the inputs and trained their network based on the results from crystal plasticity simulations. Greve et al. (2019) showed a substantial computational speedup when using an ANN to predict the path-dependent forming limit curve (FLC) instead of a Marciniak–Kuczynski (M-K) type of analysis. Lefik and Schrefler (2003) discussed the possibility of incorporating FCNN-based constitutive models into an FE code for modeling the non-linear stress-strain response of composites subject to elasto-plastic hysteresis and biaxial loading. Man and Furukawa (2011) developed an ANN-based constitutive model of anisotropic carbon fiber reinforced plastics. Palau et al. (2012) used the von Mises plasticity model to generate training data for monotonic, cyclic, butterfly, and random loading paths. Subsequently, they trained an FCNN model representing incremental plasticity. Gorji and Mohr (2019b) took a similar approach to train the FCNN architecture and to describe the plane stress response of DP780 steel for monotonic loading. Rovinelli et al. (2018a, 2018b) used a semi-supervised machine learning model to predict the crack propagation of polycrystalline materials under fatigue loading conditions. Liu et al. (2019) proposed a data-driven multiscale material modeling method, which incorporates analytical homogenization solutions into a neural network model. Liu and Wu (2019) extended their model for application to 3D heterogeneous materials with multiscale sources of nonlinearity such as particle-reinforced hyperelastic rubbers, polycrystalline materials, and elasto-plastic composites.
While all of the above work demonstrates the high potential of neural networks as a data-driven alternative to physics-based models, prediction of the stress-strain response of elasto-plastic solids for arbitrary multi-axial loading paths remains challenging.

The family of neural network models is constantly growing. In addition to FCNNs, Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) models have been developed for applications such as image analysis and speech recognition (e.g. Goodfellow et al., 2017). A recent study by Mozaffar et al. (2019) showed the merits of RNNs in predicting the behavior of composites subject to nonlinear loading. Even though RNNs have not yet been applied to the modeling of sheet metal, they seem to be particularly suitable for modeling path-dependent plasticity. Unlike FCNNs, recurrent neural networks are equipped with history-dependent internal variables that may potentially mimic the role of objects such as plastic strains and back stresses in physics-based plasticity. Training of RNNs has long been considered a challenge due to vanishing and exploding gradients in the backpropagation process (Hochreiter et al., 2001). However, Long Short-Term Memory (LSTM) models (e.g. Hochreiter and Schmidhuber, 1997) and Gated Recurrent Units (GRU) (e.g. Cho et al., 2014) are examples of successful RNN formulations that overcome these notorious training issues.

It is the goal of the present work to construct FCNN- as well as RNN-based constitutive models to describe the large deformation response of elasto-plastic solids. The Homogeneous Anisotropic Hardening (HAH) model with the Yld2000-2d yield function (e.g. Barlat et al., 2011; Barlat et al., 2013, 2014) will serve as a physics-based reference solution, thereby challenging the RNN model to represent complex anisotropic hardening phenomena. To gain insight into the modeling capabilities of neural networks, several constitutive modeling problems of increasing degree of difficulty are treated. Section 2 provides an overview of the corresponding problem statements. In Section 3, we recall briefly the constitutive equations for the HAH model, before detailing the formulations of the fully-connected and recurrent neural network models with gated recurrent units in Section 4. In Section 5, it is then demonstrated that FCNNs and RNNs are able to replicate the predictions of conventional plasticity models, including those of the complex HAH model for an aluminum alloy and a mild steel. A combined RNN-FCNN model is then further challenged in Section 6, where we describe the macroscopic stress-strain response of a two-dimensional metallic foam after training based on the results from unit cell simulations.

Section snippets

Problem statements

The primary goal of this work is to demonstrate that conventional physics-based (or at least mechanism-inspired) plasticity models may be substituted by neural network models. As a representative reference solution, an anisotropic plasticity model (Yld2000-2D) with combined isotropic, kinematic and distortional hardening (HAH model) is chosen. Since the application of machine learning in the area of plasticity is still at an early stage of development, we elaborate neural network models for

Constitutive equations

The yield function is written in terms of the anisotropic equivalent stress $\bar{\sigma}$ and the deformation resistance $k[\bar{\varepsilon}_p]$, $f = \bar{\sigma} - k[\bar{\varepsilon}_p] = 0.$

In this work, isotropic hardening is defined by the Hockett-Sherby function (Hockett and Sherby, 1975), $k[\bar{\varepsilon}_p] = \sigma_{sat} - (\sigma_{sat} - \sigma_y) \exp(-m \bar{\varepsilon}_p^{\,n}).$ Here, $\sigma_{sat}$ and $\sigma_y$ denote the ultimate stress and the initial yield stress, respectively, whereas $m$ and $n$ control the hardening evolution. The non-quadratic Yld2000-2d yield function (Barlat et al., 2003) is employed to describe the
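
The Hockett-Sherby law and the yield condition above can be evaluated numerically in a few lines. The following is a minimal sketch, not the authors' implementation: the function names are hypothetical, the parameter values are placeholders rather than the calibrated values from the paper, and the equivalent stress is taken as a given number instead of being computed from the Yld2000-2d function.

```python
import numpy as np

def hockett_sherby(eps_p, sigma_y, sigma_sat, m, n):
    """Deformation resistance k = sigma_sat - (sigma_sat - sigma_y) * exp(-m * eps_p**n)."""
    eps_p = np.asarray(eps_p, dtype=float)
    return sigma_sat - (sigma_sat - sigma_y) * np.exp(-m * eps_p**n)

def yield_function(sigma_bar, eps_p, sigma_y, sigma_sat, m, n):
    """f = sigma_bar - k[eps_p]; f < 0 is elastic, f = 0 is on the yield surface."""
    return sigma_bar - hockett_sherby(eps_p, sigma_y, sigma_sat, m, n)

# Placeholder parameters in MPa (assumed for illustration only).
params = dict(sigma_y=150.0, sigma_sat=400.0, m=5.0, n=0.8)

eps_p = np.linspace(0.0, 0.5, 6)       # equivalent plastic strain grid
k = hockett_sherby(eps_p, **params)    # rises from sigma_y towards sigma_sat
for e, k_val in zip(eps_p, k):
    print(f"eps_p = {e:.1f}  ->  k = {k_val:.1f} MPa")
```

At zero plastic strain the function returns the initial yield stress, and it saturates towards $\sigma_{sat}$ for large plastic strains, which is the behavior the two limits of the formula imply.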

Neural network model formulations

Neural networks are used to describe the relationship between the input and output variables for the constitutive modeling problems listed in Section 2. Today, there exists a wealth of different types of neural networks (e.g. van Veen, 2016) among which two distinct types are applied in the present study. Firstly, when estimating the non-linear functional relationship between an input and an output vector (problems I to III), Fully-Connected Neural Networks (FCNNs) are employed. Secondly, when
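
To make the distinction between the two network types concrete, the sketch below implements one gated recurrent unit followed by a fully-connected output layer in plain NumPy. It is an illustration under stated assumptions and not the architecture of the paper: the layer sizes, the random untrained weights, the choice of three strain components as input and three stress-like components as output, and all function names are hypothetical. The update convention used here is $h_t = (1 - z) \odot h_{t-1} + z \odot \tilde{h}$; some references swap the roles of $z$ and $1 - z$.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def init_params(n_in, n_hidden, n_out, seed=0):
    """Small random (untrained) weights; names and sizes are illustrative."""
    rng = np.random.default_rng(seed)
    p = {}
    for gate in ("z", "r", "h"):  # update gate, reset gate, candidate state
        p["W" + gate] = 0.1 * rng.standard_normal((n_hidden, n_in))
        p["U" + gate] = 0.1 * rng.standard_normal((n_hidden, n_hidden))
        p["b" + gate] = np.zeros(n_hidden)
    p["Wo"] = 0.1 * rng.standard_normal((n_out, n_hidden))  # dense output layer
    p["bo"] = np.zeros(n_out)
    return p

def gru_step(p, h, x):
    """Advance the hidden state h by one input x (standard GRU equations)."""
    z = sigmoid(p["Wz"] @ x + p["Uz"] @ h + p["bz"])
    r = sigmoid(p["Wr"] @ x + p["Ur"] @ h + p["br"])
    h_cand = np.tanh(p["Wh"] @ x + p["Uh"] @ (r * h) + p["bh"])
    return (1.0 - z) * h + z * h_cand

def predict_sequence(p, x_seq):
    """Map an input sequence to an output sequence, carrying h along the path."""
    h = np.zeros_like(p["bz"])
    outputs = []
    for x in x_seq:
        h = gru_step(p, h, x)
        outputs.append(p["Wo"] @ h + p["bo"])
    return np.array(outputs)

params = init_params(n_in=3, n_hidden=8, n_out=3)
strain_path = np.cumsum(np.full((20, 3), 1e-3), axis=0)  # toy proportional path
stress_like = predict_sequence(params, strain_path)
print(stress_like.shape)  # (20, 3)
```

The hidden state `h` is the only quantity passed from one step to the next, so the output at a given strain depends on the path taken to reach it. A fully-connected network alone has no such state and maps each input to the same output regardless of history.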

Problem I: uniaxial tension with loading direction reversal

The uniaxial stress-strain response of an HAH material (i.e., a material obeying the constitutive equations outlined in Section 3) is described through a fully-connected neural network. Training data of the form $\{\varepsilon_i, L_i, \sigma_i\}$ is generated through single element simulations with the parameters for AA5182. Simulations are performed with tensile loading up to a strain of 0.05, 0.10, and 0.15, respectively, followed by reverse loading up to a logarithmic strain of $-0.1$ (Fig. 1c). From each
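
The three strain histories described above (tension up to 0.05, 0.10, or 0.15, then reversal down to a logarithmic strain of -0.1) can be tabulated as follows. This is a sketch of the strain input only: the uniform step size `d_eps` and the function name are assumptions made here, the stresses would have to come from the single element simulations, and the quantity $L_i$ is not generated because its definition is not given in this excerpt.

```python
import numpy as np

def reversal_path(eps_max, eps_end=-0.1, d_eps=1e-3):
    """Strain history: 0 -> eps_max (tension), then eps_max -> eps_end (reverse)."""
    n_up = int(round(eps_max / d_eps))
    n_down = int(round((eps_max - eps_end) / d_eps))
    up = np.linspace(0.0, eps_max, n_up + 1)
    down = np.linspace(eps_max, eps_end, n_down + 1)[1:]  # skip repeated turning point
    return np.concatenate([up, down])

# One path per pre-strain level named in the text.
paths = {eps_max: reversal_path(eps_max) for eps_max in (0.05, 0.10, 0.15)}

for eps_max, path in paths.items():
    print(f"pre-strain {eps_max:.2f}: {path.size} points, "
          f"max = {path.max():.2f}, end = {path[-1]:.2f}")
```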

Application: constitutive model for 2D foam

The results obtained for arbitrary loading paths (Problem IV, Section 5.4) are particularly promising as the same neural network modeling framework is expected to apply to any anisotropic elasto-plastic solid provided that the wealth of data required for training can be generated. This is the case whenever virtual experiments can be performed, i.e., if a material model is available at a small length scale. The neural network can then be trained based on the results from the detailed model and

Conclusions

It has been demonstrated that deep learning with neural networks provides a very powerful modeling framework that is suitable for data-driven constitutive modeling. This strength of Fully-Connected Neural Networks (FCNNs) has been shown in the context of problems dealing with the description of the uniaxial stress-strain response of elasto-plastic solids. Even rather complex phenomena such as the Bauschinger effect and hardening stagnation can be successfully described. For these types of

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The partial financial support of the MIT Industrial Fracture Consortium is gratefully acknowledged. Special thanks are due to Prof. Tomasz Wierzbicki (MIT) for his support of this informal collaboration.

References (57)

R. Abi-Akl et al., Paint-bake effect on the plasticity and fracture of pre-strained aluminum 6451 sheets, Int. J. Mech. Sci. (2017)
M.S. Al-Haik et al., Prediction of nonlinear viscoelastic behavior of polymeric composites using an artificial neural network, Int. J. Plast. (2006)
U. Ali et al., Application of artificial neural networks in micromechanics for polycrystalline metals, Int. J. Plast. (2019)
S. Balasubramanian et al., Elasto-viscoplastic constitutive equations for polycrystalline fcc materials at low homologous temperatures, J. Mech. Phys. Solids (2002)
F. Barlat et al., Plane stress yield function for aluminum alloy sheets - part 1: theory, Int. J. Plast. (2003)
F. Barlat et al., Plastic flow for non-monotonic loading conditions of an aluminum alloy sheet sample, Int. J. Plast. (2003)
F. Barlat et al., An alternative to kinematic hardening in classical plasticity, Int. J. Plast. (2011)
F. Barlat et al., Extension of homogeneous anisotropic hardening model to cross-loading with latent effects, Int. J. Plast. (2013)
F. Barlat et al., Enhancements of homogenous anisotropic hardening model and application to mild and dual-phase steels, Int. J. Plast. (2014)
J.L. Chaboche, Constitutive equations for cyclic plasticity and cyclic viscoplasticity, Int. J. Plast. (1989)
M.B. Gorji et al., Predicting shear fracture of aluminum 6016-T4 during deep drawing: combining Yld-2000 plasticity with Hosford–Coulomb fracture model, Int. J. Mech. Sci. (2018)
M.B. Gorji et al., Heterogeneous random medium plasticity and fracture model of additively-manufactured Ti-6Al-4V, Acta Mater. (2018)
L. Greve et al., Necking-induced fracture prediction using an artificial neural network trained on virtual test data, Eng. Fract. Mech. (2019)
B. Gurrutxaga-Lerma et al., The mechanisms governing the activation of dislocation sources in aluminum at different strain rates, J. Mech. Phys. Solids (2015)
R. Hill, Constitutive modelling of orthotropic plasticity in sheet metals, J. Mech. Phys. Solids (1990)
J.E. Hockett et al., Large strain deformation of polycrystalline metals at low homologous temperatures, J. Mech. Phys. Solids (1975)
A. Jenab et al., The use of genetic algorithm and neural network to predict rate-dependent tensile flow behaviour of AA5182-O sheets, Mater. Des. (2016)
A.P. Karafillis et al., A general anisotropic yield criterion using bounds and a transformation weighting tensor, J. Mech. Phys. Solids (1993)
C.P. Kohar et al., Effects of coupling anisotropic yield functions with the optimization process of extruded aluminum front rail geometries in crashworthiness, Int. J. Solids Struct. (2017)
C.P. Kohar et al., Development of high crush efficient, extrudable aluminium front rails for vehicle lightweighting, Int. J. Impact Eng. (2016)
M. Lefik et al., Artificial neural network as an incremental non-linear constitutive model for a finite element code, Comput. Meth. Appl. Mech. Eng. (2003)
X. Li et al., Machine-learning based temperature- and rate-dependent plasticity model: application to analysis of fracture experiments on DP steel, Int. J. Plast. (2019)
X. Ling et al., Thermal softening induced plastic instability in rate-dependent materials, J. Mech. Phys. Solids (2009)
Z. Liu et al., Exploring the 3D architectures of deep material network in data-driven multiscale mechanics, J. Mech. Phys. Solids (2019)
Z. Liu et al., A deep material network for multiscale topology learning and accelerated nonlinear modeling of heterogeneous materials, Comput. Meth. Appl. Mech. Eng. (2019)
R.W. Logan et al., Upper-bound anisotropic yield locus calculations assuming <111>-pencil glide, Int. J. Mech. Sci. (1980)
K.J. Marsh et al., The effect of strain rate on the post-yield flow of mild steel, J. Mech. Phys. Solids (1963)
F. Mollica et al., The inelastic behavior of metals subject to loading reversal, Int. J. Plast. (2001)
