On the potential of recurrent neural networks for modeling path dependent plasticity

https://doi.org/10.1016/j.jmps.2020.103972Get rights and content

Abstract

The mathematical description of elastoplasticity is a highly complex problem due to the possible change from elastic to elasto-plastic behavior (and vice-versa) as a function of the loading path. Advanced physics-based plasticity models usually feature numerous internal variables (often of tensorial nature) along with a set of evolution equations and complementary conditions. In the present work, an attempt is made to come up with a machine-learning based model that can replicate the predictions anisotropic Yld2000-2d model with homogeneous anisotropic hardening (HAH). For this, a series of modeling problems of increasing complexity is formulated and sequentially addressed using neural network models. It is demonstrated that basic fully-connected neural network models can capture the characteristic non-linearities in the uniaxial stress-strain response such as the Bauschinger effect, permanent softening or latent hardening. A neural network with gated recurrent units (GRUs) and fully-connected layer is proposed for the modeling of plane stress plasticity for arbitrary loading paths. After training and testing the model through comparison with the Yld2000-2d/HAH model, the recurrent neural network model is also used to model the multi-axial stress-strain response of a two-dimensional foam. Here, the comparison with the results from unit cell simulations provided another validation of the proposed data-driven modeling approach.

Introduction

The availability of reliable computational models describing the large deformation behavior of solids is an essential element for the success of new material solutions. Most existing material models for finite element software have been developed using a physics-based approach: first, the governing mechanisms are identified and then described in an approximate manner using a set of algebraic and partial differential equations. Depending on the length scale at which the relevant mechanisms take place, multiple levels of homogenization are needed to come up with estimates of the stress-strain response at the macroscopic level.

In sheet metal plasticity, the main ingredients of macroscopic material models are the elastic constitutive equation, the yield function, the flow rule, and a set of evolution equations describing the hardening response. Aside from the basic isotropic Levy-von Mises model, anisotropic yield functions and flow potentials have emerged over the last six decades such as the Hill (1948, 1990) and Hershey families (e.g. Karafillis and Boyce, 1993; Barlat et al., 2003a, 2003b). Furthermore, major developments were concerned with hardening laws including the description of the Bauschinger effect (e.g. Bauschinger, 1886; Prager, 1956; Armstrong et al., 1966; Mollica et al., 2001), kinematic hardening (e.g. Chaboche, 1989), work-hardening stagnation (e.g. Yoshida and Uemori, 2002), permanent softening (e.g. Rauch et al., 2007), latent hardening (e.g. Barlat et al., 2013), distortional hardening (e.g. Barlat et al., 2011, 2014), strain rate effects (e.g. Marsh and Campbell, 1963; Gurrutxaga-Lerma et al., 2015; Balasubramanian and Anand, 2002; Nguyen et al., 2017), and thermal softening (e.g. Stainier et al., 2002; Ling and Belytschko, 2009). For practical applications, the large number of parameters of modern constitutive models are usually calibrated such as to provide an optimal fit for a wide range of experiments (e.g. Abi-Akl and Mohr, 2017; Gorji and Mohr, 2018). In other words, the analysis of the governing physical mechanisms guides the selection of suitable mathematical models, while the model parameters are identified through optimization (minimization of the error between model predictions and experimental results). Given that modern physics-based material models require the use of optimization software for identifying their parameters from experiments (e.g. Gorji et al. 2018; Pack et al., 2018), the question comes up if the direct use of data-driven neural network-based models may provide a viable alternative to physics-based models.

Artificial neural networks (ANNs) are a class of machine learning algorithms that consist of many artificial neurons (often referred to as units) that are arranged in a layered structure. As demonstrated by Cybenko (1989), any smooth non-linear function may be represented by a neural network function with one hidden layer and sigmoidal activation. In mechanics, deep feedforward networks composed of multiple Fully-Connected Neural Network (FCNN) layers have been used as metamodeling technique to offer alternatives to traditional approaches for structural applications (e.g. Kohar et al., 2016, 2017), material parameter identification (e.g. Haj-Ali et al., 2007), constitutive modeling (e.g. Al-Haik et al., 2006), control manufacturing part attributes such as springback (Cao et al., 1999.), as well as the prediction of the mechanical response of polycrystalline metals (Frankel et al., 2019).

The ANN approach provides a useful framework for describing unconventional hardening behavior observed in tensile experiments. For example, Jenab et al. (2016) employed a shallow neural network to describe the stress-strain response of anisotropic aluminum 5182-O for strain rates ranging from 10−3 to 103/s as a function of the material orientation, the true strain, and the true strain rate. Li et al. (2019) trained an ANN as an integral part of a Johnson-Cook type of law to capture the effects of temperature and strain rate on the hardening of dual phase (DP) steels. Pandya et al. (2019) developed a neural network model to describe the temperature-dependent plasticity and fracture of aluminum 7075 in hot-stamping processes. Gorji and Mohr (2019a) showed the potential of FCNN to characterize the hardening and softening behavior of a metals for temperatures ranging from 25°C to 1250°C and strain rates ranging from 10−3/s to 103/s. Jordan et al. (2019) used a fully connected network to describe the temperature-dependent viscoplastic hardening function of polypropylene material. It is worth noting that they developed a robot-assisted automated testing system to generate the wealth of experimental data needed for machine-learning based constitutive modeling.

The complexity of the needed neural network models increases when other material model elements such as the yield function and flow rule need to be represented. In addition to the strain hardening, Ali et al. (2019) trained an ANN with 80 neurons per hidden layer to describe the behavior of single crystals subjected to uniaxial tension or shear loading. They chose the current stress and three Eulerian angles as the inputs and trained their network based on the results from crystal plasticity simulations. Greve et al. (2019) showed a substantial computational speedup when using ANN to predict the path-dependent forming limit curve (FLC) instead of Marciniak–Kuczynski (M-K) type of analysis. Lefik and Schrefler (2003) discussed the possibility of incorporating FCNN based constitutive models into an FE code for modeling the non-linear stress-strain response of composites subject to elasto-plastic hysteresis and biaxial loading. Man and Furukawa (2011) developed an ANN-based constitutive model of anisotropic carbon fiber reinforced plastics. Palau et al. (2012) used the von Mises plasticity model to generate training data for monotonic, cyclic, butterfly, and random loading paths. Subsequently, they trained an FCNN model representing incremental plasticity. Gorji and Mohr (2019b) took a similar approach to train the FCNN architecture and to describe the plane stress response of DP780 steel for monotonic loading. Rovinelli et al. (2018a, 2018b) used a semi-supervised machine learning model to predict the crack propagation of polycrystalline materials under fatigue loading conditions. Liu et al (2019) proposed a data-driven multiscale material modeling method, which incorporates analytical homogenization solutions into a neural network model. Liu and Wu (2019) extended their model for application to 3D heterogeneous materials with multiscale sources of nonlinearity such as particle-reinforced hyperelastic rubbers, polycrystalline materials, and elasto-plastic composites. While all of the above work demonstrates the high potential of neural networks as a data-driven alternative to physics-based models, prediction of the stress-strain response of elasto-plastic solids for arbitrary multi-axial loading paths remains challenging.

The family of the neural network models is constantly growing. In addition to FCNN, Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) models have been developed for applications such as image analysis and speech recognition (e.g. Goodfellow et al., 2017). A recent study by Mozaffar et al. (2019) showed their merits in predicting the behavior of composites subject to nonlinear loading. Even though RNNs have not yet been applied to the modeling of sheet metal, they seem to be particularly suitable for modeling path-dependent plasticity. Unlike FCNNs, recurrent neural networks are equipped with history-dependent internal variables that may potentially mimic the role of objects such as plastic strains and back stresses in physics-based plasticity. Training of RNNs has long been considered a challenge due to vanishing and exploding gradients in the backpropagation process (Hochreiter et al., 2001). However, Long Short-Term Memory (LSTM) models (e.g. Hochreiter and Schmidhuber, 1997) and Gated Recurrent Units (GRU) (e.g. Cho et al., 2014) are examples for successful formulations of RNNs that overcome their notorious training issues.

It is the goal of the present work to construct FCNN- as well as RNN-based constitutive models to describe the large deformation response of elasto-plastic solids. The Homogeneous Anisotropic Hardening (HAH) model with the Yld2000-2d yield function (e.g. Barlat et al., 2011; Barlat et al., 2013, 2014) will serve as a physics-based reference solution, thereby challenging the RNN model to represent complex anisotropic hardening phenomena. To gain insight into the modeling capabilities of neural networks, several constitutive modeling problems of increasing degree of difficulty are treated. Section 2 provides an overview of the corresponding problem statements. In Section 3, we recall briefly the constitutive equations for the HAH model, before detailing the formulations of the fully-connected and recurrent neural network models with gated recurrent units in Section 4. In Section 5, it is then demonstrated that FCNNs and RNNs are able to replicate the predictions of conventional plasticity models, including those of the complex HAH model for an aluminum alloy and a mild steel. A combined RNN-FCNN model is then further challenged in Section 6, where we describe the macroscopic stress-strain response of a two-dimensional metallic foam after training based on the results from unit cell simulations.

Section snippets

Problem statements

The primary goal of this work is to demonstrate that conventional physics-based (or at least mechanism-inspired) plasticity models may be substituted by neural network models. As a representative reference solution, an anisotropic plasticity model (Yld2000-2D) with combined isotropic, kinematic and distortional hardening (HAH model) is chosen. Since the application of machine learning in the area of plasticity is still at an early stage of development, we elaborate neural network models for

Constitutive equations

The yield function is written in terms of the anisotropic equivalent stress σ¯ and the deformation resistance k[ε¯p],f=σ¯k[ε¯p]=0.

In this work, isotropic hardening is defined by the Hockett-Sherby function (Hockett and Sherby, 1975),k[ε¯p]=σsat(σsatσy)exp(mε¯pn).σsat and σy denote the ultimate stress and the initial yield stress, respectively, whereas m and n control the hardening evolution. The non-quadratic Yld2000-2d yield function (Barlat et al., 2003) is employed to describe the

Neural network model formulations

Neural networks are used to describe the relationship between the input and output variables for the constitutive modeling problems listed in Section 2. Today, there exists a wealth of different types of neural networks (e.g. van Veen, 2016) among which two distinct types are applied in the present study. Firstly, when estimating the non-linear functional relationship between an input and an output vector (problems I to III), Fully-Connected Neural Networks (FCNNs) are employed. Secondly, when

Problem I: uniaxial tension with loading direction reversal

The uniaxial stress-strain response of an HAH material (i.e., a material obeying the constitutive equations outlined in Section 3) is described through a fully-connected neural network. Training data of the form {εi,Li,σi} is generated through single element simulations with the parameters for AA5182. Simulations are performed with tensile loading up to a strain of 0.05, 0.10, and 0.15, respectively, followed by reverse loading up to a logarithmic strain of 0.1 (Fig. 1c). From each

Application: constitutive model for 2D foam

The results obtained for arbitrary loading paths (Problem IV, Section 5.4) are particularly promising as the same neural network modeling framework is expected to apply to any anisotropic elasto-plastic solid provided that the wealth of data required for training can be generated. This is the case whenever virtual experiments can be performed, i.e., if a material model is available at a small length scale. The neural network can then be trained based on the results from the detailed model and

Conclusions

It has been demonstrated that deep learning with neural networks provides a very powerful modeling framework that is suitable for data-driven constitutive modeling. This strength of Fully-Connected Neural Networks (FCNNs) has been shown in the context of problems dealing with the description of the uniaxial stress-strain response of elasto-plastic solids. Even rather complex phenomena such as the Bauschinger effect and hardening stagnation can be successfully described. For these types of

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The partial financial support of the MIT Industrial Fracture Consortium is gratefully acknowledged. Special thanks are due to Prof. Tomasz Wierzbicki (MIT) for his support of this informal collaboration.

References (57)

  • M.B. Gorji et al.

    Predicting shear fracture of aluminum 6016-T4 during deep drawing: combining Yld-2000 plasticity with Hosford–Coulomb fracture model

    Int. J. Mech. Sci.

    (2018)
  • M.B. Gorji et al.

    Heterogeneous random medium plasticity and fracture model of additively-manufactured Ti-6Al-4V

    Acta Mater.

    (2018)
  • L. Greve et al.

    Necking-induced fracture prediction using an artificial neural network trained on virtual test data

    Eng. Fract. Mech.

    (2019)
  • B.E.N.A.T. Gurrutxaga-Lerma et al.

    The mechanisms governing the activation of dislocation sources in aluminum at different strain rates

    J. Mech. Phys. Solids

    (2015)
  • R. Hill

    Constitutive modelling of orthotropic plasticity in sheet metals

    J. Mech. Phys. Solids

    (1990)
  • J.E. Hockett et al.

    Large strain deformation of polycrystalline metals at low homologous temperatures

    J. Mech. Phys. Solids

    (1975)
  • A. Jenab et al.

    The use of genetic algorithm and neural network to predict rate-dependent tensile flow behaviour of AA5182-O sheets

    Mater. Des.

    (2016)
  • A.P. Karafillis et al.

    A general anisotropic yield criterion using bounds and a transformation weighting tensor

    J. Mech. Phys. Solids

    (1993)
  • C.P. Kohar et al.

    Effects of coupling anisotropic yield functions with the optimization process of extruded aluminum front rail geometries in crashworthiness

    Int. J. Solids Struct.

    (2017)
  • C.P. Kohar et al.

    Development of high crush efficient, extrudable aluminium front rails for vehicle lightweighting

    Int. J. Impact Eng.

    (2016)
  • M. Lefik et al.

    Artificial neural network as an incremental non-linear constitutive model for a finite element code

    Comput. Meth. Appl. Mech. Eng.

    (2003)
  • X. Li et al.

    Machine-learning based temperature-and rate-dependent plasticity model: application to analysis of fracture experiments on DP steel

    Int. J. Plast.

    (2019)
  • X. Ling et al.

    Thermal softening induced plastic instability in rate-dependent materials

    J. Mech. Phys. Solids

    (2009)
  • Z. Liu et al.

    Exploring the 3D architectures of deep material network in data-driven multiscale mechanics

    J. Mech. Phys. Solids

    (2019)
  • Z. Liu et al.

    A deep material network for multiscale topology learning and accelerated nonlinear modeling of heterogeneous materials

    Comput. Meth. Appl. Mech. Eng.

    (2019)
  • R.W. Logan et al.

    Upper-bound anisotropic yield locus calculations assuming< 111>-pencil glide

    Int. J. Mech. Sci.

    (1980)
  • K.J. Marsh et al.

    The effect of strain rate on the post-yield flow of mild steel

    J. Mech. Phys. Solids

    (1963)
  • F. Mollica et al.

    The inelastic behavior of metals subject to loading reversal

    Int. J. Plast.

    (2001)
  • Cited by (150)

    • Top-down constitutive modelling to capture nanoscale shear localization

      2024, Journal of the Mechanics and Physics of Solids
    View all citing articles on Scopus
    View full text