This study aims to develop a robust and generalizable hybrid strategy for investigating impact behaviors of high-performance composites, significantly advancing composite impact engineering and demonstrating strong potential for applications in protective and structural systems.
With the hybrid framework that integrates experimental testing, finite element (FE) modeling and machine learning (ML), the study on the low-velocity impact behavior of Kevlar fiber-reinforced composites with thermoplastic polyurethane matrix was carried out: with the validation of the FE model by experiments, the numerical model was used to produce data to train the ML methods.
The best-performing Levenberg–Marquardt artificial neural network model achieved excellent agreement with FE simulation data, yielding a correlation coefficient R > 0.98 and a low mean squared error, which was also proven through experimental validation with satisfactory accuracy.
In the current work, the combined method with the FE model, experiments and ML was developed for low-velocity impact of thermoplastic composite materials. The damage process was investigated, while the accuracy of the proposed methodology was verified when compared to experimental outcomes.
1. Introduction
Composite materials are widely adopted in advanced structural applications owing to their excellent stiffness-to-weight ratios, resistance to corrosion and fatigue and the tailorability of mechanical response (Yakin et al., 2025; Senol et al., 2025a, b). Accordingly, their response to impact loading is a pivotal consideration in design and safety assessments (Siengchin, 2023; Karsandik et al., 2023). Fiber-reinforced composites conventionally adopt thermoset polymer matrices like epoxies, which deliver high stiffness and strength, albeit at the expense of limited fracture toughness (Rezasefat et al., 2022a; Staroverov et al., 2022). Thermoplastic composites have gained increasing attention as a promising alternative, offering improved manufacturing flexibility, enhanced damage tolerance and excellent mechanical stability under extreme service conditions (Boria et al., 2017; Yildirim et al., 2025; Kurdiş et al., 2025; Şenyurt et al., 2025). The inherent ductility and deformation capacity of thermoplastics can facilitate composites' superior delamination resistance and energy absorption capabilities (Sun et al., 2018). Recent investigations have been conducted on characterizing the mechanical behavior and failure mechanisms of thermoplastic matrix composites under impact conditions (Hazzard et al., 2017). For instance, Shah et al. (2021) demonstrated that glass-fiber-reinforced thermoplastic acrylic composites exhibited higher damage initiation thresholds compared to their thermoset counterparts. Liu et al. (2021) found that plastic deformation is one of the primary energy dissipation mechanisms during impacts on thermoplastic polyethylene composites. Thermoplastic polyurethane (TPU) – also broadly known as thermoplastic elastomers, a relatively recent development among thermoplastics – features a microphase-separated architecture comprising alternating soft and hard segments, resulting in an exceptional combination of elasticity, strength and impact resilience surpassing that of conventional epoxies (Miller et al., 2015). Growing evidence supports the superior impact resistance of TPU composites (Lin et al., 2023; Montazeri et al., 2025a, b). Applying TPU either as a surface coating or as electrospun interleaves redistributes impact energy, suppresses delamination and improves post-impact performance (Rizzo et al., 2020; Lu et al., 2025). Russo et al. (2016) reported no delamination in TPU glass-fiber laminates following low-velocity impacts at ambient and sub-zero temperatures. In the carbon fiber (CF) system, Pechlivani (2024) validated the improved mechanical response and flexural performance of 3D-printed TPU/CF composites under Quasi-static tests. Although current findings underscore the promise of TPU composite systems for impact mitigation, further in-depth exploration and advanced methodologies are essential to fully harness their mechanical potential.
Traditional methodologies for evaluating composite impact behavior integrate experimental testing with numerical modeling, addressing a wealth of literature (Paolo et al., 2011; Boukar et al., 2022; Han et al., 2024). Explicit finite element (FE) tools such as Livermore Software DYNA (LS-DYNA) and Abaqus are widely employed for simulating impact phenomena in composites due to their capacity to capture complex damage modes and facilitate parameter calibration to align with experimental observations (Zhang et al., 2025a; Kumar et al., 2024). For example, Boria et al. (2015) performed both experimental and numerical investigations on carbon and/or epoxy composites and demonstrated that, although requiring higher computational cost compared to shell-element models, solid-element FE models using the MAT 55 card in LS-DYNA yielded higher fidelity in predicting intra- and inter-laminar damage. Nevertheless, relying solely on FE simulations becomes computationally prohibitive when examining multivariate parameter spaces. To overcome this limitation, alternative methods have been proposed to augment conventional techniques. Machine learning (ML) has increasingly been integrated into composite mechanics, offering promising outcomes in impact-related scenarios (Sorour et al., 2024; Hasebe et al., 2025; Qiu et al., 2022). Among ML techniques, artificial neural networks (ANNs) have emerged as robust tools for modeling the mechanical response of composite materials (Ding et al., 2022). Feed-forward neural networks, in particular, have shown strong predictive capabilities for composite behavior under diverse loading conditions (Xiong et al., 2023; Zhang et al., 2025b). Zhang et al. (2025b) developed an ANN capable of predicting impact force histories of composite laminates with a high correlation coefficient of 0.98. Mezeix et al. (2023) reported that their neural network effectively detected delamination and estimated damage-related parameters with error rates below 10%. Once trained, ANN models provide near-instantaneous predictions, enabling real-time analysis and high-throughput evaluation, which are unattainable with conventional FE simulations.
However, a key challenge in employing ANNs lies in the need for large, diverse and high-quality training datasets. Generating such datasets purely through experiments can be prohibitively costly and time-consuming, thereby undermining the efficiency gains of ML. To address this, validated FE models can be used as synthetic data generators, forming the basis of hybrid FE-ML frameworks that combine the strengths of physics-based and data-driven approaches (Lei et al., 2021; Stephen et al., 2022; Pittie et al., 2023). For instance, Stephen et al. (2022) utilized simulation-generated datasets to train ANN models that predicted impact responses of various laminate configurations with approximately 6% error. Likewise, Pittie et al. (2023) combined experimental calibration with FE simulations to train ANNs capable of predicting back-face deformation and the number of perforated layers. These studies underscore the increasing relevance of ML and FE-ML hybrids in accelerating composite impact analysis and enhancing design processes.
Building on this foundation, we develop a hybrid, data-driven framework to investigate the low-velocity impact response of TPU-matrix Kevlar-fiber composites with emphasis on the effect of matrix content, enabling efficient prediction of the laminate's impact response. The methodology integrates impact experiments, FE modeling and ANNs. Low-velocity impact (LVI) experiments were conducted to establish reliable data, followed by the development of a ply-level macro-homogeneous FE model in LS-DYNA. The numerical model was successfully validated against experimental tests, and the influence of matrix content on impact behavior has been examined. Subsequently, the model was utilized to generate a structured dataset spanning impact energy and TPU content for network training. Alternative network architectures and training algorithms are systematically assessed to learn mappings from inputs to key impact-response metrics. The resulting ANN surrogate delivers accurate, near-instantaneous predictions and exhibits strong agreement with independent unseen experimental measurements, enabling rapid performance evaluation and early-stage design of impact-resistant TPU composites.
2. Material
The reinforcement material employed in this study is Kevlar® XP S307 fabric (305 g/m2, thickness 0.27 mm, 440 tex), which features two layers of parallel fibers superimposed, stitched together and arranged at +45o and −45o orientations. Mantoflex® 2198A TPU was selected as the matrix material. Initially, the TPU material was provided in pellet form with a density of 1.23 g/cm3, a Shore hardness of 98A and a melt flow rate of 4.5 g/10 min (205 °C, 2.16 kg). Prior to processing, these TPU pellets underwent drying at 105 °C for 5 h using a Fanem 3155E oven. Subsequently, the dried pellets were then processed using an AX Plásticos Lab-16 extruder operated at 140–200 °C and 72 rpm to produce TPU films with an areal density of 60 g/m2 and an average thickness of 60 µm. An annealing treatment was conducted at 80 °C for 8 h during post-processing to enhance the uniformity of the material's crystalline microstructure.
Previous studies have identified several techniques for integrating thermoplastic matrices with reinforcements, including prepregging (Goodman and Loos, 1990), film calendaring (Ali et al., 2003) and aqueous suspension methods (Texier et al., 1993). Among these, hot press forming (Al-Dhaheri et al., 2024), recognized as one effective process in thermoplastic composite manufacturing, was utilized in this study, and the process cycle is depicted in Figure 1(a). In the current fabrication, hot press forming was conducted using pre-heated metal molds installed on a semi-automated hydraulic press (Carver CMG30H-12-ASTM). The process initiates by arranging laminate layers at 200 °C and 10 MPa pressure. After forming, the laminate is transferred to an oven, where it undergoes curing at 80 °C for 15 h to complete solidification.
The diagram contains two labeled sections. The first section (a) at the top shows three stages of a molding process. The three stages are present from left to right. Stage 1, labeled “heated mold,” shows a dark upper mold above a layer stack and a dark bottom mold, with spacing between the molds. Stage 2, labeled “pressing,” shows the upper and bottom molds pressing the layers together. Stage 3, labeled “laminate removal plus curing and solidification,” shows the molds separated and a single flat laminate to the right. The second section (b) at the bottom shows three stacked laminate illustrations. Each stack alternates yellow-labeled Aramid and teal-labeled T P U film layers. From left to right, the first stack is labeled “K 8 underscore 1 P U,” the second is labeled “K 8 2 P U,” and the third is labeled “K 8 3 P U.” The size of the stack increases from left to right. Below each stack is a microstructural image. The first image is labeled “K 8 1 P U” in the upper left corner and displays horizontal irregular yellow lines with darker gaps between them. A rectangular scale bar at the bottom right corner reads “500 micrometers.” The second microstructural image is labeled “K 8 2 P U” and shows similar horizontal yellow bands with darker gaps. A rectangular scale bar at the bottom right corner reads “500 micrometers.” The third microstructural image is labeled “K 8 underscore 3 P U” and contains thicker yellow horizontal bands separated by darker gaps. A rectangular scale bar at the bottom right corner reads “500 micrometers.”(a) Hot press forming cycle and (b) illustration of the laminate configurations and the optical microscopy images on cross-section
The diagram contains two labeled sections. The first section (a) at the top shows three stages of a molding process. The three stages are present from left to right. Stage 1, labeled “heated mold,” shows a dark upper mold above a layer stack and a dark bottom mold, with spacing between the molds. Stage 2, labeled “pressing,” shows the upper and bottom molds pressing the layers together. Stage 3, labeled “laminate removal plus curing and solidification,” shows the molds separated and a single flat laminate to the right. The second section (b) at the bottom shows three stacked laminate illustrations. Each stack alternates yellow-labeled Aramid and teal-labeled T P U film layers. From left to right, the first stack is labeled “K 8 underscore 1 P U,” the second is labeled “K 8 2 P U,” and the third is labeled “K 8 3 P U.” The size of the stack increases from left to right. Below each stack is a microstructural image. The first image is labeled “K 8 1 P U” in the upper left corner and displays horizontal irregular yellow lines with darker gaps between them. A rectangular scale bar at the bottom right corner reads “500 micrometers.” The second microstructural image is labeled “K 8 2 P U” and shows similar horizontal yellow bands with darker gaps. A rectangular scale bar at the bottom right corner reads “500 micrometers.” The third microstructural image is labeled “K 8 underscore 3 P U” and contains thicker yellow horizontal bands separated by darker gaps. A rectangular scale bar at the bottom right corner reads “500 micrometers.”(a) Hot press forming cycle and (b) illustration of the laminate configurations and the optical microscopy images on cross-section
Three laminate configurations with different TPU matrix content were fabricated. The stacking sequences and configurations are shown in Figure 1(b), and detailed information are reported in Table 1. For the nomenclature on the sample, the letter “K” denotes the aramid layers and “PU” refers to the TPU film layers. Each laminate configuration consists of eight aramid layers, intercalated by 1, 2 or 3 TPU films. The final laminate productions were sectioned along the thickness direction, and the laminate's cross-sectional views were observed using optical microscopy, which is presented in Figure 1(b). The dark regions correspond to the aramid layers, while the lighter regions represent the TPU matrix. K8_1PU and K8_3PU are considered as the TPU composite with the minimum and maximum matrix capacity. Below the minimum content, insufficient infiltration of TPU into the aramid fabric occurs, while above the maximum content, excessive resin accumulation leads to degraded laminate quality after hot pressing. The K8_2PU configuration provides a middle state between these two extremes. To obtain the fundamental material mechanical properties of the specimens, a series of tests adhering to the American Society for Testing and Materials (ASTM) standards were performed as material characterization for the FE models.
Summary of the TPU composite laminates
| Laminates | Aramid layers | TPU film layers | Thickness (mm) | Matrix volume fraction |
|---|---|---|---|---|
| K8_1PU | 8 | 7 | 2.44 | 13.9% |
| K8_2PU | 14 | 2.86 | 29.9% | |
| K8_3PU | 21 | 3.17 | 38.1% |
| Laminates | Aramid layers | TPU film layers | Thickness (mm) | Matrix volume fraction |
|---|---|---|---|---|
| K8_1PU | 8 | 7 | 2.44 | 13.9% |
| K8_2PU | 14 | 2.86 | 29.9% | |
| K8_3PU | 21 | 3.17 | 38.1% |
3. Methodology
Figure 2 provides a schematic illustration outlining the three-stage hybrid framework adopted in this study. First, Section 3.1 establishes the experimental foundation through low-velocity impact tests to provide reliable observations. Second, Section 3.2 develops a matrix-content-related ply-level FE model in LS-DYNA based on the laminate's design and specimen architecture. After validation, an LS-OPT workflow has been built, which runs in conjunction with the model in LS-DYNA, to generate a structured dataset for the ML. Third, Section 3.3 introduces the Levenberg–Marquardt and scaled conjugate gradient (SCG) algorithms. These algorithms are employed for feed-forward neural network training to identify the optimal network and achieve accurate prediction on impact response. In the end, this hybrid methodology establishes an efficient predictive surrogate for fast impact-resistance evaluation of TPU composites and guidance of pre-production design towards impact. The following sub-sections will provide detailed explanations of each component.
The workflow diagram is arranged from left to right using outlined circles connected by right-pointing arrows. The first circle on the left contains the text “Experiment” with the word “Foundation” centered below it. A right-pointing arrow leads to a larger overlapping double circle labeled “F E M” at the top, containing two smaller inner circles. The left inner circle contains the text “Ply-level model,” and the right inner circle contains the text “Validation.” A right-pointing arrow leads to a larger circle labeled “Machine Learning” at the top. Inside this circle, there are two ovals on the left, and between the ovals, there is the text “Neural network” at the center. Above this text, there is a curved arrow pointing from left to right. Below the oval, a curved arrow labeled “Training” loops from right to left and then upward toward the left side of the oval. To the right of the oval is a smaller circle labeled “Impact response prediction.” A final right-pointing arrow leads to italic text reading “Fast predictive surrogate for impact-resistance evaluation.”Scheme of the proposed hybrid experiment-FE modeling-machine learning methodology for impact response prediction of TPU composites
The workflow diagram is arranged from left to right using outlined circles connected by right-pointing arrows. The first circle on the left contains the text “Experiment” with the word “Foundation” centered below it. A right-pointing arrow leads to a larger overlapping double circle labeled “F E M” at the top, containing two smaller inner circles. The left inner circle contains the text “Ply-level model,” and the right inner circle contains the text “Validation.” A right-pointing arrow leads to a larger circle labeled “Machine Learning” at the top. Inside this circle, there are two ovals on the left, and between the ovals, there is the text “Neural network” at the center. Above this text, there is a curved arrow pointing from left to right. Below the oval, a curved arrow labeled “Training” loops from right to left and then upward toward the left side of the oval. To the right of the oval is a smaller circle labeled “Impact response prediction.” A final right-pointing arrow leads to italic text reading “Fast predictive surrogate for impact-resistance evaluation.”Scheme of the proposed hybrid experiment-FE modeling-machine learning methodology for impact response prediction of TPU composites
3.1 Low-velocity impact tests
As illustrated in Figure 3, low-velocity impact tests were conducted in accordance with the ASTM D7136 standard using a drop-weight impact tower with a modified fixture setup. Rectangular composite specimens (150 mm × 100 mm) were centrally impacted by a hemispherical impactor (diameter: 12.7 mm, weight: 5.47 kg). A sensor was used to measure the initial impact velocity upon contact with the laminate, while a load cell was employed to record the contact force between the impactor and the specimen. The tests were performed at three energy levels: 10 J, 20 J and 50 J. For each condition, three replicates were conducted to ensure repeatability and stability (Samlal and Santhanakrishnan, 2022; Meireman et al., 2024; Llanos et al., 2024). To prevent rebound-induced secondary impacts, a pneumatic arm was incorporated into the system. A circular fixture was employed to ensure stable positioning and firm fixation of the specimens during testing, as shown in Figure 3(c). This fixture featured a central opening with a diameter of 75 mm and was rigidly fastened in place using four rubber clamps that provided consistent clamping pressure to minimize specimen movement. Experimental outcomes, including impact velocity, displacement and absorbed energy, were derived and computed from the load cell and the sensor.
Three labeled photographs are arranged in sections. The first section (a) on the left shows a tall vertical drop-weight tower composed of multiple vertical supports, a central guide, and a cylindrical component labeled “Impactor.” A diagonal structure is labeled “Pneumatic Arm.” A smaller labeled section near the center is marked “Optical sensor.” At the bottom, a flat platform is labeled “Rigid Fixture Base.” An inset at the lower left corner shows a cylindrical piece labeled “Impactor tip” with the handwritten number “47” on its surface. The second section (b) on the right side shows a rectangular T P U composite sheet with dimensions marked as “150 millimeters” vertically and “100 millimeters” horizontally, indicated by double-headed arrows. The text “L equals 2.5 millimeters” is written near the top of the sheet. Additional written text on the sheet includes “G 1 P 1” and “1 P U K 8.” A dashed circle highlights a portion of the sheet, leading to a zoomed inset at the right showing horizontal parallel lines. Below the inset is a ruler showing visible numbers “2,” “3,” and “4.” The third section (c) at the lower right side shows a metal assembly labeled “Circular fixture” at the center. Above the fixture, two labeled components are marked “Rubber clamps.” Levers and fasteners are visible around the fixture.(a) Drop-weight impact tower, (b) TPU composite specimen and (c) circular fixture
Three labeled photographs are arranged in sections. The first section (a) on the left shows a tall vertical drop-weight tower composed of multiple vertical supports, a central guide, and a cylindrical component labeled “Impactor.” A diagonal structure is labeled “Pneumatic Arm.” A smaller labeled section near the center is marked “Optical sensor.” At the bottom, a flat platform is labeled “Rigid Fixture Base.” An inset at the lower left corner shows a cylindrical piece labeled “Impactor tip” with the handwritten number “47” on its surface. The second section (b) on the right side shows a rectangular T P U composite sheet with dimensions marked as “150 millimeters” vertically and “100 millimeters” horizontally, indicated by double-headed arrows. The text “L equals 2.5 millimeters” is written near the top of the sheet. Additional written text on the sheet includes “G 1 P 1” and “1 P U K 8.” A dashed circle highlights a portion of the sheet, leading to a zoomed inset at the right showing horizontal parallel lines. Below the inset is a ruler showing visible numbers “2,” “3,” and “4.” The third section (c) at the lower right side shows a metal assembly labeled “Circular fixture” at the center. Above the fixture, two labeled components are marked “Rubber clamps.” Levers and fasteners are visible around the fixture.(a) Drop-weight impact tower, (b) TPU composite specimen and (c) circular fixture
3.2 Numerical simulation
As shown in Figure 4, a three-dimensional FE model was constructed in LS-DYNA. Due to symmetry, only a quarter of the laminate geometry was constructed. In the simulation, the motion of the impactor was restricted to the translation along the vertical direction against the laminate's plane to eliminate any unintended in-plane displacement. For the composite laminate, a homogenized macroscopic ply-level model was developed, capturing the bi-layer cross-ply architecture of the studied aramid fabric and variations in TPU content.
The schematic diagram is arranged from left to right. On the left, a three-dimensional quarter model is shown with block elements. At the center edge of the model is a curved meshed wall composed of block-shaped elements. Two double-sided arrows at the top edges point toward surfaces labeled “Friction.” Text arrows along the bottom edges are labeled “Symmetry,” pointing outward from the center. A text arrow along the front edge is labeled “Fixed,” pointing left to right. Near the inner curved edge, a downward-facing shield-shaped element contains the letter “v.” Slightly right of center, a small cube labeled “Friction” sits under an overhang. At the lower center, a stacked schematic of alternating thin layers is displayed. A legend beneath it labels three colors as “Aramid negative 45 degrees,” “Aramid positive 45 degrees,” and “T P U.” On the right side of the figure, a top-view circular clamp arrangement is shown. A circular ring is labeled “Clamp fixture.” At the center of the ring is a circular shape labeled “Impactor.” A diagonal arrow pointing outward from the center to the inner circle edge is labeled “D equals 12.7 millimeters” above “5.47 kilograms.” A horizontal arrow pointing outward to the outer fixture ring is labeled “phi equals 75 millimeters.” A dashed quarter boundary box surrounds the upper left corner of the circular view. Vertical double-headed arrows on the right edge indicate a height dimension labeled “150 millimeters” from top to bottom. Horizontal double-headed arrows at the bottom edge indicate a width dimension labeled “100 millimeters” from left to right.Schematic of the quarter ply-level FE model for TPU composite LVI simulation
The schematic diagram is arranged from left to right. On the left, a three-dimensional quarter model is shown with block elements. At the center edge of the model is a curved meshed wall composed of block-shaped elements. Two double-sided arrows at the top edges point toward surfaces labeled “Friction.” Text arrows along the bottom edges are labeled “Symmetry,” pointing outward from the center. A text arrow along the front edge is labeled “Fixed,” pointing left to right. Near the inner curved edge, a downward-facing shield-shaped element contains the letter “v.” Slightly right of center, a small cube labeled “Friction” sits under an overhang. At the lower center, a stacked schematic of alternating thin layers is displayed. A legend beneath it labels three colors as “Aramid negative 45 degrees,” “Aramid positive 45 degrees,” and “T P U.” On the right side of the figure, a top-view circular clamp arrangement is shown. A circular ring is labeled “Clamp fixture.” At the center of the ring is a circular shape labeled “Impactor.” A diagonal arrow pointing outward from the center to the inner circle edge is labeled “D equals 12.7 millimeters” above “5.47 kilograms.” A horizontal arrow pointing outward to the outer fixture ring is labeled “phi equals 75 millimeters.” A dashed quarter boundary box surrounds the upper left corner of the circular view. Vertical double-headed arrows on the right edge indicate a height dimension labeled “150 millimeters” from top to bottom. Horizontal double-headed arrows at the bottom edge indicate a width dimension labeled “100 millimeters” from left to right.Schematic of the quarter ply-level FE model for TPU composite LVI simulation
Through measurements on the cross-sectional graphs of the laminates, 16 aramid plies were defined, each with a consistent thickness of 0.152 mm. Each pair of adjacent aramid plies represents a single layer of fabric in the laminate specimen, resulting in a total of 8 ply-couples. Within each couple, the fiber orientations were assigned to +45° and −45°, corresponding to the fabric architecture. While the model integrates 7 TPU matrix plies, all laminate plies are modeled with 8-node solid bricks. Aramid plies use the constant-stress under-integrated formulation (ELFORM = 1). To mitigate shear and volumetric locking in the very thin TPU plies, the improved modified-strain solid is adopted (ELFORM = −2). Node merging was employed between adjacent plies to establish interface contact in laminates. As observed in Figure 1(b), the gaps between adjacent aramid layers are minimal in the K8_1PU case due to TPU material infiltration into the fabric during hot-press forming fabrication processing. In contrast, K8_2PU and K8_3PU both exhibit thick and measurable interlayer gap as the matrix content rises. Hence, to this evidence, these inter-ply TPU thicknesses were set as uniform values of 0.001 mm for K8_1PU and 0.060 mm and 0.104 mm for K8_2PU and K8_3PU, respectively. This FE model enables to tune the matrix content by altering TPU ply thicknesses, while the interlaminar failure can be described by the failure of these plies. To replicate the realistic experimental setup, the model also incorporated rigid circular clamps, as shown in Figure 4, and an automatic-surface-to-surface contact with a friction coefficient of 0.3 was defined at the contact interface (Kumar and Bijwe, 2011; Massaq et al., 2016). Hourglass modes can arise in FE models as a consequence of employing under-integrated element formulations. In this study, the Type 5 formulation with coefficient QH = 0.05 has been utilized: the Flanagan–Belytschko stiffness form with exact volume integration formulation. Mesh sensitivity analysis was conducted utilizing different-sized elements, in-depth explanation is not pursued herein. To balance reasonable accuracy and computational efficiency, a uniform mesh size of 1 mm × 1 mm × ply-thickness on composite laminate was adopted for the simulations.
For the material model, MAT 54/55(ENHANCED_COMPOSITE_DAMAGE) model is a recognized widely employed progressive damage material model in LS-DYNA for composite materials, which provides two failure theory options: Chang–Chang failure criteria (MAT 54) and Tsai–Wu failure criteria (MAT 55). In this study, MAT 55, which has been validated in many impact conditions (Sy et al., 2019; Berk et al., 2016), was adopted to describe the behavior of the studied aramid layer. Within the MAT 55 card, the modes of the tensile and compressive fiber are driven as in the Chang-Chang criteria (Chang and Chang, 1987), and the tensile and compressive matrix failures are defined by the Tsai–Wu criteria (Tsai and Wu, 1971). Damage evolution is governed through a set of residual stress limit factors (SLIMT1/2 = 0.1; SLIMC1/2 = 0.9; SLIMS = 0.6), which quantify the residual strength retained by the composite after meeting the failure criteria. As for the TPU inter-plies, MAT 3(PLASTIC_KINEMATIC) model was chosen, which is suitable for capturing the elastic–plastic response of isotropic materials (Hallquist, 2018). In the material model, the elastic and plastic behaviors are defined through Young's modulus and tangent modulus, respectively, while material failure is defined by the effective plastic strain. The input parameters for both the aramid and TPU were derived from standard characterization experiments. A summary of these properties is presented in Table 2.
Material properties of aramid and TPU
| Properties | Symbol | Unit | Value |
|---|---|---|---|
| DuPont Kevlar® XP S307 aramid composite laminate | |||
| Density | 1,410 | ||
| Young's modulus | 21.83 | ||
| Poisson's ratio | 0.33 | ||
| Shear modulus | 256.58 | ||
| Longitudinal tensile strength | 764.20 | ||
| Longitudinal compressive strength | 442.12 | ||
| Transverse tensile strength | 764.20 | ||
| Transverse compressive strength | 442.12 | ||
| MantoFlex TPU Grade 2198A | |||
| Density | 1,230 | ||
| Tensile modulus | E | MPa | 120 |
| Yield stress | MPa | 15 | |
| Ultimate tensile strength | MPa | 98 | |
| Elongation at Failure | FS | % | 400 |
| Properties | Symbol | Unit | Value |
|---|---|---|---|
| DuPont Kevlar® XP S307 aramid composite laminate | |||
| Density | 1,410 | ||
| Young's modulus | 21.83 | ||
| Poisson's ratio | 0.33 | ||
| Shear modulus | 256.58 | ||
| Longitudinal tensile strength | 764.20 | ||
| Longitudinal compressive strength | 442.12 | ||
| Transverse tensile strength | 764.20 | ||
| Transverse compressive strength | 442.12 | ||
| MantoFlex TPU Grade 2198A | |||
| Density | 1,230 | ||
| Tensile modulus | E | MPa | 120 |
| Yield stress | MPa | 15 | |
| Ultimate tensile strength | MPa | 98 | |
| Elongation at Failure | FS | % | 400 |
After comparison and validation of the numerical simulations against experiment results, Livermore Software Optimization (LS-OPT) was employed to automate and streamline the impact simulation process in conjunction with LS-DYNA. LS-OPT is a standalone design-optimization and probabilistic‐analysis software developed by Livermore Software Technology Corporation that interfaces directly with LS-DYNA to facilitate structured workflow, explore design spaces and manage simulation batches (Stander et al., 2020). In this study, LS-OPT was configured to iteratively launch explicit impact simulations across a predetermined range of energy levels, enabling efficient generation of a comprehensive dataset for training the neural networks. A detailed schematic of the LS-OPT workflow adopted in this study is provided in Figure 5. Deterministically and uniformly sampled impact energy levels, ranging from 10 J to 50 J at equal intervals (1.5 J), were employed as input variables (10 J, 20 J and 50 J levels were not included). A total of 78 virtual impact simulation cases, 26 for each laminate case, were conducted to construct the dataset used for subsequent ML.
The flowchart begins at the top with a rectangle labeled “Start,” followed by a downward arrow leading to a rectangle labeled “Setup of sampling, responses, and variables.” A downward arrow leads to an oval labeled “Impact energy E, impactor initial velocity v, and weight m: E equals 1 over 2 m v squared.” A downward arrow leads to a rectangle labeled “Insert velocity value into dyna input file,” followed by another downward arrow leading to a rectangle labeled “Simulation case executing.” A downward arrow leads to a diamond decision box labeled “Sampling complete.” From the right side of the diamond, a rightward arrow labeled “No” loops upward and connects back to the right side of the oval, adjacent to vertical text reading “Derive another energy value.” From the bottom point of the diamond, a downward arrow labeled “Yes” leads to a rectangle labeled “Extract the response metrics from simulation results,” followed by a downward arrow leading to a rectangle labeled “Finish.”LS-OPT work flowchart of automatic virtual LVI testing campaign and database generation for ANN
The flowchart begins at the top with a rectangle labeled “Start,” followed by a downward arrow leading to a rectangle labeled “Setup of sampling, responses, and variables.” A downward arrow leads to an oval labeled “Impact energy E, impactor initial velocity v, and weight m: E equals 1 over 2 m v squared.” A downward arrow leads to a rectangle labeled “Insert velocity value into dyna input file,” followed by another downward arrow leading to a rectangle labeled “Simulation case executing.” A downward arrow leads to a diamond decision box labeled “Sampling complete.” From the right side of the diamond, a rightward arrow labeled “No” loops upward and connects back to the right side of the oval, adjacent to vertical text reading “Derive another energy value.” From the bottom point of the diamond, a downward arrow labeled “Yes” leads to a rectangle labeled “Extract the response metrics from simulation results,” followed by a downward arrow leading to a rectangle labeled “Finish.”LS-OPT work flowchart of automatic virtual LVI testing campaign and database generation for ANN
3.3 Artificial neural network
In this study, ANN was utilized to predict the impact response of the TPU composites. The ANN employed was a feed-forward neural network developed in MATLAB. To train the network, two distinct optimization algorithms were implemented, compared and evaluated: Levenberg–Marquardt (LM) and SCG. These algorithms were selected due to their contrasting strengths: LM is renowned for its fast and accurate convergence, while SCG offers scalable optimization with lower memory demands. The loss function was defined by the mean squared error (MSE), as shown in Equation 1:
where is the expected value, the output value predicted by the network and the number of samples. For both algorithms, model performance was monitored and the weights were updated by minimizing the MSE.
3.3.1 Levenberg–Marquardt algorithm
The LM algorithm addresses non-linear least squares problems by integrating the Gauss–Newton method with gradient descent methods (Gavin, 2016). This hybrid approach enhances convergence stability and accuracy, rendering LM particularly effective for training neural networks requiring high precision. The gradient descent and Gauss–Newton update steps are formulated in Equations 2 and 3, respectively.
where is the Jacobian matrix, is the weight, is the perturbation. A damping parameter () is introduced in Equation 4 to regulate the optimization process and ensure stable convergence. Both in Equation 2 and in Equation 4 play similar roles within the algorithm. The selection of an appropriate is critical, and iterative schemes are typically employed to determine its optimal value. When is small, the algorithm behaves like Gauss–Newton, offering rapid convergence. When is large, it defaults to a more conservative gradient descent approach, ensuring stability near local minima. The LM algorithm exhibits a hybrid structure, which is characterized as follows:
where is the perturbation of the LM. Similarly, in Equation (3), the second-order approximation of Equation (4) is defined by . In cases involving multiple input variables, as employed in this study, it becomes necessary to update the Jacobian matrix at each iteration. Utilizing Broyden's rank-1 update method, the Jacobian renewal can be formulated as follows:
where is an updated Jacobian matrix of the first derivatives. This approximation avoids recomputing from scratch at each iteration, reducing computational overhead without sacrificing significant accuracy. The distinguishing feature of the LM algorithm, in comparison to other optimization methods, is its capability to dynamically update the Jacobian matrix during training. This adaptive mechanism enhances the algorithm's robustness by mitigating convergence difficulties commonly associated with fixed Jacobian approximations, thereby promoting stable and reliable performance across diverse datasets (Transtrum and Sethna, 2012).
3.3.2 Scaled conjugate gradient
The SCG algorithm is a member of the conjugate gradient family of optimization methods, designed to accelerate convergence without requiring expensive line searches at each iteration (Møller, 1993). This design lowers computational complexity, thereby improving efficiency. In the SCG algorithm, the step size is estimated using a quadratic approximation of the error criterion, which reduces reliance on user-defined parameters. These features render SCG a robust optimization method for neural network training, especially in scenarios where accelerated convergence is essential (Babani et al., 2016).
The step size in the SCG algorithm is specified in Equation 6, where denotes the weight vector and represents the global error function. The vectors , , ,.. are non-zero search directions and refers to the quadratic approximation of the error function. The parameters and govern the determination of the step size.
The comparison parameter () serves as a measure of how closely the quadratic approximation aligns with the global error function .
If ≥ 0, the step is accepted, and the algorithm proceeds. If < 0, the approximation is considered poor, and an alternative direction is chosen. This mechanism eliminates the need for repeated line searches while maintaining convergence efficiency (Oliver et al., 2021).
4. Results and discussion
This section first characterizes the impact behaviors of TPU composites through combined experimental-numerical outcomes. Close agreement confirms the fidelity of the numerical approach and the ply-level FE model. The effect of matrix content on the laminate's impact resistance performance has then been explored and discussed. Subsequently, the FE-generated dataset is employed to train the neural network surrogate. By altering algorithms and network architecture, the performance and accuracy of the network has been evaluated to identify the optimal model. After all, the eventual network model has been tested against unseen experiment results, corroborating both the reliability of the proposed hybrid framework and the robustness of the surrogate model.
4.1 Experimental and numerical results
Table 3 compiles the comparison between the simulations and experiments on impact response feature values of peak force, peck displacement and energy absorption, and the comparison on detailed response histories across all impact scenarios is shown in Figure 6, including the force-time and force-displacement plots. From the plots in Figure 6, closed-shape force-displacement curves indicate that all the TPU composite specimens rebounded after impacts of up to 50 J without penetration, confirming the excellent dynamic impact-recovery behavior. Compared with studies on Kevlar-fiber composite with the thermoset matrix, severe fiber breakage and perforation started from 37 J impact on even thicker laminate (Rezasefat et al., 2022b), underscoring the TPU toughening properties on composite impact enhancement and effectiveness in delaying damage initiation. Both the statistics and the curves reveal that increasing the impact energy raises the peak impact force and deflection. While the ascending portions of the force–displacement traces nearly coincide across the energy levels, demonstrating the stability and continuity of the laminate's impact stiffness, together with the evidence that the force–time and force–displacement records are remarkably smooth, exhibiting neither oscillations nor abrupt drops indicative of catastrophic internal failure. Post-impact inspections corroborate the absence of fiber breakage or delamination, suggesting the composite's superior impact resistance performance attributing to the TPU material.
Experimental and numerical comparison on key impact response parameters with deviation
| Impact energy | Peak force [N] | ||||||
|---|---|---|---|---|---|---|---|
| K8_1PU | K8_2PU | K8_3PU | |||||
| Experiment | 10 J | 2346.12 | 6.96% | 2607.82 | 1.92% | 2859.22 | 4.62% |
| Simulation | 2183.85 | 2557.96 | 2727.84 | ||||
| Experiment | 20 J | 3571.24 | 1.40% | 3888.32 | 3.98% | 4067.94 | 1.38% |
| Simulation | 3521.86 | 3733.84 | 4124.53 | ||||
| Experiment | 50 J | 5669.45 | 9.64% | 6445.48 | 9.88% | 7011.71 | 5.83% |
| Simulation | 5122.74 | 5797.92 | 6602.82 | ||||
| Impact energy | Peak force [N] | ||||||
|---|---|---|---|---|---|---|---|
| K8_1PU | K8_2PU | K8_3PU | |||||
| Experiment | 10 J | 2346.12 | 6.96% | 2607.82 | 1.92% | 2859.22 | 4.62% |
| Simulation | 2183.85 | 2557.96 | 2727.84 | ||||
| Experiment | 20 J | 3571.24 | 1.40% | 3888.32 | 3.98% | 4067.94 | 1.38% |
| Simulation | 3521.86 | 3733.84 | 4124.53 | ||||
| Experiment | 50 J | 5669.45 | 9.64% | 6445.48 | 9.88% | 7011.71 | 5.83% |
| Simulation | 5122.74 | 5797.92 | 6602.82 | ||||
| Impact Energy | Peak displacement [mm] | ||||||
|---|---|---|---|---|---|---|---|
| K8_1PU | K8_2PU | K8_3PU | |||||
| Experiment | 10 J | 10.06 | 1.19% | 9.51 | 4.62% | 8.52 | 0.58% |
| Simulation | 9.94 | 9.07 | 8.47 | ||||
| Experiment | 20 J | 13.31 | 1.12% | 13.07 | 0.68% | 11.02 | 0.99% |
| Simulation | 13.16 | 12.98 | 11.13 | ||||
| Experiment | 50 J | 20.95 | 3.44% | 19.47 | 0.67% | 17.12 | 3.71% |
| Simulation | 20.23 | 19.34 | 17.78 | ||||
| Impact Energy | Peak displacement [mm] | ||||||
|---|---|---|---|---|---|---|---|
| K8_1PU | K8_2PU | K8_3PU | |||||
| Experiment | 10 J | 10.06 | 1.19% | 9.51 | 4.62% | 8.52 | 0.58% |
| Simulation | 9.94 | 9.07 | 8.47 | ||||
| Experiment | 20 J | 13.31 | 1.12% | 13.07 | 0.68% | 11.02 | 0.99% |
| Simulation | 13.16 | 12.98 | 11.13 | ||||
| Experiment | 50 J | 20.95 | 3.44% | 19.47 | 0.67% | 17.12 | 3.71% |
| Simulation | 20.23 | 19.34 | 17.78 | ||||
| Impact Energy | Absorbed energy [J] | ||||||
|---|---|---|---|---|---|---|---|
| K8_1PU | K8_2PU | K8_3PU | |||||
| Experiment | 10 J | 4.35 | 5.07% | 3.95 | 5.18% | 3.46 | 5.08% |
| Simulation | 4.86 | 3.43 | 2.95 | ||||
| Experiment | 20 J | 12.70 | 4.85% | 11.96 | 4.91% | 11.16 | 1.50% |
| Simulation | 13.67 | 10.98 | 10.86 | ||||
| Experiment | 50 J | 42.15 | 4.99% | 40.35 | 2.11% | 38.05 | 4.41% |
| Simulation | 44.71 | 39.27 | 35.79 | ||||
| Impact Energy | Absorbed energy [J] | ||||||
|---|---|---|---|---|---|---|---|
| K8_1PU | K8_2PU | K8_3PU | |||||
| Experiment | 10 J | 4.35 | 5.07% | 3.95 | 5.18% | 3.46 | 5.08% |
| Simulation | 4.86 | 3.43 | 2.95 | ||||
| Experiment | 20 J | 12.70 | 4.85% | 11.96 | 4.91% | 11.16 | 1.50% |
| Simulation | 13.67 | 10.98 | 10.86 | ||||
| Experiment | 50 J | 42.15 | 4.99% | 40.35 | 2.11% | 38.05 | 4.41% |
| Simulation | 44.71 | 39.27 | 35.79 | ||||
Six multi-line graphs in three parts are arranged in a 3 by 2 grid. Parts (a) to (c) are labeled “K 8 underscore 1 P U, K 8 underscore 2 P U, and K 8 underscore 3 P U,” respectively. In parts (a) to (c), two graphs are present, one on the left and the other on the right. On the left and right, the vertical axis is labeled “Force (Newtons),” ranging from 0 to 6000 in (a), 0 to 7000 in (b), and 0 to 8000 in (c), in increments of 1000. On the left graph in each part, the horizontal axis is labeled “Displacement (millimeters),” ranging from 0 to 24 with an interval of 4. On the right graph in each part, the horizontal axis is labeled “Time (milliseconds),” ranging from 0 to 20 with an interval of 4. Dotted lines represent “10 Joules,” dashed lines represent “20 Joules,” and long dashed lines represent “50 Joules,” with two different color dots for “experiment” in black dots and “simulation” in red dots. All curves in each graph begin from the origin and increase toward the right and attain a peak and then decrease to the bottom in a concave-down pattern. Part (a): In the left graph, the curves for 10 joules peak just above 2000 at the displacement of 9.5, the curves for 20 joules peak above 3000 at the displacement of 13, and the curves for 50 joules peak above 5000 at the displacement of around 19.5. In the right graph, the curves for 10 joules peak just above 2000 at the time of 8, the curves for 20 joules peak above 3000 at the time of 7, and the curves for 50 joules peak around 5500 at the time of around 6.5. Part (b): In the left graph, the curves for 10 joules peak just above 2000 at the displacement of 8.5, the curves for 20 joules peak above 3500 at the displacement of 13, and the curves for 50 joules peak above 5000 at the displacement of around 19. In the right graph, the curves for 10 joules peak just above 2000 at the time of 7.6, the curves for 20 joules peak above 3500 at the time of 7, and the curves for 50 joules peak around 6000 at the time of around 6. Part (c): In the left graph, the curves for 10 joules peak just above 2500 at the displacement of 8.5, the curves for 20 joules peak above 3600 at the displacement of 11, and the curves for 50 joules peak above 6000 at the displacement of around 17. In the right graph, the curves for 10 joules peak just above 2500 at the time of 7, the curves for 20 joules peak around 4000 at the time of 6.5, and the curves for 50 joules peak around 6500 at the time of around 5.5. Note: All numerical data values are approximated.Comparison of numerical and experimental results at 10 J, 20 J and 50 J: (a) K8_1PU, (b) K8_2PU and (c) K8_3PU
Six multi-line graphs in three parts are arranged in a 3 by 2 grid. Parts (a) to (c) are labeled “K 8 underscore 1 P U, K 8 underscore 2 P U, and K 8 underscore 3 P U,” respectively. In parts (a) to (c), two graphs are present, one on the left and the other on the right. On the left and right, the vertical axis is labeled “Force (Newtons),” ranging from 0 to 6000 in (a), 0 to 7000 in (b), and 0 to 8000 in (c), in increments of 1000. On the left graph in each part, the horizontal axis is labeled “Displacement (millimeters),” ranging from 0 to 24 with an interval of 4. On the right graph in each part, the horizontal axis is labeled “Time (milliseconds),” ranging from 0 to 20 with an interval of 4. Dotted lines represent “10 Joules,” dashed lines represent “20 Joules,” and long dashed lines represent “50 Joules,” with two different color dots for “experiment” in black dots and “simulation” in red dots. All curves in each graph begin from the origin and increase toward the right and attain a peak and then decrease to the bottom in a concave-down pattern. Part (a): In the left graph, the curves for 10 joules peak just above 2000 at the displacement of 9.5, the curves for 20 joules peak above 3000 at the displacement of 13, and the curves for 50 joules peak above 5000 at the displacement of around 19.5. In the right graph, the curves for 10 joules peak just above 2000 at the time of 8, the curves for 20 joules peak above 3000 at the time of 7, and the curves for 50 joules peak around 5500 at the time of around 6.5. Part (b): In the left graph, the curves for 10 joules peak just above 2000 at the displacement of 8.5, the curves for 20 joules peak above 3500 at the displacement of 13, and the curves for 50 joules peak above 5000 at the displacement of around 19. In the right graph, the curves for 10 joules peak just above 2000 at the time of 7.6, the curves for 20 joules peak above 3500 at the time of 7, and the curves for 50 joules peak around 6000 at the time of around 6. Part (c): In the left graph, the curves for 10 joules peak just above 2500 at the displacement of 8.5, the curves for 20 joules peak above 3600 at the displacement of 11, and the curves for 50 joules peak above 6000 at the displacement of around 17. In the right graph, the curves for 10 joules peak just above 2500 at the time of 7, the curves for 20 joules peak around 4000 at the time of 6.5, and the curves for 50 joules peak around 6500 at the time of around 5.5. Note: All numerical data values are approximated.Comparison of numerical and experimental results at 10 J, 20 J and 50 J: (a) K8_1PU, (b) K8_2PU and (c) K8_3PU
As shown in Figure 6, the simulated curves generally exhibit close agreement with the experimental results regarding overall shape, trends and key feature points, thus validating the reliability of the ply-level modeling approach and the FE model. However, it should also be noted that under the high-energy 50 J impact scenario, a noticeable discrepancy emerges at the peak force. Specifically, the experimental force-displacement curves exhibit a pronounced sharp peak at the maximum force, whereas the simulation curves display a smoother transition, consequently underestimating the peak force. Such a discrepancy is not observed in lower-energy impact cases. Experimental observations suggest that during impact loading, the laminate undergoes sliding relative to the clamp fixture, moving towards the center during loading and backward during the rebound phase. This sliding behavior is particularly pronounced in high-energy impact cases. Despite rigorous attempts to secure and tighten the rubber clamps throughout the tests, this sliding could not be entirely eliminated, thereby introducing a limitation on the simulation's ability to precisely replicate significant sliding phenomena.
In Table 3, it also reveals that at the same impact energy level, laminates with higher TPU matrix content exhibit increased peak force, reduced displacement and absorbed energy, suggesting an increase in the rebounding-released energy. Leveraging the elastic properties of TPU, a greater matrix content enables the matrix to sustain a larger proportion of the impact load. During the rebound phase, the accumulated energy is elastically released, thereby decreasing the impact energy transmitted to the aramid fibers. Consequently, higher matrix content effectively reduces final residual deformation and the proportion of energy absorbed. Impact tolerance of composites, characterized by peak force and displacement, is an essential indicator of impact resistance performance. To further elucidate the role of TPU matrix content, Figure 7 presents a line-symbol plot of the impact tolerance point in three configurations across various impact scenarios. It is clearly observed that laminates with increasing TPU matrix content exhibit higher peak forces and less maximum displacement. This trend indicates the enhanced impact stiffness of the laminate, highlighting the robust influence of TPU on the evolution of the laminate's impact tolerance.
The horizontal axis represents “displacement (millimeters),” ranging from 0 to 25 in increments of 5. The vertical axis represents “force (Newtons),” ranging from 0 to 8000 in increments of 1000. The black solid circle represents “Experiment,” and the red square represents “Simulation.” For the first group, 10 joules: Experiment points: K 8 underscore 3 P U (8.1, 2897), K 8 underscore 2 P U (9, 2591), K 8 underscore 1 P U (9.9, 2367). Simulation points: K 8 underscore 3 P U (8.2, 2714), K 8 underscore 2 P U (8.9, 2530.6), and K 8 underscore 1 P U (9.7, 2224). For the second group, 20 joules: Experiment points: K 8 underscore 3 P U (11, 4601.2), K 8 underscore 2 P U (13.1, 3938), K 8 underscore 1 P U (13.3, 3632.6). Simulation points: K 8 underscore 3 P U (11.3, 4163.2), K 8 underscore 2 P U (13, 3816), and K 8 underscore 1 P U (13, 3510). For the third group, 50 joules: Experiment points: K 8 underscore 3 P U (16.7, 7081), K 8 underscore 2 P U (19.1, 6448), K 8 underscore 1 P U (20.5, 5632). Simulation points: K 8 underscore 3 P U (17.3, 6714), K 8 underscore 2 P U (19.1, 5877), and K 8 underscore 1 P U (20.14, 5285). Note: All numerical data values are approximated.The impact tolerance of TPU composites across different energy impacts
The horizontal axis represents “displacement (millimeters),” ranging from 0 to 25 in increments of 5. The vertical axis represents “force (Newtons),” ranging from 0 to 8000 in increments of 1000. The black solid circle represents “Experiment,” and the red square represents “Simulation.” For the first group, 10 joules: Experiment points: K 8 underscore 3 P U (8.1, 2897), K 8 underscore 2 P U (9, 2591), K 8 underscore 1 P U (9.9, 2367). Simulation points: K 8 underscore 3 P U (8.2, 2714), K 8 underscore 2 P U (8.9, 2530.6), and K 8 underscore 1 P U (9.7, 2224). For the second group, 20 joules: Experiment points: K 8 underscore 3 P U (11, 4601.2), K 8 underscore 2 P U (13.1, 3938), K 8 underscore 1 P U (13.3, 3632.6). Simulation points: K 8 underscore 3 P U (11.3, 4163.2), K 8 underscore 2 P U (13, 3816), and K 8 underscore 1 P U (13, 3510). For the third group, 50 joules: Experiment points: K 8 underscore 3 P U (16.7, 7081), K 8 underscore 2 P U (19.1, 6448), K 8 underscore 1 P U (20.5, 5632). Simulation points: K 8 underscore 3 P U (17.3, 6714), K 8 underscore 2 P U (19.1, 5877), and K 8 underscore 1 P U (20.14, 5285). Note: All numerical data values are approximated.The impact tolerance of TPU composites across different energy impacts
This subsection investigates the impact behavior of TPU composites, emphasizing the influence of the TPU matrix, while concurrently validating the reliability of the proposed ply-level numerical model. Drawing on considerations of impact-resistant performance evaluation, three key response metrics, peak impact force, maximum displacement and energy absorption ratio, were identified to characterize the mechanical response of the laminates under impacts. These metrics subsequently define the response space for the further neural network learning.
4.2 Neural network surrogate modeling and prediction performance
Following validation of the FE modeling approach, the developed model was employed in a virtual testing campaign using LS-OPT to generate the dataset for neural network surrogate model training. The neural network inputs include impact energy and the TPU matrix volume fraction. Three critical response metrics derived from simulation outcomes constitute the prediction space of the network, as shown in Figure 8. After the trail-and-error procedure on identifying the initial architecture of networks, a one-hidden-layer fully-connected network was selected as the initial architecture, as depicted in Figure 9. The total data of 78 observations were partitioned into 3 subsets: 69% (54 samples) for training, 21% (16 samples) for validation and the remaining 10% (8 samples) for testing. Prior to network training, normalization of both input and output parameters was essential, which ensures stable network performance, facilitates efficient convergence and maintains the relative relationships among features, thus promoting the reliability and generalizability of the training results. Accordingly, min-max normalization was applied in this study to rescale each feature to the range [0, 1], according to Equation 8:
The horizontal axis at the bottom right is labeled “Force (Newton)” and ranges from 2000 to 7000 in increments of 1000. The horizontal axis at the bottom left is labeled “Displacement (millimeters)” and ranges from 10 to 20 with an interval of 2. The vertical axis on the left is labeled “Energy absorption percentage” and ranges from 30 to 90 in increments of 10. Three sets of data points are shown: Red spheres labeled “K 8 underscore 1 P U,” yellow spheres labeled “K 8 underscore 2 P U,” and teal spheres labeled “K 8 underscore 3 P U.” The data points are scattered from the bottom left side and rise diagonally towards the top right side. The points for “K 8 underscore 3 P U” lie at the bottom, the points for “K 8 underscore 2 P U” lie in the middle, and the points for “K 8 underscore 1 P U” lie at the top. These data points follow an almost similar pattern, starting above 30 percent on the left and ending above 65 percent on the right. Note: All numerical data values are approximated.The prediction response space for neural network learning
The horizontal axis at the bottom right is labeled “Force (Newton)” and ranges from 2000 to 7000 in increments of 1000. The horizontal axis at the bottom left is labeled “Displacement (millimeters)” and ranges from 10 to 20 with an interval of 2. The vertical axis on the left is labeled “Energy absorption percentage” and ranges from 30 to 90 in increments of 10. Three sets of data points are shown: Red spheres labeled “K 8 underscore 1 P U,” yellow spheres labeled “K 8 underscore 2 P U,” and teal spheres labeled “K 8 underscore 3 P U.” The data points are scattered from the bottom left side and rise diagonally towards the top right side. The points for “K 8 underscore 3 P U” lie at the bottom, the points for “K 8 underscore 2 P U” lie in the middle, and the points for “K 8 underscore 1 P U” lie at the top. These data points follow an almost similar pattern, starting above 30 percent on the left and ending above 65 percent on the right. Note: All numerical data values are approximated.The prediction response space for neural network learning
The neural network diagram begins on the left with the “Input layer,” which contains two circles labeled “x subscript 1” and “x subscript 2.” A rectangular label to the left of “x subscript 1” reads “Impact energy,” and a rectangular label to the left of “x subscript 2” reads “Matrix volume fraction.” Multiple straight connecting lines extend from both “x subscript 1” and “x subscript 2” toward a vertical sequence of circles forming the “Hidden layer” at the center, labeled consecutively from top to bottom as “n subscript 1,” “n subscript 2,” “n subscript 3,” followed by a series of dots, then “n subscript i plus 1,” “n subscript i plus 2,” and “n subscript i plus 3.” Each circle in the “Hidden layer” is connected by multiple straight lines to three circles on the right that form the “Output layer,” labeled from top to bottom as “y subscript 1,” “y subscript 2,” and “y subscript 3.” A rectangular label to the right of “y subscript 1” reads “Maximum force,” a rectangular label to the right of “y subscript 2” reads “Maximum displacement,” and a rectangular label to the right of “y subscript 3” reads “Energy absorption percentage.”The utilized neural network architecture
The neural network diagram begins on the left with the “Input layer,” which contains two circles labeled “x subscript 1” and “x subscript 2.” A rectangular label to the left of “x subscript 1” reads “Impact energy,” and a rectangular label to the left of “x subscript 2” reads “Matrix volume fraction.” Multiple straight connecting lines extend from both “x subscript 1” and “x subscript 2” toward a vertical sequence of circles forming the “Hidden layer” at the center, labeled consecutively from top to bottom as “n subscript 1,” “n subscript 2,” “n subscript 3,” followed by a series of dots, then “n subscript i plus 1,” “n subscript i plus 2,” and “n subscript i plus 3.” Each circle in the “Hidden layer” is connected by multiple straight lines to three circles on the right that form the “Output layer,” labeled from top to bottom as “y subscript 1,” “y subscript 2,” and “y subscript 3.” A rectangular label to the right of “y subscript 1” reads “Maximum force,” a rectangular label to the right of “y subscript 2” reads “Maximum displacement,” and a rectangular label to the right of “y subscript 3” reads “Energy absorption percentage.”The utilized neural network architecture
where represents the original value, is the normalized value and and are the minimum and maximum values within the training set.
The size of the hidden layer, characterized by the number of hidden-layer neurons, significantly influences the predictive performance of a neural network model. Determining the optimal number of neurons, however, remains challenging and typically relies on the developer's expertise and trial, as currently there is no definitive theoretical guidance for calculating the accurate size for best performance. An insufficient number of neurons may prevent the neural network from adequately identifying meaningful patterns within complex datasets, resulting in underfitting. Conversely, an excessively large number of neurons can lead to overfitting, wherein the network's extensive processing capacity exceeds the informational content available within the training dataset. In this study, varying numbers of hidden-layer neurons were employed, trained and tested using LM and SCG algorithms. The network performance evaluated through MSE for training and validation subsets is presented in Figure 10. Results indicate that network performance initially improves with increasing hidden-layer size for both algorithms. Specifically, the LM algorithm achieves its lowest validation MSE of 0.0029 with 10 neurons, whereas the SCG algorithm attains its minimum MSE of around 0.0031 with 12 neurons. These values correspond to the optimal network capacities for each respective algorithm. Increasing the hidden-layer size beyond these points did not yield further improvement; instead, the MSE began to increase, signifying the onset of overfitting.
The horizontal axis represents “The number of hidden layer neurons,” ranging from 4 to 16 in increments of 2, and the vertical axis represents “Mean Squared Error (M S E) (10 to the negative 3 power),” ranging from 2 to 9 in increments of 1. The line graph displays four curves. The S C G underscore Training curve (dashed line with square markers) starts at (4, 6.8), passes through (6, 5.0), (10, 3.7), and (12, 3.2), and ends at (16, 3.3). The S C G underscore Validation curve (dashed line with circle markers) starts at (4, 8.2), passes through (6, 4.2), (10, 4.0), and (12, 3.3), and ends at (16, 3.7). The L M underscore training curve (dashed line with triangle markers) starts at (4, 2.8), passes through (6, 2.7), (10, 2.75), and (12, 3.2), and ends at (16, 2.7). The L M underscore Validation curve (dashed line with inverted triangle markers) starts at (4, 4.9), passes through (6, 3.4), (10, 3.0), and (12, 3.3), and ends at (16, 3.2). Note: All numerical data values are approximated.Comparison on MSE of networks with LM and SCG algorithms under different hidden-layer size
The horizontal axis represents “The number of hidden layer neurons,” ranging from 4 to 16 in increments of 2, and the vertical axis represents “Mean Squared Error (M S E) (10 to the negative 3 power),” ranging from 2 to 9 in increments of 1. The line graph displays four curves. The S C G underscore Training curve (dashed line with square markers) starts at (4, 6.8), passes through (6, 5.0), (10, 3.7), and (12, 3.2), and ends at (16, 3.3). The S C G underscore Validation curve (dashed line with circle markers) starts at (4, 8.2), passes through (6, 4.2), (10, 4.0), and (12, 3.3), and ends at (16, 3.7). The L M underscore training curve (dashed line with triangle markers) starts at (4, 2.8), passes through (6, 2.7), (10, 2.75), and (12, 3.2), and ends at (16, 2.7). The L M underscore Validation curve (dashed line with inverted triangle markers) starts at (4, 4.9), passes through (6, 3.4), (10, 3.0), and (12, 3.3), and ends at (16, 3.2). Note: All numerical data values are approximated.Comparison on MSE of networks with LM and SCG algorithms under different hidden-layer size
Figure 11 shows MSE curves over epoch for each algorithm with optimal architecture. Training was terminated based on an early-stopping criterion that monitored validation error, and the network state corresponding to the lowest validation MSE was retained as the best model. For the LM algorithm, the best validation performance was achieved at epoch 27, whereas for the SCG algorithm, the optimal validation performance occurred at epoch 38. The quality of the prediction was further evaluated through regression plots, as illustrated in Figure 12. The neural network trained using LM algorithm demonstrated an exceptional fit, with data points aligning closely to the bisector line in the prediction-expectation plots and achieving consistently high correlation coefficients of approximately 0.986. This suggests that the LM network predictions exhibit nearly perfect correlation with the target responses. Conversely, the SCG network showed a somewhat reduced correlation with target responses, with correlation coefficients ranging from 0.97 to 0.93. Although still respectable, this lower correlation was accompanied by increased scatter in the regression plots, particularly evident within the test dataset, indicating greater deviations between the predicted and actual target values.
Two multi-line graphs are presented side by side. In both graphs, the vertical axis is labeled “Mean Squared Error (M S E),” ranging from 10 to the negative 3 power to 10 to the 1 power in increments of 10 to the 1 power. The horizontal axis is labeled “Epoch,” ranging from 5 to 30 in increments of 5 in graph (a) and ranging from 5 to 40 with an interval of 5 in graph (b). In both graphs, four curves are drawn, representing “Train,” “Validation,” “Test,” and “Best.” On the left, the graph (a) is titled “Best validation performance is 0.0028096 at epoch 27.” The train, validation, and test curves start from around 10 to the 0 power on the vertical axis, and these decrease rapidly and initially overlap and then spread at the bend below the error of 10 to the negative 2 power just before the epoch of 10. After that, these curves remain constant and end on the right side around the error of 3.5 times 10 to the negative 3 power. The best curves are represented by the horizontal and vertical lines intersecting around the error of 2.9 times 10 to the negative 3 power at the epoch of 27. On the right, the graph (b) is labeled “Best validation performance is 0.0031696 at epoch 38.” The train, validation, and test curves start just above 10 to the 0 power on the vertical axis, and these decrease gradually in a concave-up pattern with slight fluctuation. The test curve remains at the top, which ends around 6.3 times 10 to the negative 3 power, while the train and validation curves end around 3.1 times 10 to the negative 3 power. The best curves are represented by the horizontal and vertical lines intersecting around the error of 3.1 times 10 to the negative 3 power at the epoch of 38. Note: All numerical data values are approximated.MSE curves over epoch: (a) LM network with 10-neuron hidden layer and (b) SCG network with 12-neuron hidden layer
Two multi-line graphs are presented side by side. In both graphs, the vertical axis is labeled “Mean Squared Error (M S E),” ranging from 10 to the negative 3 power to 10 to the 1 power in increments of 10 to the 1 power. The horizontal axis is labeled “Epoch,” ranging from 5 to 30 in increments of 5 in graph (a) and ranging from 5 to 40 with an interval of 5 in graph (b). In both graphs, four curves are drawn, representing “Train,” “Validation,” “Test,” and “Best.” On the left, the graph (a) is titled “Best validation performance is 0.0028096 at epoch 27.” The train, validation, and test curves start from around 10 to the 0 power on the vertical axis, and these decrease rapidly and initially overlap and then spread at the bend below the error of 10 to the negative 2 power just before the epoch of 10. After that, these curves remain constant and end on the right side around the error of 3.5 times 10 to the negative 3 power. The best curves are represented by the horizontal and vertical lines intersecting around the error of 2.9 times 10 to the negative 3 power at the epoch of 27. On the right, the graph (b) is labeled “Best validation performance is 0.0031696 at epoch 38.” The train, validation, and test curves start just above 10 to the 0 power on the vertical axis, and these decrease gradually in a concave-up pattern with slight fluctuation. The test curve remains at the top, which ends around 6.3 times 10 to the negative 3 power, while the train and validation curves end around 3.1 times 10 to the negative 3 power. The best curves are represented by the horizontal and vertical lines intersecting around the error of 3.1 times 10 to the negative 3 power at the epoch of 38. Note: All numerical data values are approximated.MSE curves over epoch: (a) LM network with 10-neuron hidden layer and (b) SCG network with 12-neuron hidden layer
Six scatter plots compare “Output” versus “Target” for Training, Validation, and Test in parts (a) and (b) with regression lines and R values. Two parts are labeled (a) and (b). Each set contains three scatter plots. In each graph, three types of data are presented: the open circle represents “Data,” the solid line represents “Fit,” and the dotted line represents “Y equals T.” The horizontal axis is labeled “Target” and ranges from 0 to 1 in increments of 0.2 in the left and right graphs in part (a) and in the left and middle graphs in part (b), while it ranges from 0.2 to 1 with an interval of 0.2 in the middle graph of part (a) and ranges from 0.4 to 1 with an interval of 0.2 in the right graph in part (b). Part (a): In the left graph, the vertical axis is labeled “Output approximately equals 0.97 times Target plus 0.018,” ranging from 0 to 1 with an interval of 0.2. The blue solid line extends diagonally from the bottom left to top right along with the dotted line, and the circular data points are scattered along this diagonal from bottom to top. The label at the bottom reads, “Training: R equals 0.98665.” In the middle graph, the vertical axis is labeled “Output approximately equals 1 times Target plus 0.0081,” ranging from 0.2 to 1 with an interval of 0.1. The green solid line extends diagonally from the bottom left to the top right along with the dotted line, and the circular data points are scattered along this diagonal from bottom to top and mostly clustered from the middle to the top side. The density of the data points is less than the left graph. The label at the bottom reads, “Validation: R equals 0.98626.” In the right graph, the vertical axis is labeled “Output approximately equals 1 times Target plus 0.018,” ranging from 0 to 1 with an interval of 0.2. The red solid line extends diagonally from the bottom left to top right along with the dotted line, and the circular data points are scattered along this diagonal from bottom to top and mostly clustered at the bottom and top ends. The density of the data points is less than the middle graph. The label at the bottom reads, “Test: R equals 0.98578.” Part (b): In the left graph, the vertical axis is labeled “Output approximately equals 0.99 times Target plus 0.0088,” ranging from 0 to 1 with an interval of 0.2. The blue solid line extends diagonally from the bottom left to top right along with the dotted line, and the circular data points are scattered along this diagonal from bottom to top. The label at the bottom reads, “Training: R equals 0.97147.” In the middle graph, the vertical axis is labeled “Output approximately equals 0.93 times Target plus 0.046,” ranging from 0 to 1 with an interval of 0.2. The green solid line extends diagonally from the bottom left to top right along with the dotted line, and the circular data points are scattered along this diagonal from bottom to top and mostly clustered near the middle side. The density of the data points is less than the left graph. The label at the bottom reads, “Validation: R equals 0.95275.” In the right graph, the vertical axis is labeled “Output approximately equals 0.85 times Target plus 0.093,” ranging from 0.3 to 1 with an interval of 0.1. The red solid line extends diagonally from the bottom left to top right, with the dotted line slightly deviating from the solid line, and the circular data points are scattered along this diagonal from bottom to top. The density of the data points is less than the middle graph. The label at the bottom reads, “Test: R equals 0.93744.” Note: All numerical data values are approximated.Regression plots: Comparison of the target and prediction: (a) LM network model and (b) SCG network model
Six scatter plots compare “Output” versus “Target” for Training, Validation, and Test in parts (a) and (b) with regression lines and R values. Two parts are labeled (a) and (b). Each set contains three scatter plots. In each graph, three types of data are presented: the open circle represents “Data,” the solid line represents “Fit,” and the dotted line represents “Y equals T.” The horizontal axis is labeled “Target” and ranges from 0 to 1 in increments of 0.2 in the left and right graphs in part (a) and in the left and middle graphs in part (b), while it ranges from 0.2 to 1 with an interval of 0.2 in the middle graph of part (a) and ranges from 0.4 to 1 with an interval of 0.2 in the right graph in part (b). Part (a): In the left graph, the vertical axis is labeled “Output approximately equals 0.97 times Target plus 0.018,” ranging from 0 to 1 with an interval of 0.2. The blue solid line extends diagonally from the bottom left to top right along with the dotted line, and the circular data points are scattered along this diagonal from bottom to top. The label at the bottom reads, “Training: R equals 0.98665.” In the middle graph, the vertical axis is labeled “Output approximately equals 1 times Target plus 0.0081,” ranging from 0.2 to 1 with an interval of 0.1. The green solid line extends diagonally from the bottom left to the top right along with the dotted line, and the circular data points are scattered along this diagonal from bottom to top and mostly clustered from the middle to the top side. The density of the data points is less than the left graph. The label at the bottom reads, “Validation: R equals 0.98626.” In the right graph, the vertical axis is labeled “Output approximately equals 1 times Target plus 0.018,” ranging from 0 to 1 with an interval of 0.2. The red solid line extends diagonally from the bottom left to top right along with the dotted line, and the circular data points are scattered along this diagonal from bottom to top and mostly clustered at the bottom and top ends. The density of the data points is less than the middle graph. The label at the bottom reads, “Test: R equals 0.98578.” Part (b): In the left graph, the vertical axis is labeled “Output approximately equals 0.99 times Target plus 0.0088,” ranging from 0 to 1 with an interval of 0.2. The blue solid line extends diagonally from the bottom left to top right along with the dotted line, and the circular data points are scattered along this diagonal from bottom to top. The label at the bottom reads, “Training: R equals 0.97147.” In the middle graph, the vertical axis is labeled “Output approximately equals 0.93 times Target plus 0.046,” ranging from 0 to 1 with an interval of 0.2. The green solid line extends diagonally from the bottom left to top right along with the dotted line, and the circular data points are scattered along this diagonal from bottom to top and mostly clustered near the middle side. The density of the data points is less than the left graph. The label at the bottom reads, “Validation: R equals 0.95275.” In the right graph, the vertical axis is labeled “Output approximately equals 0.85 times Target plus 0.093,” ranging from 0.3 to 1 with an interval of 0.1. The red solid line extends diagonally from the bottom left to top right, with the dotted line slightly deviating from the solid line, and the circular data points are scattered along this diagonal from bottom to top. The density of the data points is less than the middle graph. The label at the bottom reads, “Test: R equals 0.93744.” Note: All numerical data values are approximated.Regression plots: Comparison of the target and prediction: (a) LM network model and (b) SCG network model
Based on the preceding analysis, the trained LM network with a 10-neuron hidden layer is identified to be the best-performing predictive surrogate model. At last, as shown in Figure 13, this FE-guided surrogate prediction results have been compared against unseen experimental results regarding the impact response metrices. The model's predictions nicely aligned with the experimental observations, yielding a correlation coefficient of 0.96. This strong agreement underscores the robustness and accuracy of the proposed surrogate model in capturing and predicting the impact responses of TPU composites with varying matrix content.
The horizontal axis at the bottom right is labeled “Displacement (millimeters)” and ranges from 10 to 20 in increments of 2. The horizontal axis at the bottom left is labeled “Force (Newton)” and ranges from 2000 to 7000 in increments of 1000. The vertical axis on the right is labeled “Energy absorption percentage” and ranges from 20 to 90 in increments of 10. Two sets of data points are shown: Black dots labeled “Experiment” and blue dots labeled “Neural network.” The data points are scattered across the plot with multiple sets of both dots. Multiple sets of data points are present at the bottom plane between the displacement of 8 and 14 and between the force of 2000 and 4500. Some of the data points are also positioned at the bottom plane between the displacement of 17 and 21 and between the force of 5500 and 7000. Multiple sets of data points are positioned on the vertical plane at the back side between the energy absorption of 30 and 70 percent and between the force of 2000 and 4500. Note: All numerical data values are approximated.Comparison on FE-guided LM neural network prediction against experimental results
The horizontal axis at the bottom right is labeled “Displacement (millimeters)” and ranges from 10 to 20 in increments of 2. The horizontal axis at the bottom left is labeled “Force (Newton)” and ranges from 2000 to 7000 in increments of 1000. The vertical axis on the right is labeled “Energy absorption percentage” and ranges from 20 to 90 in increments of 10. Two sets of data points are shown: Black dots labeled “Experiment” and blue dots labeled “Neural network.” The data points are scattered across the plot with multiple sets of both dots. Multiple sets of data points are present at the bottom plane between the displacement of 8 and 14 and between the force of 2000 and 4500. Some of the data points are also positioned at the bottom plane between the displacement of 17 and 21 and between the force of 5500 and 7000. Multiple sets of data points are positioned on the vertical plane at the back side between the energy absorption of 30 and 70 percent and between the force of 2000 and 4500. Note: All numerical data values are approximated.Comparison on FE-guided LM neural network prediction against experimental results
5. Conclusion
This study proposes and validates a comprehensive hybrid framework that combines experimental testing, FE modeling and ANNs to investigate and effectively predict the low-velocity impact responses of Kevlar fiber thermoplastic composites incorporating varying TPU matrix content. Key findings include:
According to the mechanical behaviors under impacts, the TPU composites exhibit exceptional toughening and superior impact resistance, while increasing TPU matrix content can effectively enhance composite impact stiffness and modulate impact tolerance.
The developed macro-homogeneous ply-level FE model can accurately capture and replicate the impact behavior and response of TPU composites, serving as a high-fidelity virtual testing tool to simulate impact cases with physical accuracy.
The FE-guided neural network using the LM algorithm demonstrates excellent predictive accuracy and rapid convergence, facilitating a robust and efficient predictive surrogate model for composite instant impact resistance evaluation and design guidance.
This study also presents a robust strategy for exploring, investigating, and predicting the impact performance of advanced composite materials. The numerical results from validated numerical models with detailed experimental data can be utilized as a dataset for ML, thereby enabling rapid prediction of complex impact behaviors.

