-
Large language models, physics-based modeling, experimental measurements: the trinity of data-scarce learning of polymer properties
Authors:
Ning Liu,
Siavash Jafarzadeh,
Brian Y. Lattimer,
Shuna Ni,
Jim Lua,
Yue Yu
Abstract:
Large language models (LLMs) bear promise as a fast and accurate material modeling paradigm for evaluation, analysis, and design. Their vast number of trainable parameters necessitates a wealth of data to achieve accuracy and mitigate overfitting. However, experimental measurements are often limited and costly to obtain in sufficient quantities for finetuning. To this end, we present a physics-bas…
▽ More
Large language models (LLMs) bear promise as a fast and accurate material modeling paradigm for evaluation, analysis, and design. Their vast number of trainable parameters necessitates a wealth of data to achieve accuracy and mitigate overfitting. However, experimental measurements are often limited and costly to obtain in sufficient quantities for finetuning. To this end, we present a physics-based training pipeline that tackles the pathology of data scarcity. The core enabler is a physics-based modeling framework that generates a multitude of synthetic data to align the LLM to a physically consistent initial state before finetuning. Our framework features a two-phase training strategy: (1) utilizing the large-in-amount while less accurate synthetic data for supervised pretraining, and (2) finetuning the phase-1 model with limited experimental data. We empirically demonstrate that supervised pretraining is vital to obtaining accurate finetuned LLMs, via the lens of learning polymer flammability metrics where cone calorimeter data is sparse.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
An Ultra-high-speed Reproducing Kernel Particle Method
Authors:
Siavash Jafarzadeh,
Michael Hillman
Abstract:
In this work, the fast-convolving reproducing kernel particle method (FC-RKPM) is introduced. This method is hundreds to millions of times faster than the traditional RKPM for 3D meshfree simulations. In this approach, the meshfree discretizations with RK approximation are expressed in terms of convolution sums. Fast Fourier transform (FFT) is then used to efficiently compute the convolutions. Cer…
▽ More
In this work, the fast-convolving reproducing kernel particle method (FC-RKPM) is introduced. This method is hundreds to millions of times faster than the traditional RKPM for 3D meshfree simulations. In this approach, the meshfree discretizations with RK approximation are expressed in terms of convolution sums. Fast Fourier transform (FFT) is then used to efficiently compute the convolutions. Certain modifications to the domain and shape functions are considered to maintain generality for complex geometries and arbitrary boundary conditions. The new method does not need to identify, store, and loop over the neighbors which is one of the bottleneck of the traditional meshfree methods. As a result, the run-times and memory allocations are independent of the number of neighbors and the shape function support size. As a model problem, the method is laid out for a Galerkin weak form of the Poisson problem with the RK approximation, and is verified in 1D, 2D, and 3D. Tables with run-times and allocated memory are presented to compare the performance of FC-RKPM with the traditional method in 3D. The performance is studied for various node numbers, support size, and approximation degree. All the implementation details and the roadmap for software development are also provided. Application of the new method to nonlinear and explicit problems are briefly discussed as well.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Heterogeneous Peridynamic Neural Operators: Discover Biotissue Constitutive Law and Microstructure From Digital Image Correlation Measurements
Authors:
Siavash Jafarzadeh,
Stewart Silling,
Lu Zhang,
Colton Ross,
Chung-Hao Lee,
S. M. Rakibur Rahman,
Shuodao Wang,
Yue Yu
Abstract:
Human tissues are highly organized structures with specific collagen fiber arrangements varying from point to point. The effects of such heterogeneity play an important role for tissue function, and hence it is of critical to discover and understand the distribution of such fiber orientations from experimental measurements, such as the digital image correlation data. To this end, we introduce the…
▽ More
Human tissues are highly organized structures with specific collagen fiber arrangements varying from point to point. The effects of such heterogeneity play an important role for tissue function, and hence it is of critical to discover and understand the distribution of such fiber orientations from experimental measurements, such as the digital image correlation data. To this end, we introduce the heterogeneous peridynamic neural operator (HeteroPNO) approach, for data-driven constitutive modeling of heterogeneous anisotropic materials. The goal is to learn both a nonlocal constitutive law together with the material microstructure, in the form of a heterogeneous fiber orientation field, from loading field-displacement field measurements. To this end, we propose a two-phase learning approach. Firstly, we learn a homogeneous constitutive law in the form of a neural network-based kernel function and a nonlocal bond force, to capture complex homogeneous material responses from data. Then, in the second phase we reinitialize the learnt bond force and the kernel function, and training them together with a fiber orientation field for each material point. Owing to the state-based peridynamic skeleton, our HeteroPNO-learned material models are objective and have the balance of linear and angular momentum guaranteed. Moreover, the effects from heterogeneity and nonlinear constitutive relationship are captured by the kernel function and the bond force respectively, enabling physical interpretability. As a result, our HeteroPNO architecture can learn a constitutive model for a biological tissue with anisotropic heterogeneous response undergoing large deformation regime. Moreover, the framework is capable to provide displacement and stress field predictions for new and unseen loading instances.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
Peridynamic Neural Operators: A Data-Driven Nonlocal Constitutive Model for Complex Material Responses
Authors:
Siavash Jafarzadeh,
Stewart Silling,
Ning Liu,
Zhongqiang Zhang,
Yue Yu
Abstract:
Neural operators, which can act as implicit solution operators of hidden governing equations, have recently become popular tools for learning the responses of complex real-world physical systems. Nevertheless, most neural operator applications have thus far been data-driven and neglect the intrinsic preservation of fundamental physical laws in data. In this work, we introduce a novel integral neur…
▽ More
Neural operators, which can act as implicit solution operators of hidden governing equations, have recently become popular tools for learning the responses of complex real-world physical systems. Nevertheless, most neural operator applications have thus far been data-driven and neglect the intrinsic preservation of fundamental physical laws in data. In this work, we introduce a novel integral neural operator architecture called the Peridynamic Neural Operator (PNO) that learns a nonlocal constitutive law from data. This neural operator provides a forward model in the form of state-based peridynamics, with objectivity and momentum balance laws automatically guaranteed. As applications, we demonstrate the expressivity and efficacy of our model in learning complex material behaviors from both synthetic and experimental data sets. We show that, owing to its ability to capture complex responses, our learned neural operator achieves improved accuracy and efficiency compared to baseline models that use predefined constitutive laws. Moreover, by preserving the essential physical laws within the neural network architecture, the PNO is robust in treating noisy data. The method shows generalizability to different domain configurations, external loadings, and discretizations.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
Domain Agnostic Fourier Neural Operators
Authors:
Ning Liu,
Siavash Jafarzadeh,
Yue Yu
Abstract:
Fourier neural operators (FNOs) can learn highly nonlinear mappings between function spaces, and have recently become a popular tool for learning responses of complex physical systems. However, to achieve good accuracy and efficiency, FNOs rely on the Fast Fourier transform (FFT), which is restricted to modeling problems on rectangular domains. To lift such a restriction and permit FFT on irregula…
▽ More
Fourier neural operators (FNOs) can learn highly nonlinear mappings between function spaces, and have recently become a popular tool for learning responses of complex physical systems. However, to achieve good accuracy and efficiency, FNOs rely on the Fast Fourier transform (FFT), which is restricted to modeling problems on rectangular domains. To lift such a restriction and permit FFT on irregular geometries as well as topology changes, we introduce domain agnostic Fourier neural operator (DAFNO), a novel neural operator architecture for learning surrogates with irregular geometries and evolving domains. The key idea is to incorporate a smoothed characteristic function in the integral layer architecture of FNOs, and leverage FFT to achieve rapid computations, in such a way that the geometric information is explicitly encoded in the architecture. In our empirical evaluation, DAFNO has achieved state-of-the-art accuracy as compared to baseline neural operator models on two benchmark datasets of material modeling and airfoil simulation. To further demonstrate the capability and generalizability of DAFNO in handling complex domains with topology changes, we consider a brittle material fracture evolution problem. With only one training crack simulation sample, DAFNO has achieved generalizability to unseen loading scenarios and substantially different crack patterns from the trained scenario. Our code and data accompanying this paper are available at https://github.com/ningliu-iga/DAFNO.
△ Less
Submitted 28 October, 2023; v1 submitted 30 April, 2023;
originally announced May 2023.
-
A general and fast convolution-based method for peridynamics: applications to elasticity and brittle fracture
Authors:
Siavash Jafarzadeh,
Farzaneh Mousavi,
Adam Larios,
Florin Bobaru
Abstract:
We introduce a general and fast convolution-based method (FCBM) for peridynamics (PD). Expressing the PD integrals in terms of convolutions and computing them by fast Fourier transform (FFT), we reduce the computational complexity of PD models from O(N^2) to O(Nlog_2 N), with N being the total number of discretization nodes. Initial neighbor identification and storing neighbor information is not r…
▽ More
We introduce a general and fast convolution-based method (FCBM) for peridynamics (PD). Expressing the PD integrals in terms of convolutions and computing them by fast Fourier transform (FFT), we reduce the computational complexity of PD models from O(N^2) to O(Nlog_2 N), with N being the total number of discretization nodes. Initial neighbor identification and storing neighbor information is not required, and, as a consequence, memory allocation scales with O(N) instead of O(N^2), common for existing methods. The method is applicable to bounded domains with arbitrary shapes and boundary conditions via an embedded constraint (EC) approach. We explain the FCBM-EC formulation for certain bond-based and state-based, linear and nonlinear PD models of elasticity and dynamic brittle fracture, as applications. We solve a 3D elastostatic problem and show that the FCBM reduces the computational time from days to hours and from years to days, compared with the original meshfree discretization for PD models. Large-scale computations of PD models are feasible with the new method, and we demonstrate its versatility by simulating, with ease, the difficult problem of multiple crack branching in a brittle plate.
△ Less
Submitted 12 May, 2021;
originally announced May 2021.
-
Efficient solutions for nonlocal diffusion problems via boundary-adapted spectral methods
Authors:
Siavash Jafarzadeh,
Adam Larios,
Florin Bobaru
Abstract:
We introduce an efficient boundary-adapted spectral method for peridynamic diffusion problems with arbitrary boundary conditions. The spectral approach transforms the convolution integral in the peridynamic formulation into a multiplication in the Fourier space, resulting in computations that scale as O(NlogN). The limitation of regular spectral methods to periodic problems is eliminated using the…
▽ More
We introduce an efficient boundary-adapted spectral method for peridynamic diffusion problems with arbitrary boundary conditions. The spectral approach transforms the convolution integral in the peridynamic formulation into a multiplication in the Fourier space, resulting in computations that scale as O(NlogN). The limitation of regular spectral methods to periodic problems is eliminated using the volume penalization method. We show that arbitrary boundary conditions or volume constraints can be enforced in this way to achieve high levels of accuracy. To test the performance of our approach we compare the computational results with analytical solutions of the nonlocal problem. The performance is tested with convergence studies in terms of nodal discretization and the size of the penalization parameter in problems with Dirichlet and Neumann boundary conditions.
△ Less
Submitted 9 May, 2019;
originally announced May 2019.
-
Spatio-Temporal Modeling of Users' Check-ins in Location-Based Social Networks
Authors:
Ali Zarezade,
Sina Jafarzadeh,
Hamid R. Rabiee
Abstract:
Social networks are getting closer to our real physical world. People share the exact location and time of their check-ins and are influenced by their friends. Modeling the spatio-temporal behavior of users in social networks is of great importance for predicting the future behavior of users, controlling the users' movements, and finding the latent influence network. It is observed that users have…
▽ More
Social networks are getting closer to our real physical world. People share the exact location and time of their check-ins and are influenced by their friends. Modeling the spatio-temporal behavior of users in social networks is of great importance for predicting the future behavior of users, controlling the users' movements, and finding the latent influence network. It is observed that users have periodic patterns in their movements. Also, they are influenced by the locations that their close friends recently visited. Leveraging these two observations, we propose a probabilistic model based on a doubly stochastic point process with a periodic decaying kernel for the time of check-ins and a time-varying multinomial distribution for the location of check-ins of users in the location-based social networks. We learn the model parameters using an efficient EM algorithm, which distributes over the users. Experiments on synthetic and real data gathered from Foursquare show that the proposed inference algorithm learns the parameters efficiently and our model outperforms the other alternatives in the prediction of time and location of check-ins.
△ Less
Submitted 10 April, 2017; v1 submitted 23 November, 2016;
originally announced November 2016.
-
Kissing Cuisines: Exploring Worldwide Culinary Habits on the Web
Authors:
Sina Sajadmanesh,
Sina Jafarzadeh,
Seyed Ali Osia,
Hamid R. Rabiee,
Hamed Haddadi,
Yelena Mejova,
Mirco Musolesi,
Emiliano De Cristofaro,
Gianluca Stringhini
Abstract:
Food and nutrition occupy an increasingly prevalent space on the web, and dishes and recipes shared online provide an invaluable mirror into culinary cultures and attitudes around the world. More specifically, ingredients, flavors, and nutrition information become strong signals of the taste preferences of individuals and civilizations. However, there is little understanding of these palate variet…
▽ More
Food and nutrition occupy an increasingly prevalent space on the web, and dishes and recipes shared online provide an invaluable mirror into culinary cultures and attitudes around the world. More specifically, ingredients, flavors, and nutrition information become strong signals of the taste preferences of individuals and civilizations. However, there is little understanding of these palate varieties. In this paper, we present a large-scale study of recipes published on the web and their content, aiming to understand cuisines and culinary habits around the world. Using a database of more than 157K recipes from over 200 different cuisines, we analyze ingredients, flavors, and nutritional values which distinguish dishes from different regions, and use this knowledge to assess the predictability of recipes from different cuisines. We then use country health statistics to understand the relation between these factors and health indicators of different nations, such as obesity, diabetes, migration, and health expenditure. Our results confirm the strong effects of geographical and cultural similarities on recipes, health indicators, and culinary preferences across the globe.
△ Less
Submitted 25 April, 2017; v1 submitted 26 October, 2016;
originally announced October 2016.