Hybrid Actuarial Pricing Models for Automobile Insurance: A Comparative Analysis of GLM, XGBoost, CANN and DRN


Araçlı O., Erdemir Ö. G.

VII. International Applied Statistics Congress (UYIK-2026) , İstanbul, Türkiye, 11 - 13 Mayıs 2026, ss.1, (Özet Bildiri)

  • Yayın Türü: Bildiri / Özet Bildiri
  • Basıldığı Şehir: İstanbul
  • Basıldığı Ülke: Türkiye
  • Sayfa Sayıları: ss.1
  • Hacettepe Üniversitesi Adresli: Evet

Özet

One of the primary objectives of the insurance industry is to accurately estimate premiums based on policyholders’ risk profiles. Generalized Linear Models (GLMs) have long been the standard approach in actuarial pricing due to their interpretability and solid statistical foundation. However, they may exhibit limitations in capturing complex nonlinear relationships and interactions present in large-scale insurance datasets. In this study, several advanced machine learning and deep learning models are evaluated as potential alternatives to GLMs using the French Motor Third Party Liability (MTPL) dataset. The models considered include Gradient Boosting Machines (XGBoost), Combined Actuarial Neural Networks (CANN), and Distributional Refinement Network (DRN). To ensure robust model evaluation and mitigate overfitting, a 5-fold cross-validation strategy is employed. In addition, hyperparameter tuning is conducted using the Optuna optimization framework to obtain model-specific optimal configurations. The premium estimation process is decomposed into claim frequency and claim severity components. For frequency modeling, the CANN model achieves the lowest deviance and highest Gini coefficient, indicating superior predictive performance. For severity modeling, the DRN model outperforms other approaches, particularly in capturing heavy-tailed distributions, which are critical in insurance risk modeling. The findings suggest that hybrid approaches combining statistical rigor with neural network flexibility can provide a competitive and robust alternative to traditional GLM-based pricing models.