A SuperLearner Approach to Predict Run-In Selection in Clinical Trials

Lanera, Corrado; Berchialla, Paola; Lorenzoni, Giulia; Acar, ASLIHAN; Chiminazzo, Valentina; Azzolina, Danila; Gregori, Dario; Baldi, Ileana

doi:10.1155/2022/4306413

A SuperLearner Approach to Predict Run-In Selection in Clinical Trials

Lanera C., Berchialla P., Lorenzoni G., Acar A. S., Chiminazzo V., Azzolina D., ...Daha Fazla

COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, cilt.2022, 2022 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 2022
Basım Tarihi: 2022
Doi Numarası: 10.1155/2022/4306413
Dergi Adı: COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Compendex, EMBASE, MEDLINE, zbMATH
Açık Arşiv Koleksiyonu: AVESİS Açık Erişim Koleksiyonu
Hacettepe Üniversitesi Adresli: Evet

Özet

A critical early step in a clinical trial is defining the study sample that appropriately represents the target population from which the sample will be drawn. Envisaging a "run-in" process in study design may accomplish this task; however, the traditional run-in requires additional patients, increasing times, and costs. The possible use of the available a-priori data could skip the run-in period. In this regard, ML (machine learning) techniques, which have recently shown considerable promising usage in clinical research, can be used to construct individual predictions of therapy response probability conditional on patient characteristics. An ensemble model of ML techniques was trained and validated on twin randomized clinical trials to mimic a run-in process within this framework. An ensemble ML model composed of 26 algorithms was trained on the twin clinical trials. SuperLearner (SL) performance for the Verum (Treatment) arm is above 70% sensitivity. The Positive Predictive Value (PPP) achieves a value of 80%. Results show good performance in the direction of being useful in the simulation of the run-in period; the trials conducted in similar settings can train an optimal patient selection algorithm minimizing the run-in time and costs of conduction.