Systematic literature review on software quality for AI-based software

GEZİCİ, BAHAR; KOLUKISA TARHAN, AYÇA

doi:10.1007/s10664-021-10105-2

EMPIRICAL SOFTWARE ENGINEERING, vol.27, no.3, 2022 (SCI-Expanded, Scopus)

Publication Type: Article / Review
Volume: 27 Issue: 3
Publication Date: 2022
DOI: 10.1007/s10664-021-10105-2
Journal Name: EMPIRICAL SOFTWARE ENGINEERING
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Aerospace Database, Applied Science & Technology Source, Communication Abstracts, Compendex, Computer & Applied Sciences, INSPEC, Metadex, Civil Engineering Abstracts
Keywords: Artificial intelligence, Machine learning, Software quality, Quality attributes, Quality metrics, Measurement, Product quality model, ARTIFICIAL-INTELLIGENCE, MANAGEMENT
Hacettepe University Affiliated: Yes

Abstract

There is widespread demand for Artificial Intelligence (AI) software, specifically Machine Learning (ML). Such software is becoming increasingly popular and is being adopted in various applications we use daily. AI-based software quality differs from traditional software quality because AI-based software generally addresses distinct and more complex kinds of problems. With the fast advance of AI technologies and related techniques, how to build high-quality AI-based software has become a very prominent subject. This paper aims to investigate the state of the art on software quality (SQ) for AI-based systems and to identify the quality attributes, applied models, challenges, and practices that are reported in the literature. We carried out a systematic literature review (SLR) covering the period from 1988 to 2020 to (i) analyze and understand related primary studies and (ii) synthesize limitations and open challenges to drive future research. Our study provides a road map for researchers to better understand quality challenges, attributes, and practices in the context of software quality for AI-based software. Based on the empirical evidence gathered in this SLR, we suggest that future work on this topic be structured under three categories: Definition/Specification, Design/Evaluation, and Process/Socio-technical.