Spoken language biomarkers in Turkish-speaking schizophrenia patients: Evidence from linguistic analysis and word embeddings


Çınar Bozdağ M., Kumcu A., Şenel L. K., Temizkan H. N., Özil Ö., Arslanyürek İ., ...More

PSYCHIATRY RESEARCH, vol.362, pp.117215, 2026 (SCI-Expanded, SSCI, Scopus)

  • Publication Type: Article / Article
  • Volume: 362
  • Publication Date: 2026
  • Doi Number: 10.1016/j.psychres.2026.117215
  • Journal Name: PSYCHIATRY RESEARCH
  • Journal Indexes: Scopus, Science Citation Index Expanded (SCI-EXPANDED), Social Sciences Citation Index (SSCI), BIOSIS, EMBASE, MEDLINE, Psycinfo
  • Page Numbers: pp.117215
  • Hacettepe University Affiliated: Yes

Abstract

Background and Hypothesis

Schizophrenia (SZ) disrupts language in ways that may be universal across languages. This study investigated whether linguistic anomalies previously observed in SZ also occur in Turkish, a morphologically rich and agglutinative language. We hypothesised that SZ patients would differ from healthy controls (HCs) across multiple linguistic domains, including features typically sensitive to cross-linguistic variation.

Methods

Speech characteristics of 50 native Turkish-speaking SZ patients were compared with 50 HCs matched for age, sex, length of education, and handedness. Speech data were collected in 15-minute interviews. The interview recordings were transcribed and analysed for various lexical, syntactic, and phonological measures using CLAN, and compared for discourse measures using fastText word embedding models.

Results

The number of words produced per minute, mean length of utterance, average word frequency, the number of filled pauses, discourse coherence, and question-response similarity were lower in the patient group than in the control group. The content word-function word ratio, sentence prediction loss, type-token ratio, number of silent pauses, and silent pauses-to-total speech ratio were higher in the patient group than in the control group. Specific clinical and sociodemographic variables were identified as predictors of speech abnormalities in patients.

Conclusion

The hypothesis was confirmed. Turkish-speaking SZ patients displayed speech patterns similar to those reported in other language groups, including language-sensitive variables. This supports the idea of universal linguistic disruptions in SZ. The findings are particularly valuable given the scarcity of research on Turkish, a low-resource and typologically distinct language.