Fusing Census and MC-CNN Cost Volumes for Stereo Matching

KIVANÇ, ŞAHIM; ŞEN, BAHA; OK, ALİ; NAR, FATİH

doi:10.22364/bjmc.2022.10.2.11


KIVANÇ Ş. G., ŞEN B., OK A. Ö., NAR F.

BALTIC JOURNAL OF MODERN COMPUTING, vol. 10, no. 2, pp. 251-265, 2022 (ESCI)

Publication Type: Article / Full Article
Volume: 10, Issue: 2
Publication Date: 2022
DOI: 10.22364/bjmc.2022.10.2.11
Journal Name: BALTIC JOURNAL OF MODERN COMPUTING
Journal Indexes: Emerging Sources Citation Index (ESCI), Scopus
Page Numbers: pp. 251-265
Keywords: Stereo Matching, SGM, MGM, Census transform, MC-CNN, Deep Learning
Hacettepe University Affiliated: Yes

Abstract

Stereo matching is an important and popular field of computer vision, and numerous researchers worldwide work on improving the effectiveness of stereo matching applications. A critical step in stereo matching is computing the matching costs. This step generates a cost volume that quantifies the similarity of pixels, which is then processed further to produce the final disparity map. The purpose of this study is to improve stereo matching performance by fusing two different cost volumes, namely Census and MC-CNN. The Census transform combined with the Hamming distance is one of the most frequently used cost functions in conventional approaches. In contrast, a matching cost volume generated by a deep learning technique called MC-CNN has been shown to extract more reliable features from images than conventional approaches. Each of these cost computation strategies has its own advantages and disadvantages. When the deep learning cost volume is fused with the Census cost volume, the advantages of the two complement one another, resulting in a better cost volume before the smoothing operation (e.g., Semi-Global Matching or More-Global Matching) is applied. Our findings indicate that fusing the cost volumes improves stereo matching performance on nearly all of the Middlebury benchmark stereo images we tested.
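
The abstract does not specify how the two cost volumes are combined, so the following is only a minimal NumPy sketch of the general idea, not the authors' method. It builds a Census/Hamming cost volume and fuses it with a second volume by a min-max normalised weighted sum. The function names, the `weight` parameter, the normalisation scheme, and the random stand-in for the MC-CNN volume are all assumptions made for illustration.

```python
import numpy as np

def census_transform(img, radius=1):
    """Per-pixel Census descriptor: one boolean per neighbour, True where
    the neighbour is darker than the window centre."""
    h, w = img.shape
    padded = np.pad(img, radius, mode="edge")
    bits = []
    for dy in range(2 * radius + 1):
        for dx in range(2 * radius + 1):
            if dy == radius and dx == radius:
                continue  # skip the centre pixel itself
            bits.append(padded[dy:dy + h, dx:dx + w] < img)
    return np.stack(bits, axis=-1)  # shape (h, w, n_bits)

def census_cost_volume(left, right, max_disp, radius=1):
    """Hamming distance between Census descriptors for each disparity.
    Left pixel x is compared with right pixel x - d."""
    cl = census_transform(left, radius)
    cr = census_transform(right, radius)
    h, w, n_bits = cl.shape
    # Pixels with no valid match at disparity d keep the maximum cost.
    cost = np.full((max_disp, h, w), float(n_bits), dtype=np.float32)
    for d in range(max_disp):
        cost[d, :, d:] = (cl[:, d:] != cr[:, :w - d]).sum(axis=-1)
    return cost

def fuse_cost_volumes(census_cost, cnn_cost, weight=0.5):
    """Hypothetical fusion rule (an assumption): rescale each volume to
    [0, 1] and take a weighted sum."""
    def normalise(c):
        lo, hi = c.min(), c.max()
        return (c - lo) / (hi - lo) if hi > lo else np.zeros_like(c)
    return weight * normalise(census_cost) + (1.0 - weight) * normalise(cnn_cost)

# Synthetic pair: the right view is the left view shifted by 3 pixels.
rng = np.random.default_rng(0)
left = rng.integers(0, 256, size=(32, 48))
right = np.roll(left, -3, axis=1)

census_cost = census_cost_volume(left, right, max_disp=8, radius=2)
# Placeholder only: random values standing in for a real MC-CNN cost volume.
cnn_cost = rng.random(census_cost.shape).astype(np.float32)
fused = fuse_cost_volumes(census_cost, cnn_cost, weight=0.7)

# Winner-takes-all disparity, used here in place of SGM/MGM smoothing.
disparity = fused.argmin(axis=0)
```

In a real pipeline the fused volume would be passed to a smoothing stage such as SGM or MGM rather than to a plain `argmin`, and the placeholder array would be replaced by the similarity scores produced by a trained MC-CNN network.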