A parallel implementation of Strassen's matrix multiplication algorithm for wormhole-routed all-port 2D torus networks


Baransel C., İMRE K. M.

JOURNAL OF SUPERCOMPUTING, cilt.62, sa.1, ss.486-509, 2012 (SCI İndekslerine Giren Dergi) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 62 Konu: 1
  • Basım Tarihi: 2012
  • Doi Numarası: 10.1007/s11227-011-0730-1
  • Dergi Adı: JOURNAL OF SUPERCOMPUTING
  • Sayfa Sayıları: ss.486-509

Özet

A new parallel implementation of Strassen's matrix multiplication algorithm is proposed for massively parallel supercomputers with 2D, all-port torus interconnection networks. The proposed algorithm employs a special conflict-free routing pattern for better scalability and is able to yield a performance rate very close to the theoretical bound for many practical network and matrix sizes. It effectively scales up to very large networks typically containing hundreds-of-thousands processors where petaflop or exaflop processing rates are sought.