N latency 2N I/O‐bandwidth 2D‐array matrix multiplication algorithm

Oudjida, A.K.; Titr, S.; Hamarlain, M.

doi:10.1108/03321640210423298

Article navigation

Research Article| September 01 2002

N latency 2N I/O‐bandwidth 2D‐array matrix multiplication algorithm

A.K. Oudjida;

A.K. Oudjida

CDTA, El‐Madania, Algiers

Search for other works by this author on:

This Site

PubMed

Google Scholar

S. Titr;

S. Titr

CDTA, El‐Madania, Algiers

Search for other works by this author on:

This Site

PubMed

Google Scholar

M. Hamarlain

CDTA, El‐Madania, Algiers

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author & Article Information

Publisher: Emerald Publishing

Online ISSN: 2054-5606

Print ISSN: 0332-1649

2002

COMPEL (2002) 21 (3): 377–392.

https://doi.org/10.1108/03321640210423298

The emergence of the systolic paradigm in 1978 inspired the first 2D‐array parallelization of the sequential matrix multiplication algorithm. Since then, and due to its attractive and appealing features, systolic approach has been gaining great momentum to the point where all 2D‐array parallelization attempts were exclusively systolic. As good result, latency has been successively reduced a number of times (5N, 3N, 2N, 3N/2), where N is the matrix size. But as latency was getting lower, further irregularities were introduced into the array, making the implementation severely compromised either at VLSI level or at system level. The best illustrative case of such irregularities are the two designs proposed by Tsay and Chang in 1995 and considered as the fastest designs (3N/2) that have been developed so far. The purpose of this paper is twofold: we first demonstrate that N+√N/2 is the minimal latency that can be achieved using the systolic approach. Afterwards, we introduce a full‐parallel 2D‐array algorithm with N latency and 2N I/O‐bandwidth. This novel algorithm is not only the fastest algorithm, but is also the most regular one too. A 3D parallel version with O(log N) latency is also presented.

2002

You do not currently have access to this content.

Don't already have an account? Register

N latency 2N I/O‐bandwidth 2D‐array matrix multiplication algorithm

Email Alerts

Cited By

N latency 2N I/O‐bandwidth 2D‐array matrix multiplication algorithm Available to Purchase

Sign in

Client Account

ICE Member Sign In

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

N latency 2N I/O‐bandwidth 2D‐array matrix multiplication algorithm