SplitMS: Split Modulo-Scheduling for Accelerating Loops Onto CGRAs - Equipe Hardware ARchitectures and CAD tools
Communication Dans Un Congrès Année : 2024

SplitMS: Split Modulo-Scheduling for Accelerating Loops Onto CGRAs

Résumé

Coarse-Grained Reconfigurable Array (CGRA) architectures are popular for accelerating loop kernels due to a good balance between energy efficiency and flexibility. Modulo scheduling (MS) is the preferred solution for efficiently mapping loops onto CGRAs. Existing CGRA MS algorithms suffer from low resource utilization if the number of operation nodes in the Data Flow Graph (DFG) is less than the number of Processing Elements (PEs) in the CGRA. To improve instruction level parallelism (ILP), the common approaches unroll the loop before applying MS. However, finding valid MS solutions for larger DFGs becomes difficult for CGRAs with resource constraints. This paper proposes a novel Split Modulo-Scheduling (SplitMS) technique to improve the ILP by segmenting the target CGRA into clusters and mapping loop chunks. We also present a lightweight hardware approach to support the cluster execution. Experiments show that SplitMS for a 4×[2×2] CGRA cluster achieves an average speedup of 2.8× over MS for a 4×4 target CGRA with 8 Load-Store Units (LSUs). SplitMS increases an average of 2.9× the PE utilization and 3× the energy efficiency over the conventional MS approach.
Fichier principal
Vignette du fichier
DSD24_splitMS_authorVersionHAL.pdf (1.32 Mo) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)
licence

Dates et versions

hal-04777325 , version 1 (12-11-2024)

Licence

Identifiants

Citer

Christie Sajitha Sajan, Kevin J M Martin, Satyajit Das, Philippe Coussy. SplitMS: Split Modulo-Scheduling for Accelerating Loops Onto CGRAs. 2024 27th Euromicro Conference on Digital System Design (DSD), Aug 2024, Paris, France. pp.242 - 249, ⟨10.1109/dsd64264.2024.00040⟩. ⟨hal-04777325⟩
4 Consultations
4 Téléchargements

Altmetric

Partager

More