Network-on-chip based manycore systems
As deep sub-micron technologies advance, architectures of microprocessors have evolved from traditional monolithic ones into parallel ones which consist of large number of small but energy efficient cores that rely on Network-on-Chip (NoC) communication infrastructure to achieve scalability. This th...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2011
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/46287 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | As deep sub-micron technologies advance, architectures of microprocessors have evolved from traditional monolithic ones into parallel ones which consist of large number of small but energy efficient cores that rely on Network-on-Chip (NoC) communication infrastructure to achieve scalability. This thesis presents the design and development of a comprehensive simulation framework for an NoC based system simulator covering all system levels from low-level hardware modules to high-level software applications. The motivation stems from the fact that there exists no publicly available simulator that supports embedded NoC based manycore systems. A cycle-level simulation framework has been proposed using the UNISIM environment to complement the configurable on-chip network built upon efficient pipelined routers that employ wormhole switching and virtual-channel flow control. In order to accelerate the cycle-level micro-architectural simulations for manycore systems, novel techniques have been devised to accelerate the cycle-level simulations on multi-core platforms. In particular, we have exploited the fine-grained parallelism within each simulated cycle using Pthreads, leading to notable speedups. The proposed multithreaded simulation engine exploits inherent parallelism with the help of an adaptive technique for managing the computation workload and relies on a graph partitioning based technique for automating load balancing among workloads of multithreaded executions. Our investigations show that, by adaptive distribution of modules among multiple CPU cores of the simulation platform at runtime, the proposed techniques can provide for up to 6X speed up using an 8-core computer. Existing allocation strategies do not lend well NoC based many-core systems when their core counts increase. In this thesis, a runtime resource management strategy has been proposed to overcome these limitations thereby making it suitable for NoC based manycore systems that must support a large number of cores. The proposed resource management strategy relies on submesh based scheme for organizing resources at runtime allocation/deallocation process. It employs a hierarchical-based resource allocation and deallocation scheme to speed-up and to save energy. |
---|