32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors
Nowadays mobile devices represent a significant portion of the market for embedded systems, and are continuously demanded in daily life. From the end-user perspective size, weight, features are the key quality criteria. These benchmarks criteria became the usual design constraints in the embedded...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Published: |
Foundation of Computer Science
2012
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/30383/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Malaysia |
id |
my.utm.30383 |
---|---|
record_format |
eprints |
spelling |
my.utm.303832019-07-23T09:01:24Z http://eprints.utm.my/id/eprint/30383/ 32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors Mohamed, Izzeldin Ibrahim Chin, Fatt Chay Marsono, Muhammad Nadzir TK Electrical engineering. Electronics Nuclear engineering Nowadays mobile devices represent a significant portion of the market for embedded systems, and are continuously demanded in daily life. From the end-user perspective size, weight, features are the key quality criteria. These benchmarks criteria became the usual design constraints in the embedded systems design process and put a high impact on the power consumption. This paper survey and explore different low power design techniques for FPGA and processors. We compare, evaluate, and analyze, the power and energy consumption in three different designs namely, Altera FPGA Cyclone II which has a systolic array matrix multiplication implemented, i5 Clarkdale, and Atom Pineview-D Intel general purpose processors, which multiply two nxn 32-bit matrices and produce a 64-bit matrix as an output. We concluded that FPGA is a more power and energy efficient on low matrix size. However, general purpose processor performance is close to FPGA on larger matrix size as the larger cache size in general purpose processor help in reducing latency. We also concluded that the performance of FPGA can be improved in terms of latency if more systolic array processing elements are implemented in parallel to allow more concurrency. Foundation of Computer Science 2012-08 Article PeerReviewed Mohamed, Izzeldin Ibrahim and Chin, Fatt Chay and Marsono, Muhammad Nadzir (2012) 32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors. International Journal of Computer Applications, 52 (11). pp. 17-23. ISSN 0975-8887 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
TK Electrical engineering. Electronics Nuclear engineering |
spellingShingle |
TK Electrical engineering. Electronics Nuclear engineering Mohamed, Izzeldin Ibrahim Chin, Fatt Chay Marsono, Muhammad Nadzir 32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors |
description |
Nowadays mobile devices represent a significant portion of the market for embedded systems, and are continuously demanded in daily life. From the end-user perspective size, weight, features are the key quality criteria. These benchmarks criteria became the usual design constraints in the embedded systems design process and put a high impact on the power consumption. This paper survey and explore different low power design techniques for FPGA and processors. We compare, evaluate, and analyze, the power and energy consumption in three different designs namely, Altera FPGA Cyclone II which has a systolic array matrix multiplication implemented, i5 Clarkdale, and Atom Pineview-D Intel general purpose processors, which multiply two nxn 32-bit matrices and produce a 64-bit matrix as an output. We concluded that FPGA is a more power and energy efficient on low matrix size. However, general purpose processor performance is close to FPGA on larger matrix size as the larger cache size in general purpose processor help in reducing latency. We also concluded that the performance of FPGA can be improved in terms of latency if more systolic array processing elements are implemented in parallel to allow more concurrency. |
format |
Article |
author |
Mohamed, Izzeldin Ibrahim Chin, Fatt Chay Marsono, Muhammad Nadzir |
author_facet |
Mohamed, Izzeldin Ibrahim Chin, Fatt Chay Marsono, Muhammad Nadzir |
author_sort |
Mohamed, Izzeldin Ibrahim |
title |
32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors |
title_short |
32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors |
title_full |
32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors |
title_fullStr |
32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors |
title_full_unstemmed |
32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors |
title_sort |
32-bit nxn matrix multiplication: performance evaluation for altera fpga, i5 clarkdale, and atom pineview-d intel general purpose processors |
publisher |
Foundation of Computer Science |
publishDate |
2012 |
url |
http://eprints.utm.my/id/eprint/30383/ |
_version_ |
1643648539803254784 |