Effective Utilisation of LLCs by Managing Associativity, Placement and Mapping

dc.contributor.authorDas, Shirshendu
dc.date.accessioned2016-06-23T11:20:35Z
dc.date.accessioned2023-10-20T04:36:44Z
dc.date.available2016-06-23T11:20:35Z
dc.date.available2023-10-20T04:36:44Z
dc.date.issued2016
dc.descriptionSupervisor: Hemangee K Kapooren_US
dc.description.abstractTiled based CMP (TCMP) has become the essential next generation scalable multicore architecture. The cores in TCMP commonly share a large sized Last Level Cache (LLC). NUCA is used in LLC to divide it into multiple banks such that each bank can be accessed independently. Static NUCA (SNUCA) has a fixed address mapping policy whereas dynamic NUCA (DNUCA) allows blocks to relocate nearer to the processing cores at runtime. It has been observed that the LLC of the current TCMP architectures is not utilised properly. Better cache utilisation will reduce the number of misses in the cache and hence can improve performance. The utilisation issue of LLC can be divided into two categories: (a) local utilisation and (b) global utilisation. The memory accesses within a bank are not distributed uniformly among the sets. Some sets are used heavily while some others remain idle. Such utilisation issue is termed as the local utilisation issue. It has also been observed that the banks of the LLC are not carrying equal loads during the execution. Some banks are loaded heavily while some other banks remain almost unused. Better load distribution among the banks may improve the utilisation factor of the cache. Such inter-bank utilisation issue is termed as the global utilisation issue. In this work we propose architectures to increase both the local and global utilisations of LLC for TCMP. Our first three proposals are for improving the local utilisation. We do this by allowing the heavily used set to use the idle ways of lightly used sets. Hence the associativity of each bank is managed dynamically. The three architectures we propose have different performance benefits and hardware requirements. To improve global utilisation we propose two DNUCA based TCMP architectures capable of distributing loads among multiple banks. Experimental evaluation using full-system simulations has validated our claim of performance enhancement. The improvements in local utilisation give better performance in the range of 6.3-13.5% and those in global utilisation give 6.1-13% performance enhancement.en_US
dc.identifier.otherROLL NO. 10610112
dc.identifier.urihttps://gyan.iitg.ac.in/handle/123456789/714
dc.language.isoenen_US
dc.relation.ispartofseriesTH-1467;
dc.subjectCOMPUTER SCIENCE AND ENGINEERINGen_US
dc.titleEffective Utilisation of LLCs by Managing Associativity, Placement and Mappingen_US
dc.typeThesisen_US
Files
Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
Abstract-1467_10610112.pdf
Size:
106.73 KB
Format:
Adobe Portable Document Format
Description:
Abstract
No Thumbnail Available
Name:
TH-1467_10610112.pdf
Size:
15.62 MB
Format:
Adobe Portable Document Format
Description:
Thesis
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: