Publications
Filters: First Letter Of Last Name is H [Clear All Filters]
"Plasticine: A Reconfigurable Architecture For Parallel Patterns",
ISCA '17: 44th International Symposium on Computer Architecture, Toronto, Canada, 06/2017.
Abstract
Download: paper (1.53 MB)
"TETRIS: Scalable and Efficient Neural Network Acceleration with 3D Memory",
The 22nd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Xi'an, China, 04/2017.
Download: paper (1.93 MB); slides (1.06 MB)
"Convolution Engine: Balancing Efficiency &\#38; Flexibility in Specialized Computing",
Proceedings of the 40th Annual International Symposium on Computer Architecture, New York, NY, USA, ACM, pp. 24–35, 2013.
Download: paper (4.78 MB)
"Locality-aware Task Management for Unstructured Parallelism: A Quantitative Limit Study",
Proceedings of the 25th ACM Symposium on Parallelism in Algorithms and Architectures, New York, NY, USA, ACM, pp. 315–325, 2013.
Download: paper (1.52 MB)
"Measuring and Analyzing the Energy Use of Enterprise Computing Systems",
Sustainable Computing: Informatics and Systems (SUSCOM), 2013.
"A case of system-level hardware/software co-design and co-verification of a commodity multi-processor system with custom hardware.",
CODES+ISSS: ACM, pp. 513-520, 2012.
Download: paper (402.69 KB)
"Green enterprise computing data: Assumptions and realities.",
IGCC: IEEE Computer Society, pp. 1-10, 2012.
Download: paper (331.4 KB)
"Towards Energy-proportional Datacenter Memory with Mobile DRAM",
Proceedings of the 39th Annual International Symposium on Computer Architecture, Washington, DC, USA, IEEE Computer Society, pp. 37–48, 2012.
Download: paper (5.08 MB)
"Hardware acceleration of transactional memory on commodity systems.",
ASPLOS: ACM, pp. 27-38, 2011.
Download: paper (1.22 MB)
"Understanding Sources of Ineffciency in General-purpose Chips",
Commun. ACM, vol. 54, no. 10, New York, NY, USA, ACM, pp. 85–93, 2011.
Download: paper (2.83 MB)
"EigenBench: A Simple Exploration Tool for Orthogonal TM Characteristics",
IEEE Intl. Symposium on Workload Characterization (IISWC), Atlanta, GA, 12/2010.
Download: paper (914.55 KB)
"FARM: A Prototyping Environment for Tightly-Coupled, Heterogeneous Architectures.",
FCCM: IEEE Computer Society, pp. 221-228, 2010.
Download: paper (1.05 MB)
"Implementing and evaluating nested parallel transactions in software transactional memory.",
SPAA: ACM, pp. 253-262, 2010.
Download: paper (449.46 KB)
"Understanding Sources of Inefficiency in General-purpose Chips",
Proceedings of the 37th Annual International Symposium on Computer Architecture, New York, NY, USA, ACM, pp. 37–47, 2010.
Download: paper (455.9 KB)
"The Stream Virtual Machine",
Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques (PACT), pp. 267–277, 9/2004.
"Transactional Memory Coherence and Consistency",
Proceedings of the 31st Annual International Symposium on Computer Architecture (ISCA), Munich, Germany, pp. 102–, 6/2004.