BEGIN:VCALENDAR
PRODID:-//Microsoft Corporation//Outlook MIMEDIR//EN
VERSION:1.0
BEGIN:VEVENT
DTSTART:20131121T174500Z
DTEND:20131121T180000Z
LOCATION:601/603
DESCRIPTION;ENCODING=QUOTED-PRINTABLE:ABSTRACT: We show the design of specialized compute fabrics that maintain the efficiency of full custom hardware while providing the flexibility to execute a whole class of coarse-grain linear algebra and FFT operations.=0AWe have designed a specialized linear algebra processor (LAP) that can perform level-3 BLAS, more complex LAPACK-level operations like Cholesky, LU (with partial pivoting), and QR factorizations, and even FFT operations.=0AWe present a power-performance model that compares state-of-the-art CPUs and GPUs with our design. Our power model reveals sources of inefficiency in CPUs and GPUs, and our LAP design demonstrates how to overcome them. Compared to conventional architectures for linear algebra applications, LAP is orders of magnitude more power efficient. Based on our estimates, up to 25 GFLOPS/W double-precision efficiency is achievable on a single chip in standard 45nm technology.
SUMMARY:Algorithm/Architecture Codesign of Low Power and High Performance Linear Algebra Compute Fabrics
PRIORITY:3
END:VEVENT
END:VCALENDAR