SSketch: An Automated Framework for Streaming Sketch-based Analysis of Big Data on FPGA

Title: SSketch: An Automated Framework for Streaming Sketch-based Analysis of Big Data on FPGA
Publication Type: Conference Paper
Year of Publication: 2015
Authors: Rouhani, B., E. Songhori, A. Mirhoseini, and F. Koushanfar
Conference Name: 23rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)
Date Published: May 2015
Keywords: Streaming model; Big data; Dense matrix; FPGA; Low-rank matrix; HW/SW co-design; Matrix sketching

This paper proposes SSketch, a novel automated computing framework for FPGA-based online analysis of big data with dense (non-sparse) correlation matrices. SSketch targets streaming applications where each data sample can be processed only once and storage is severely limited. SSketch uses the input stream to adaptively learn and update a corresponding ensemble of lower-dimensional data structures, also known as a sketch matrix. A new sketching methodology is introduced that recasts the problem of transforming big data with dense correlations into an ensemble of lower-dimensional subspaces, in a form suitable for acceleration on reconfigurable hardware. The new method is scalable, and it significantly reduces costly memory interactions and improves matrix computation performance by exploiting the coarse-grained parallelism present in the dataset. To facilitate automation, SSketch takes a HW/SW co-design approach: it provides an Application Programming Interface (API) that can be customized for rapid prototyping of an arbitrary matrix-based data analysis algorithm. Proof-of-concept evaluations on a variety of visual datasets with more than 11 million nonzeros demonstrate up to a 200-fold speedup for our hardware-accelerated realization of SSketch compared to a software-based deployment on a general-purpose processor.
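
The abstract does not spell out the sketching algorithm itself, so the following is only a minimal illustration of the streaming model it describes: each sample is seen once and only a small sketch matrix is kept in memory. It uses Frequent Directions, a well-known streaming sketch that is not the SSketch method. The function name `frequent_directions` and the parameter `ell` (number of sketch rows) are assumptions made for this example.

```python
import numpy as np


def frequent_directions(rows, ell):
    """Single-pass sketch of a row stream using Frequent Directions.

    NOTE: illustrative stand-in, not the SSketch algorithm.
    rows: iterable of 1-D arrays, each of length d (requires d >= ell).
    ell:  number of rows kept in the sketch (ell >= 2).
    Returns an (ell, d) matrix B such that B.T @ B approximates A.T @ A,
    where A is the matrix formed by stacking all streamed rows.
    """
    if ell < 2:
        raise ValueError("ell must be at least 2")
    sketch = None
    next_free = 0  # index of the next all-zero row in the sketch
    for row in rows:
        row = np.asarray(row, dtype=float)
        if sketch is None:
            if row.shape[0] < ell:
                raise ValueError("row length must be >= ell")
            sketch = np.zeros((ell, row.shape[0]))
        if next_free == ell:
            # Sketch is full: shrink all directions so that the weaker
            # half becomes zero, which frees rows for new samples.
            _, s, vt = np.linalg.svd(sketch, full_matrices=False)
            delta = s[ell // 2] ** 2
            shrunk = np.sqrt(np.maximum(s ** 2 - delta, 0.0))
            sketch = shrunk[:, None] * vt
            next_free = ell // 2
        sketch[next_free] = row  # each sample is touched exactly once
        next_free += 1
    return sketch


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    data = rng.standard_normal((200, 20))  # 200 streamed samples, d = 20
    B = frequent_directions(iter(data), ell=8)
    err = np.linalg.norm(data.T @ data - B.T @ B, 2)
    bound = 2 * np.linalg.norm(data, "fro") ** 2 / 8
    print(B.shape, err <= bound)
```

Memory use is fixed at `ell * d` values regardless of stream length, which is the property that makes sketches of this kind attractive when storage is severely limited. For this variant, the spectral-norm error of `B.T @ B` relative to `A.T @ A` is bounded by `2 * ||A||_F^2 / ell`.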

SSketch.pdf (670.5 KB)

