Skip to main content

Showing 1–1 of 1 results for author: Maggiotti, G

  1. arXiv:2312.06881  [pdf, other

    cs.LG cs.CL

    DYAD: A Descriptive Yet Abjuring Density efficient approximation to linear neural network layers

    Authors: Sarin Chandy, Varun Gangal, Yi Yang, Gabriel Maggiotti

    Abstract: We devise, implement and performance-asses DYAD, a layer which can serve as a faster and more memory-efficient approximate replacement for linear layers, (nn.Linear() in Pytorch). These layers appear in common subcomponents, such as in the ff module of Transformers. DYAD is based on a bespoke near-sparse matrix structure which approximates the dense "weight" matrix W that matrix-multiplies the inp… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted at WANT workshop at NeurIPS 2023; code at https://github.com/asappresearch/dyad