Showing 1–16 of 16 results for author: Ebert, J

  1. arXiv:2406.08767  [pdf, ps, other]

    cs.IT

    Coding for the unsourced B-channel with erasures: enhancing the linked loop code

    Authors: William W. Zheng, Jamison R. Ebert, Stefano Rini, Jean-Francois Chamberland

    Abstract: In [1], the linked loop code (LLC) is presented as a promising code for the unsourced A-channel with erasures (UACE). The UACE is an unsourced multiple access channel in which active users' transmitted symbols are erased with a given probability and the channel output is obtained as the union of the non-erased symbols. In this paper, we extend the UACE channel model to the unsourced B-channel with…

    Submitted 20 May, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures, accepted by ICASSP 2024

  2. arXiv:2402.13703  [pdf, other]

    cs.CL

    Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions?

    Authors: Alexander Arno Weber, Klaudia Thellmann, Jan Ebert, Nicolas Flores-Herr, Jens Lehmann, Michael Fromm, Mehdi Ali

    Abstract: The adaption of multilingual pre-trained Large Language Models (LLMs) into eloquent and helpful assistants is essential to facilitate their use across different language regions. In that spirit, we are the first to conduct an extensive study of the performance of multilingual models on parallel, multi-turn instruction-tuning benchmarks across a selection of the most-spoken Indo-European languages.…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 22 pages, 7 figures

  3. arXiv:2402.06881  [pdf, ps, other]

    cs.IT eess.SP

    Multi-User SR-LDPC Codes via Coded Demixing with Applications to Cell-Free Systems

    Authors: Jamison R. Ebert, Jean-Francois Chamberland, Krishna R. Narayanan

    Abstract: Novel sparse regression LDPC (SR-LDPC) codes exhibit excellent performance over additive white Gaussian noise (AWGN) channels in part due to their natural provision of shaping gains. Though SR-LDPC-like codes have been considered within the context of single-user error correction and massive random access, they are yet to be examined as candidates for coordinated multi-user communication scenarios…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: Submitted to ISIT 2024

  4. arXiv:2312.02160  [pdf, ps, other]

    cs.IT cs.MM

    Coding for the unsourced A-channel with erasures: the linked loop code

    Authors: William W. Zheng, Jamison R. Ebert, Stefano Rini, Jean-Francois Chamberland

    Abstract: The A-channel is a noiseless multiple access channel in which users simultaneously transmit Q-ary symbols and the receiver observes the set of transmitted symbols, but not their multiplicities. An A-channel is said to be unsourced if, additionally, users' transmissions are encoded across time using a common codebook and decoding of the transmitted messages is done without regard to the identities o…

    Submitted 19 September, 2023; originally announced December 2023.

    Comments: 5 pages, 4 figures, to be published in the 31st European Signal Processing Conference, EUSIPCO 2023

  5. arXiv:2311.07720  [pdf, ps, other]

    cs.IT

    Sparse Regression LDPC Codes

    Authors: Jamison R. Ebert, Jean-Francois Chamberland, Krishna R. Narayanan

    Abstract: This article introduces a novel concatenated coding scheme called sparse regression LDPC (SR-LDPC) codes. An SR-LDPC code consists of an outer non-binary LDPC code and an inner sparse regression code (SPARC) whose respective field size and section sizes are equal. For such codes, an efficient decoding algorithm is proposed based on approximate message passing (AMP) that dynamically shares soft inf…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. arXiv admin note: substantial text overlap with arXiv:2301.01899

  6. arXiv:2310.08754  [pdf, other]

    cs.LG

    Tokenizer Choice For LLM Training: Negligible or Crucial?

    Authors: Mehdi Ali, Michael Fromm, Klaudia Thellmann, Richard Rutmann, Max Lübbering, Johannes Leveling, Katrin Klug, Jan Ebert, Niclas Doll, Jasper Schulze Buschhoff, Charvi Jain, Alexander Arno Weber, Lena Jurkschat, Hammam Abdelwahab, Chelsea John, Pedro Ortiz Suarez, Malte Ostendorff, Samuel Weinbach, Rafet Sifa, Stefan Kesselheim, Nicolas Flores-Herr

    Abstract: The recent success of Large Language Models (LLMs) has been predominantly driven by curating the training dataset composition, scaling of model architectures and dataset sizes and advancements in pretraining objectives, leaving tokenizer influence as a blind spot. Shedding light on this underexplored area, we conduct a comprehensive study on the influence of tokenizer choice on LLM downstream perf…

    Submitted 17 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  7. arXiv:2308.12312  [pdf, other]

    physics.comp-ph cs.AI physics.plasm-ph

    Physics informed Neural Networks applied to the description of wave-particle resonance in kinetic simulations of fusion plasmas

    Authors: Jai Kumar, David Zarzoso, Virginie Grandgirard, Jan Ebert, Stefan Kesselheim

    Abstract: The Vlasov-Poisson system is employed in its reduced form version (1D1V) as a test bed for the applicability of Physics Informed Neural Network (PINN) to the wave-particle resonance. Two examples are explored: the Landau damping and the bump-on-tail instability. PINN is first tested as a compression method for the solution of the Vlasov-Poisson system and compared to the standard neural networks.…

    Submitted 23 August, 2023; originally announced August 2023.

  8. arXiv:2305.06161  [pdf, other]

    cs.CL cs.AI cs.PL cs.SE

    StarCoder: may the source be with you!

    Authors: Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu, et al. (42 additional authors not shown)

    Abstract: The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large colle…

    Submitted 13 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  9. arXiv:2301.01899  [pdf, ps, other]

    cs.IT

    On Sparse Regression LDPC Codes

    Authors: Jamison R. Ebert, Jean-Francois Chamberland, Krishna R. Narayanan

    Abstract: Belief propagation applied to iterative decoding and sparse recovery through approximate message passing (AMP) are two research areas that have seen monumental progress in recent decades. Inspired by these advances, this article introduces sparse regression LDPC codes and their decoding. Sparse regression codes (SPARCs) are a class of error correcting codes that build on ideas from compressed sens…

    Submitted 4 January, 2023; originally announced January 2023.

  10. arXiv:2209.05466  [pdf, ps, other]

    cs.CY cs.AI cs.LG

    Hearts Gym: Learning Reinforcement Learning as a Team Event

    Authors: Jan Ebert, Danimir T. Doncevic, Ramona Kloß, Stefan Kesselheim

    Abstract: Amidst the COVID-19 pandemic, the authors of this paper organized a Reinforcement Learning (RL) course for a graduate school in the field of data science. We describe the strategy and materials for creating an exciting learning experience despite the ubiquitous Zoom fatigue and evaluate the course qualitatively. The key organizational features are a focus on a competitive hands-on setting in teams…

    Submitted 7 September, 2022; originally announced September 2022.

  11. arXiv:2206.01684  [pdf, ps, other]

    cs.IT

    HashBeam: Enabling Feedback Through Downlink Beamforming in Unsourced Random Access

    Authors: Jamison R. Ebert, Krishna R. Narayanan, Jean-Francois Chamberland

    Abstract: Unsourced random access (URA) has emerged as a candidate paradigm for massive machine-type communication (MTC) in next-generation wireless networks. While many excellent uplink schemes have been developed for URA, these schemes do not specify a mechanism for providing feedback regarding whether a user's message was successfully decoded. While this may be acceptable in some MTC scenarios, the lack…

    Submitted 3 June, 2022; originally announced June 2022.

  12. Coded Demixing for Unsourced Random Access

    Authors: Jamison R. Ebert, Vamsi K. Amalladinne, Stefano Rini, Jean-Francois Chamberland, Krishna R. Narayanan

    Abstract: Unsourced random access (URA) is a recently proposed multiple access paradigm tailored to the uplink channel of machine-type communication networks. By exploiting a strong connection between URA and compressed sensing, the massive multiple access problem may be cast as a compressed sensing (CS) problem, albeit one in exceedingly large dimensions. To efficiently handle the dimensionality of the pro…

    Submitted 27 June, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: 1053-587X Copyright 2022 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See https://www.ieee.org/publications/rights/index.html for more information

    Journal ref: IEEE Transactions on Signal Processing, vol. 70, pp. 2972-2984, 2022

  13. arXiv:2112.00270  [pdf, ps, other]

    cs.IT

    An Enhanced Decoding Algorithm for Coded Compressed Sensing with Applications to Unsourced Random Access

    Authors: Vamsi K. Amalladinne, Jamison R. Ebert, Jean-Francois Chamberland, Krishna R. Narayanan

    Abstract: Unsourced random access (URA) has emerged as a pragmatic framework for next-generation distributed sensor networks. Within URA, concatenated coding structures are often employed to ensure that the central base station can accurately recover the set of sent codewords during a given transmission period. Many URA algorithms employ independent inner and outer decoders, which can help reduce computatio…

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: Submitted to MDPI Sensors

  14. arXiv:2108.11976  [pdf, other]

    cs.DC cs.LG

    JUWELS Booster -- A Supercomputer for Large-Scale AI Research

    Authors: Stefan Kesselheim, Andreas Herten, Kai Krajsek, Jan Ebert, Jenia Jitsev, Mehdi Cherti, Michael Langguth, Bing Gong, Scarlet Stadtler, Amirpasha Mozaffari, Gabriele Cavallaro, Rocco Sedona, Alexander Schug, Alexandre Strube, Roshni Kamath, Martin G. Schultz, Morris Riedel, Thomas Lippert

    Abstract: In this article, we present JUWELS Booster, a recently commissioned high-performance computing system at the Jülich Supercomputing Center. With its system architecture, most importantly its large number of powerful Graphics Processing Units (GPUs) and its fast interconnect via InfiniBand, it is an ideal machine for large-scale Artificial Intelligence (AI) research and applications. We detail its s…

    Submitted 30 June, 2021; originally announced August 2021.

    Comments: 12 pages, 5 figures. Accepted at ISC 2021, Workshop Deep Learning on Supercomputers. This is a duplicate submission as my previous submission is on hold for several weeks now and my attempts to contact the moderators failed

    Report number: 1234567Dummy

  15. arXiv:2104.05686  [pdf, ps, other]

    cs.IT

    Stochastic Binning and Coded Demixing for Unsourced Random Access

    Authors: Jamison R. Ebert, Vamsi K. Amalladinne, Stefano Rini, Jean-Francois Chamberland, Krishna R. Narayanan

    Abstract: Unsourced random access is a novel communication paradigm designed for handling a large number of uncoordinated users that sporadically transmit very short messages. Under this model, coded compressed sensing (CCS) has emerged as a low-complexity scheme that exhibits good error performance. Yet, one of the challenges faced by CCS pertains to disentangling a large number of codewords present on a s…

    Submitted 21 July, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Submitted to IEEE-SPAWC 2021

  16. arXiv:2010.11380  [pdf, ps, other]

    cs.IT

    A Hybrid Approach to Coded Compressed Sensing where Coupling Takes Place via the Outer Code

    Authors: Jamison R. Ebert, Vamsi K. Amalladinne, Jean-Francois Chamberland, Krishna R. Narayanan

    Abstract: This article seeks to advance coded compressed sensing (CCS) as a practical scheme for unsourced random access. The original CCS algorithm features a concatenated structure where an inner code is tasked with support recovery, and an outer tree code conducts message disambiguation. Recently, a link between CCS and sparse regression codes was established, leading to the application of approximate me…

    Submitted 21 October, 2020; originally announced October 2020.