Luiza Pozzobon, Marzieh Fadaee and Sara Hooker are qualified to endorse.

When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale

Max Marion: Is registered as an author of this paper.
Not currently an endorser.
Luiza Pozzobon: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CL, cs.LG.
Marzieh Fadaee: Is registered as an author of this paper.
Can endorse for cs.CL.
Sara Hooker: Is registered as an author of this paper.
Can endorse for cs.AI, cs.LG.

Ahmet Üstün and Alex Wang are not registered as owners of this paper.