We gratefully acknowledge support from
the Simons Foundation and member institutions.

Gengyuan Zhang is qualified to endorse.

Localizing Events in Videos with Multimodal Queries

Gengyuan Zhang: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CV, cs.IR, cs.LG. (why?)

Mang Ling Ada Fok, Yan Xia, Yansong Tang, Daniel Cremers, Philip Torr, Volker Tresp and Jindong Gu are not registered as owners of this paper. (why?)