Skip to main content

Showing 1–8 of 8 results for author: Khosravi, K

  1. arXiv:2307.11655  [pdf, other

    cs.LG cs.AI cs.GT

    Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms

    Authors: Khashayar Khosravi, Renato Paes Leme, Chara Podimata, Apostolis Tsorvantzis

    Abstract: We propose a model for learning with bandit feedback while accounting for deterministically evolving and unobservable states that we call Bandits with Deterministically Evolving States ($B$-$DES$). The workhorse applications of our model are learning for recommendation systems and learning for online ads. In both cases, the reward that the algorithm obtains at each round is a function of the short… ▽ More

    Submitted 19 February, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

  2. arXiv:2102.13028  [pdf, other

    cs.LG stat.ML

    Batched Neural Bandits

    Authors: Quanquan Gu, Amin Karbasi, Khashayar Khosravi, Vahab Mirrokni, Dongruo Zhou

    Abstract: In many sequential decision-making problems, the individuals are split into several batches and the decision-maker is only allowed to change her policy at the end of batches. These batch problems have a large number of applications, ranging from clinical trials to crowdsourcing. Motivated by this, we study the stochastic contextual bandit problem for general reward distributions under the batched… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: 21 pages, 7 figures

  3. arXiv:2002.10121  [pdf, other

    cs.LG stat.ML

    The Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms

    Authors: Mohsen Bayati, Nima Hamidi, Ramesh Johari, Khashayar Khosravi

    Abstract: We investigate a Bayesian $k$-armed bandit problem in the \emph{many-armed} regime, where $k \geq \sqrt{T}$ and $T$ represents the time horizon. Initially, and aligned with recent literature on many-armed bandit problems, we observe that subsampling plays a key role in designing optimal algorithms; the conventional UCB algorithm is sub-optimal, whereas a subsampled UCB (SS-UCB), which selects… ▽ More

    Submitted 20 March, 2024; v1 submitted 24 February, 2020; originally announced February 2020.

  4. arXiv:2001.01558  [pdf

    physics.flu-dyn cs.LG stat.ML

    Shear Stress Distribution Prediction in Symmetric Compound Channels Using Data Mining and Machine Learning Models

    Authors: Zohreh Sheikh Khozani, Khabat Khosravi, Mohammadamin Torabi, Amir Mosavi, Bahram Rezaei, Timon Rabczuk

    Abstract: Shear stress distribution prediction in open channels is of utmost importance in hydraulic structural engineering as it directly affects the design of stable channels. In this study, at first, a series of experimental tests were conducted to assess the shear stress distribution in prismatic compound channels. The shear stress values around the whole wetted perimeter were measured in the compound c… ▽ More

    Submitted 20 December, 2019; originally announced January 2020.

    Comments: 29 pages, 6 figures

    MSC Class: 68T05

  5. arXiv:1901.03719  [pdf, other

    cs.LG econ.EM math.ST stat.ML

    Non-Parametric Inference Adaptive to Intrinsic Dimension

    Authors: Khashayar Khosravi, Greg Lewis, Vasilis Syrgkanis

    Abstract: We consider non-parametric estimation and inference of conditional moment models in high dimensions. We show that even when the dimension $D$ of the conditioning variable is larger than the sample size $n$, estimation and inference is feasible as long as the distribution of the conditioning variable has small intrinsic dimension $d$, as measured by locally low doubling measures. Our estimation is… ▽ More

    Submitted 17 June, 2019; v1 submitted 11 January, 2019; originally announced January 2019.

  6. arXiv:1704.09011  [pdf, other

    stat.ML cs.LG

    Mostly Exploration-Free Algorithms for Contextual Bandits

    Authors: Hamsa Bastani, Mohsen Bayati, Khashayar Khosravi

    Abstract: The contextual bandit literature has traditionally focused on algorithms that address the exploration-exploitation tradeoff. In particular, greedy algorithms that exploit current estimates without any exploration may be sub-optimal in general. However, exploration-free greedy algorithms are desirable in practical settings where exploration may be costly or unethical (e.g., clinical trials). Surpri… ▽ More

    Submitted 18 April, 2020; v1 submitted 28 April, 2017; originally announced April 2017.

    Comments: 62 Pages, 7 Figures

  7. arXiv:1611.01462  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling

    Authors: Hakan Inan, Khashayar Khosravi, Richard Socher

    Abstract: Recurrent neural networks have been very successful at predicting sequences of words in tasks such as language modeling. However, all such models are based on the conventional classification framework, where the model is trained against one-hot targets, and each word is represented both as an input and as an output in isolation. This causes inefficiencies in learning both in terms of utilizing all… ▽ More

    Submitted 11 March, 2017; v1 submitted 4 November, 2016; originally announced November 2016.

  8. arXiv:1603.00126  [pdf, ps, other

    math.ST cs.IT

    Multiclass Classification, Information, Divergence, and Surrogate Risk

    Authors: John C. Duchi, Khashayar Khosravi, Feng Ruan

    Abstract: We provide a unifying view of statistical information measures, multi-way Bayesian hypothesis testing, loss functions for multi-class classification problems, and multi-distribution $f$-divergences, elaborating equivalence results between all of these objects, and extending existing results for binary outcome spaces to more general ones. We consider a generalization of $f$-divergences to multiple… ▽ More

    Submitted 10 September, 2017; v1 submitted 29 February, 2016; originally announced March 2016.