Contextual Multi-Armed Bandit with Costly Feature Observation in Non-stationary Environments
Published in IEEE Open Journal of Signal Processing, 2023
We study contextual multi-armed bandit problem where the features are costly and the agent has to simultaneously learn the reward distributions and the feature importances. The environment undergoes distribution shifts making the problem more challenging.
Download here