This project adapts online learning techniques to unstructured data, focusing on neural networks. We posit a non-parametric multi-armed bandit algorithm with context that uses a neural network for its underlying function approximation. We theoretically analyze and prove guarantees on the regret under this algorithm…Read More