Data from: Risk-aware multi-armed bandit problem with application to portfolio selection

Huo, Xiaoguang; Fu, Feng

Published Oct 17, 2017 on Dryad. https://doi.org/10.5061/dryad.h628h

Data files

Oct 17, 2017 version files (87.02 KB total)

Data_and_codes.zip (86.77 KB)
README_for_Data_and_codes.txt (248 B)

Abstract

Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential decision making under uncertainty, namely the exploration versus exploitation dilemma, and therefore provides a natural connection to portfolio selection. In this paper, we incorporate risk-awareness into the classic multi-armed bandit setting and introduce an algorithm to construct portfolios. By filtering assets based on the topological structure of the financial market and combining the optimal multi-armed bandit policy with the minimization of a coherent risk measure, we achieve a balance between risk and return.
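The exploration versus exploitation dilemma mentioned in the abstract can be illustrated with a minimal sketch of UCB1, a standard multi-armed bandit policy. This is not the authors' risk-aware algorithm and is not taken from Data_and_codes.zip; the function name `ucb1`, the Bernoulli reward model, and the arm means below are hypothetical choices made only for illustration.

```python
import math
import random


def ucb1(arm_means, horizon, seed=0):
    """Run UCB1 on hypothetical Bernoulli arms; return pulls per arm."""
    rng = random.Random(seed)
    n_arms = len(arm_means)
    counts = [0] * n_arms    # number of times each arm was pulled
    totals = [0.0] * n_arms  # summed reward collected from each arm

    for t in range(1, horizon + 1):
        if t <= n_arms:
            # Pull every arm once so each empirical mean is defined.
            arm = t - 1
        else:
            # Empirical mean (exploitation) plus a confidence bonus
            # that shrinks as an arm is sampled more (exploration).
            scores = [
                totals[i] / counts[i]
                + math.sqrt(2.0 * math.log(t) / counts[i])
                for i in range(n_arms)
            ]
            arm = max(range(n_arms), key=scores.__getitem__)

        # Assumed reward model: 1 with probability arm_means[arm], else 0.
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        totals[arm] += reward

    return counts


# Illustrative arm means; in a portfolio setting each arm would be an asset.
counts = ucb1([0.2, 0.5, 0.8], horizon=5000)
```

In this sketch the policy maximizes expected reward only. A risk-aware variant along the lines the abstract describes would additionally account for a risk measure when choosing among arms.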

Usage notes

Data and code.
