Skip to main content

Showing 1–1 of 1 results for author: Kovačević, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.08637  [pdf, other

    cs.LG cs.RO q-fin.TR

    Robot See, Robot Do: Imitation Reward for Noisy Financial Environments

    Authors: Sven Goluža, Tomislav Kovačević, Stjepan Begušić, Zvonko Kostanjčar

    Abstract: The sequential nature of decision-making in financial asset trading aligns naturally with the reinforcement learning (RL) framework, making RL a common approach in this domain. However, the low signal-to-noise ratio in financial markets results in noisy estimates of environment components, including the reward function, which hinders effective policy learning by RL agents. Given the critical impor… ▽ More

    Submitted 13 November, 2024; originally announced November 2024.