Skip to main navigation Skip to search Skip to main content

Competitive pricing using model-based bandits

Lukasz Sliwinski, Tanut Treetanthiploet, David Siska, Lukasz Szpruch

Research output: Contribution to journalArticlepeer-review

Abstract

The use of learning algorithms for automatic price adjustments in markets is on the rise. However, these algorithms often assume that reward distributions for actions are uncorrelated and stationary, a condition that does not hold in competitive pricing environments.

In this paper, we introduce a pricing environment, find conditions under which a unique Nash equilibrium exists and verify the assumptions numerically. Then, we propose a bandit algorithm that approximates the structure of the environment and extend it to accommodate non-stationary settings. We perform numerical tests in both stationary and competitive pricing environments, analysing the potential benefits and drawbacks of incorporating the structure of the environment within learning algorithms. While modelling the stationary environment improves the algorithm's performance in a stationary setting, it does not offer an advantage in pricing competitions between non-stationary learning agents.
Original languageEnglish
Pages (from-to)4813-4867
JournalComputational Economics
Volume66
Issue number6
Early online date4 Feb 2025
DOIs
Publication statusPublished - 31 Dec 2025

Fingerprint

Dive into the research topics of 'Competitive pricing using model-based bandits'. Together they form a unique fingerprint.

Cite this