| Article ID: | iaor200935003 |
| Country: | United Kingdom |
| Volume: | 3 |
| Issue: | 4 |
| Start Page Number: | 320 |
| End Page Number: | 336 |
| Publication Date: | Jan 2005 |
| Journal: | Journal of Revenue and Pricing Management |
| Authors: | Carvalho Alexandre X, Puterman Martin L |
| Keywords: | learning |
This paper considers the problem of setting prices dynamically to maximise expected revenues in a finite horizon model in which the demand parameters are unknown. At each decision epoch, the manager chooses a price and observes a binary response (buy or not) for each consumer visiting the website during that period. This paper focuses on comparing several easy to implement good pricing policies. A Taylor series expansion of the future reward function explicity illustrates the trade–off between short–term revenue maximisation and future information gains and suggests a pricing policy referred to as a one–step look ahead rule. A Monte Carlo study compares several different pricing strategies and shows that the one–step look ahead rule dominates other policies and produces good short term performance.