Dynamic Assortment Planning Under Nested Logit Models

Abstract

We study a stylized dynamic assortment planning problem during a selling season of finite length T. At each time period, the seller offers an arriving customer an assortment of substitutable products and the customer makes the purchase among offered products according to a discrete choice model. The goal of the seller is to maximize the expected revenue, or equivalently, to minimize the worst‐case expected regret. One key challenge is that utilities of products are unknown to the seller and need to be learned. Although the dynamic assortment planning problem has received increasing attention in revenue management, most existing work is based on the multinomial logit choice models (MNL). In this paper, we study the problem of dynamic assortment planning under a more general choice model—the nested logit model, which models hierarchical choice behavior and is “the most widely used member of the GEV (generalized extreme value) family” (Train 2009). By leveraging the revenue‐ordered structure of the optimal assortment within each nest, we develop a novel upper confidence bound (UCB) policy with an aggregated estimation scheme. Our policy simultaneously learns customers’ choice behavior and makes dynamic decisions on assortments based on the current knowledge. It achieves the accumulated regret at the order of $\tilde{O} (\sqrt{MNT})$ , where M is the number of nests and N is the number of products in each nest. We further provide a lower bound result of $Ω (\sqrt{MT})$ , which shows the near optimality of the upper bound when T is much larger than M and N. When the number of items per nest N is large, we further provide a discretization heuristic for better performance of our algorithm. Numerical results are presented to demonstrate the empirical performance of our proposed algorithms.

Keywords

dynamic assortment optimization nested logit models regret analysis upper confidence bound

Get full access to this article

View all access options for this article.

References

Agrawal

Avandhanula

Goyal

Zeevi

. 2017. Thompson sampling for MNL‐bandit. Proccedings of the Conference on Learning Theory (COLT).

Agrawal

Avadhanula

Goyal

Zeevi

. 2019. MNL‐bandit: A dynamic learning approach to assortment selection. Oper. Res. 67(5): 1453–1485.

Bertsimas

Mišić

V. V.

. 2019 Exact first‐choice product line optimization. Oper. Res. 67(3): 559–904.

Besbes

Saure

. 2016. Product assortment and price competition under multinomial logit demand. Prod. Oper. Manag. 25(1): 114–127.

Blanchet

Gallego

Goyal

. 2016. A markov chain approximation to choice modeling. Oper. Res. 64(4): 886–905.

Borch‐Supan

1990. On the compatibility of nested logit models with utility maximization. J. Econom. 43(3): 373–388.

Bront

J. J. M.

Méndez‐Díaz

Vulcano

. 2009. A column generation algorithm for choice‐based network revenue management. Oper. Res. 57(3): 769–784.

Caro

Gallien

. 2007. Dynamic assortment with demand learning for seasonal consumer goods. Management Sci. 53(2): 276–292.

Chen

Wang

. 2018. A note on tight lower bound for MNL‐bandit assortment selection models. Oper. Res. Lett. 46(5): 534–537.

10.

Chen

Wang

Zhou

. 2018. Dynamic assortment optimization with changing contextual information. J Mach Learn Res.

11.

Chen

Simchi‐Levi

Xin

. 2019. Assortment Planning for Recommendations at Checkout Under Inventory Constraints. Available at https://papers.ssrn.com/sol3/papers.cfm?abstract_ #id=2853093 (accessed date October 20, 2020).

12.

Cheung

W. C.

Simchi‐Levi

. 2017. Thompson sampling for online personalized assortment optimization problems with multinomial logit choice models. Available at https://papers.ssrn.com>abstract_#id=3075658 (accessed date October 20, 2020).

13.

Chung

Ahn

H. S.

Jasin

. 2019. (Rescaled) multi‐attempt approximation of choice model and its application to assortment optimization. Prod. Ooper. Manag. 28(2): 341–353.

14.

Csiszar

Körner

. 2011. Information Theory: Coding Theorems for Discrete Memoryless Systems. Cambridge University Press, Cambridge.

15.

Davis

J. M.

Gallego

Topaloglu

. 2014. Assortment optimization under variants of the nested logit model. Oper. Res. 62(2): 250–273.

16.

Désir

Goyal

Jagabathula

Segev

. 2016. Assortment optimization under the Mallows model. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS).

17.

Désir

Goyal

Segev

. 2020. Capacity constrained assortment optimization under the markov chain‐based choice model. Management Sci. 66(2): 698–721.

18.

Farias

V. F.

Jagabathula

Shah

. 2013. A nonparametric approach to modeling choice with limited data. Management Sci. 59(2): 305–322.

19.

Gallego

Topaloglu

. 2014. Constrained assortment optimization for the nested logit model. Management Sci. 60(10): 2583–2601.

20.

Gallego

Iyengar

Phillips

Dubey

. 2004. Managing flexible products on a network. Technical Report CORC TR‐2004-01, Department of Industrial Engineering and Operations Research, Columbia University.

21.

Golrezaei

Nazerzadeh

Rusmevichientong

. 2014. Real‐time optimization of personalized assortments. Management Sci. 60(6): 1532–1551.

22.

Kök

A. G.

. 2011. Optimal and competitive assortments with endogenous pricing under hierarchical consumer choice models. Management Sci. 57(9): 1546–1563.

23.

Rusmevichientong

. 2014. A greedy algorithm for the two‐level nested logit model. Oper. Res. Lett. 42(5): 319–324.

24.

Rusmevichientong

Topaloglu

. 2015. The d‐level nested logit model: Assortment and price optimization problems. Oper. Res. 63(2): 325–342.

25.

Liu

van Ryzin

. 2008. On the choice‐based linear programming model for network revenue management. Manufact. Serv. Oper. Manag. 10(2): 288–310.

26.

Mahajan

van Ryzin

. 2001. Stocking retail assortments under dynamic consumer substitution. Oper. Res. 49: 334–351.

27.

McFadden

1974. Conditional logit analysis of qualitative choice behavior. Frontiers in Econometrics (Academic Press).

28.

McFadden

1980. Econometric models for probabilistic choice among products. J. Bus. 53(3): 13–29.

29.

Megiddo

1978. Combinatorial optimization with rational objective functions. Proceedings of the annual ACM symposium on Theory of computing (STOC).

30.

Méndez‐Díaz

Miranda‐Bront

J. J.

Vulcano

Zabala

. 2014. A branch‐and-cut algorithm for the latentclass logit assortment problem. Discrete Appl. Math. 164: 246–263.

31.

Miao

S. T.

Chao

X. L.

. 2018. Dynamic joint assortment and pricing optimization with demand learning. Technical report, University of Michigan, Ann Arbor.

32.

Rusmevichientong

Topaloglu

. 2012. Robust assortment optimization in revenue management under the multinomial logit choice model. Oper. Res. 60(4): 865–882.

33.

Rusmevichientong

Shen

Z. J.

Shmoys

. 2010. Dynamic assortment optimization with a multinomial logit choice model and capacity constraint. Oper. Res. 58(6): 1666–1680.

34.

Rusmevichientong

Shmoys

Tong

Topaloglu

. 2014. Assortment optimization under the multinomial logit model with random choice parameters. Prod. Oper. Manag. 23(11): 2023–2039.

35.

van Ryzin

Mahajan

. 1999. On the relationships between inventory costs and variety benefits in retail assortments. Management Sci. 45(11): 1496–1509.

36.

Saure

Zeevi

. 2013. Optimal dynamic assortment planning with demand learning. Manufact. Serv. Oper. Manag. 15(3): 387–404.

37.

Talluri

van Ryzin

. 2004. Revenue management under a general discrete choice model of consumer behavior. Management Sci. 50(1): 15–33.

38.

Train

2009. Discrete Choice Methods with Simulation. Cambridge University Press, Cambridge, 2nd edn.

39.

Tsybakov

A. B.

2009. Introduction to Nonparametric Estimation Springer Series in Statistics. Springer, New York.

40.

Wang

2012. Capacitated assortment and price optimization under the multinomial logit choice model. Oper. Res. Lett. 40(6): 492–497.

41.

Wang

2013. Assortment management under the generalized attraction model with a capacity constraint. J. Rev. Pric. Manag. 12(3): 254–270.

42.

Wang

Chen

Zhou

. 2018. Near‐optimal policies for dynamic multinomial logit assortment selection models. Proceedings of Advances in Neural Information Processing Systems (NeurIPS).

43.

Williams

H. C. W. L.

1977. On the formation of travel demand models and economic evaluation measures of user benefit. Environ. Plan. A 9, 285–344.

44.

Zhang

Rusmevichientong

Topaloglu

. 2020. Assortment optimization under the paired combinatorial logit model. Oper. Res. 68(3): 741–761.