Product Choice with Large Assortments: A Scalable Deep-Learning Model

2021 ◽  
Author(s):  
Sebastian Gabel ◽  
Artem Timoshenko

Personalized marketing in retail requires a model to predict how different marketing actions affect product choices by individual customers. Large retailers often handle millions of transactions daily, involving thousands of products in hundreds of categories. Product choice models thus need to scale to large product assortments and customer bases, without extensive product attribute information. To address these challenges, we propose a custom deep neural network model. The model incorporates bottleneck layers to encode cross-product relationships, calibrates time-series filters to capture purchase dynamics for products with different interpurchase times, and relies on weight sharing between the products to improve convergence and scale to large assortments. The model applies to loyalty card transaction data without predefined categories or product attributes to predict customer-specific purchase probabilities in response to marketing actions. In a simulation, the proposed product choice model predicts purchase decisions better than baseline methods by adjusting the predicted probabilities for the effects of recent purchases and price discounts. The improved predictions lead to substantially higher revenue gains in a simulated coupon personalization problem. We verify predictive performance using transaction data from a large retailer with experimental variation in price discounts. This paper was accepted by Gui Liberali, Management Science Special Issue on Data-Driven Prescriptive Analytics.

Author(s):  
Andrew Stuntz ◽  
John Attanucci ◽  
Frederick P. Salvucci

Customer fare product choices can affect both ridership and revenue, so they are strategically important for transit agencies. Nearly all major agencies offer choices between pay-per-use and pass products, and with each potential fare change, agencies face decisions about whether to modify pass “multiples”—the number of rides needed to “break even” on a pass purchase. However, the simple elasticity spreadsheet models often used to analyze the potential ridership and revenue impacts of fare changes make little or no adjustment for shifts in fare product choices. This paper reviews different options for incorporating product choice into fare policy scenario models, and it presents a ridership and revenue prediction procedure that combines a multinomial logit fare product choice model with the logic of an elasticity spreadsheet model. This combination facilitates evaluation of complex fare changes that are likely to alter fare product market shares while maintaining much of the flexibility and simplicity of a traditional spreadsheet model. Additionally, the proposed model uses only preexisting, revealed-preference automated fare collection data rather than requiring customer surveys. The proposed model is demonstrated using examples at the Chicago Transit Authority (CTA). The CTA experienced a large shift from passes to pay-per-use following a fare change in 2013, illustrating the potential value of accounting for fare product choices in fare scenario evaluation.


Author(s):  
Hong Xie ◽  
Yongkun Li ◽  
John C. S. Lui

Feedback-based reputation systems are widely deployed in E-commerce systems. Evidences showed that earning a reputable label (for sellers of such systems) may take a substantial amount of time and this implies a reduction of profit. We propose to enhance sellers’ reputation via price discounts. However, the challenges are: (1) The demands from buyers depend on both the discount and reputation; (2) The demands are unknown to the seller. To address these challenges, we first formulate a profit maximization problem via a semiMarkov decision process (SMDP) to explore the optimal trade-offs in selecting price discounts. We prove the monotonicity of the optimal profit and optimal discount. Based on the monotonicity, we design a QLFP (Q-learning with forward projection) algorithm, which infers the optimal discount from historical transaction data. We conduct experiments on a dataset from to show that our QLFP algorithm improves the profit by as high as 50% over both the classical Q-learning and speedy Q-learning algorithm. Our QLFP algorithm also improves the profit by as high as four times over the case of not providing any price discount.


2016 ◽  
Vol 16 (2) ◽  
pp. 785-805 ◽  
Author(s):  
Gilad Sorek

Abstract This work presents the first analysis of competition through horizontal and vertical differentiation in option demand markets, which are common in the health-care sector. I studied two alternative market structures: (a) a “pure” option demand market where medical providers sell insurance directly to consumers and (b) a public insurance regime where the public insurer bargains over prices with providers before bundling both products under a single insurance policy. I show that (a) product choices in option demand markets differ greatly from those in respective spot markets and (b) bundling medical products under a single insurance policy alters product choices and equilibrium prices in a way that does not benefit consumers.


2019 ◽  
Vol 57 (1) ◽  
pp. 35-54 ◽  
Author(s):  
Paul F. Burke ◽  
Christine Eckert ◽  
Srishti Sethi

Previous research has demonstrated that consumers evaluate products according to their perceived benefits when making a choice. This article extends prior work by proposing a method that evaluates the degree to which multiple a priori defined benefits mediate product choices. The model is the first to consider process heterogeneity—that is, heterogeneity in how consumers perceive multiple attributes to positively or negatively affect multiple benefits simultaneously and the contribution of each benefit to product utility. The authors propose discrete choice experiments to holistically measure the link between attributes and benefits, as well as between attributes and choice, resulting in data that can be analyzed with a generalized probit model. The approach contributes to mediation research by offering an alternative method of handling multiple multinomial mediators and dichotomous outcome variables. An empirical illustration of bread choices shows how consumer judgments about health and value perceptions of products mediate purchase decisions. The authors demonstrate how the method can help managers (1) confirm and test existing knowledge about latent benefits, including whether they explain all the variation in choice, and (2) consider process heterogeneity to inform market segmentation strategies.


Author(s):  
Yanbo Ge ◽  
Alec Biehl ◽  
Srinath Ravulaparthy ◽  
Venu Garikapati ◽  
Monte Lunacek ◽  
...  

Airport ground access mode choice is distinct from everyday mode choice decisions, necessitating context-specific choice model estimation. Understanding airport ground access mode choice decisions is not only important for developing infrastructure planning strategies, but also for assessing the impacts of emerging modes on airport revenues, particularly from parking. However, parking choice is an often-overlooked dimension in airport ground access choice modeling. This paper addresses this gap through the development of a joint model of airport access mode and parking option choice using a passenger survey conducted at Dallas-Fort Worth (DFW) International Airport in 2015. Compared with a traditional conditional logit model that does not consider parking options available at DFW airport, the joint model of mode and parking decisions was found to generate more realistic values of travel time and was shown to have better predictive performance, both of which are critical for obtaining better airport parking revenue estimates and identifying traveler cohorts who may respond more strongly to potential policies targeting curb congestion and parking demand.


2021 ◽  
pp. 1-43
Author(s):  
Jonathan B. Scott

Abstract This paper studies the role of the U.S. pipeline infrastructure in the country's transition from coal to natural gas energy. I leverage the EPA's Mercury and Air Toxics Standards as a plausibly exogenous intervention, which encouraged many coal plants to convert to natural gas. Combining this quasi-experimental variation with a plant's preexisting proximity to the pipeline network, I isolate implied pipeline connection costs within a dynamic discrete choice model of plant conversions. Key model results indicate that infrastructure-related costs prevent $9 billion in emissions reductions from taking place, suggesting a $2.4 million per mile external benefit of pipeline expansions.


Author(s):  
Dongwoo Lee ◽  
John Mulrow ◽  
Chana Joanne Haboucha ◽  
Sybil Derrible ◽  
Yoram Shiftan

This article applies machine learning (ML) to develop a choice model on three choice alternatives related to autonomous vehicles (AV): regular vehicle (REG), private AV (PAV), and shared AV (SAV). The learned model is used to examine users’ preferences and behaviors on AV uptake by car commuters. Specifically, this study applies gradient boosting machine (GBM) to stated preference (SP) survey data (i.e., panel data). GBM notably possesses more interpretable features than other ML methods as well as high predictive performance for panel data. The prediction performance of GBM is evaluated by conducting a 5-fold cross-validation and shows around 80% accuracy. To interpret users’ behaviors, variable importance (VI) and partial dependence (PD) were measured. The results of VI indicate that trip cost, purchase cost, and subscription cost are the most influential variables in selecting an alternative. Moreover, the attitudinal variables Pro-AV Sentiment and Environmental Concern are also shown to be significant. The article also examines the sensitivity of choice by using the PD of the log-odds on selected important factors. The results inform both the modeling of transportation technology uptake and the configuration and interpretation of GBM that can be applied for policy analysis.


2014 ◽  
Vol 71 (1) ◽  
pp. 141-150 ◽  
Author(s):  
A. van der Lee ◽  
D.M. Gillis ◽  
P. Comeau

The spatial dynamics of catch and effort data are often overlooked in fisheries research despite its well-documented utility in understanding the distribution and abundance of fish. We apply a recently developed fisheries isodar model to an otter trawl groundfish fishery on the Scotian Shelf and compare its predictive performance with a more traditional discrete choice model random utility model. Isodars represent the expected distribution of foragers between two habitats when fitness is equal and can be a representation of the ideal free distribution. Here, fitness was defined with relative catch rates, cost differentials, and interference effects between habitats. Random utility models were constructed as mixed logit models to give the expected probability of fishing in a particular area based on a collection of predictors. The predictions of the isodar models consistently outperformed the mixed logit for both in-sample and out-of-sample forecasts and the isodar was determined to be the preferred model based on its increased accuracy and simplicity. The isodar model can provide a statistically powerful and easily implemented tool in effort studies, especially in situations of aggregated or limited data, which can inform conservation and management decisions.


Sign in / Sign up

Export Citation Format

Share Document