Online Learning Algorithms
Recently Published Documents

TOTAL DOCUMENTS: 45 (five years: 12)
H-INDEX: 11 (five years: 1)
2021
Author(s): Kris J. Ferreira, Sunanda Parthasarathy, Shreyas Sekar

We consider the product-ranking challenge that online retailers face when their customers typically behave as "window shoppers": they form an impression of the assortment after browsing products ranked in the initial positions and then decide whether to continue browsing. We design online learning algorithms for product ranking that maximize the number of customers who engage with the site. Customers' product preferences and attention spans are correlated and unknown to the retailer; furthermore, the retailer cannot exploit similarities across products because the products are not necessarily characterized by a set of attributes. We develop a class of online learning-then-earning algorithms that prescribe a ranking to offer each customer, learning from preceding customers' clickstream data to offer better rankings to subsequent customers. Our algorithms balance product popularity with diversity, the notion of appealing to a large variety of heterogeneous customers. We prove that our learning algorithms converge to a ranking that matches the best-known approximation factors for the offline, complete-information setting. Finally, we partner with Wayfair, a multibillion-dollar home goods online retailer, to estimate the impact of our algorithms in practice via simulations using actual clickstream data, and we find that our algorithms yield a significant increase (5–30%) in the number of customers who engage with the site. This paper was accepted by J. George Shanthikumar for the Management Science Special Issue on Data-Driven Prescriptive Analytics.
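As a rough illustration of the learning-then-earning idea described above, the sketch below alternates an exploration phase (randomized rankings) with an exploitation phase (rank by estimated click-through rate). The class name, the two-phase split, and the CTR heuristic are illustrative assumptions, not the paper's algorithm, which additionally balances popularity with diversity and carries approximation guarantees.

```python
import random
from collections import defaultdict

# Toy learn-then-earn ranker: early customers see randomized rankings
# (learning); later customers see products ordered by estimated click-through
# rate (earning). Illustrative only -- the paper's algorithms also trade off
# popularity against diversity and come with approximation guarantees.

class LearnThenEarnRanker:
    def __init__(self, products, learning_customers=1000):
        self.products = list(products)
        self.learning_customers = learning_customers
        self.clicks = defaultdict(int)   # product -> clicks observed so far
        self.views = defaultdict(int)    # product -> impressions observed so far
        self.customers_seen = 0

    def rank(self):
        """Return the ranking shown to the next arriving customer."""
        self.customers_seen += 1
        if self.customers_seen <= self.learning_customers:
            ranking = self.products[:]
            random.shuffle(ranking)      # explore: randomize positions
            return ranking
        ctr = lambda p: (self.clicks[p] + 1) / (self.views[p] + 2)  # smoothed CTR
        return sorted(self.products, key=ctr, reverse=True)         # exploit

    def observe(self, ranking, clicked, positions_viewed):
        """Record one customer's clickstream (positions browsed and products clicked)."""
        for product in ranking[:positions_viewed]:
            self.views[product] += 1
        for product in clicked:
            self.clicks[product] += 1
```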


2021
Author(s): Zichuan Xu, Dongqi Liu, Weifa Liang, Wenzheng Xu, Haipeng Dai, ...

Author(s): Jean Walrand

Abstract: Online learning algorithms update their estimates as additional observations are made. Section 12.1 explains a simple example: online linear regression. The stochastic gradient projection algorithm is a general technique to update estimates based on additional observations; it is widely used in machine learning. Section 12.2 presents the theory behind that algorithm. When analyzing large amounts of data, one faces the problems of identifying the most relevant data and of using the available data efficiently. Section 12.3 explains three examples of how these questions are addressed: the LASSO algorithm, compressed sensing, and the matrix completion problem. Section 12.4 discusses deep neural networks, for which the stochastic gradient projection algorithm is easy to implement.
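A minimal sketch of the online linear regression example from Section 12.1: the weight estimate is updated by a stochastic gradient step after each new observation rather than by refitting on all past data. The fixed step size and the toy data are assumptions for illustration.

```python
import numpy as np

# Online linear regression via stochastic gradient descent: update the weight
# estimate after every observation (x_t, y_t). Minimal sketch with a fixed
# step size; the projection step of the general algorithm is omitted because
# the parameter set here is unconstrained.

def online_linear_regression(stream, dim, step_size=0.01):
    w = np.zeros(dim)
    for x, y in stream:
        error = w @ x - y            # prediction error on the newest sample
        w -= step_size * error * x   # gradient step on the squared loss
    return w

# Toy usage: recover w_true = [2.0, -1.0] from noisy streaming observations.
rng = np.random.default_rng(0)
w_true = np.array([2.0, -1.0])
stream = [(x, float(x @ w_true + 0.1 * rng.standard_normal()))
          for x in rng.standard_normal((5000, 2))]
print(online_linear_regression(stream, dim=2))   # approximately [2.0, -1.0]
```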


2020
Vol 2020, pp. 1-15
Author(s): Cuiqing Zhang, Maojun Zhang, Xijun Liang, Zhonghang Xia, Jiangxia Nan

Due to its wide applications and learning efficiency, online ordinal regression using perceptron algorithms with interval labels (PRIL) has been increasingly applied to ordinal ranking problems. However, handling noisy labels remains a challenge for the PRIL method, since the ranking results may change dramatically in their presence. To tackle this problem, we propose a noise-resilient online learning algorithm using the ramp loss function, called PRIL-RAMP, and its nonlinear variant K-PRIL-RAMP, to improve the performance of the PRIL method on noisy data streams. The proposed algorithms iteratively optimize the decision function under the framework of online gradient descent (OGD), and we justify the algorithms by showing the order preservation of thresholds. Experiments on real-world datasets show that both approaches are more robust and efficient against noisy labels than state-of-the-art online ordinal regression algorithms.
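The sketch below illustrates the noise-resilience mechanism in a heavily simplified form: an online-gradient-descent update for threshold-based ordinal regression in which a ramp (clipped hinge) loss ignores examples whose margin is violated by more than a cutoff, so grossly mislabelled points do not move the model. It assumes exact labels rather than the interval labels PRIL-RAMP handles, and the update rule is an illustrative stand-in, not the paper's algorithm.

```python
import numpy as np

# Simplified online ordinal regression with a ramp loss. For each threshold,
# the gradient is non-zero only when the margin lies in (clip, 1): points that
# violate the margin by more than |clip| (likely label noise) are ignored,
# which is the intuition behind ramp-loss robustness. Exact labels assumed.

class RampOrdinalRegressor:
    def __init__(self, dim, num_ranks, lr=0.1, clip=-1.0):
        self.w = np.zeros(dim)
        self.b = np.zeros(num_ranks - 1)   # ordered thresholds b_1 <= ... <= b_{K-1}
        self.lr, self.clip = lr, clip

    def predict(self, x):
        return int(np.sum(self.w @ x > self.b)) + 1   # rank = 1 + thresholds passed

    def update(self, x, y):
        score = self.w @ x
        for k in range(len(self.b)):
            d = 1.0 if k < y - 1 else -1.0    # side of threshold k the score should lie on
            margin = d * (score - self.b[k])
            if self.clip < margin < 1.0:      # ramp loss: gradient active in this band only
                self.w += self.lr * d * x
                self.b[k] -= self.lr * d
        self.b.sort()   # safeguard; the paper proves the thresholds stay ordered
```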


Sensors
2020
Vol 20 (18), pp. 5356
Author(s): Ganapati Bhat, Nicholas Tran, Holly Shill, Umit Y. Ogras

Human activity recognition (HAR) is growing in popularity due to its wide-ranging applications in patient rehabilitation and movement disorders. HAR approaches typically start with collecting sensor data for the activities under consideration and then develop algorithms using the dataset. As such, the success of HAR algorithms depends on the availability and quality of datasets. Most existing work on HAR uses data from inertial sensors on wearable devices or smartphones to design HAR algorithms. However, inertial sensors exhibit high noise, which makes it difficult to segment the data and classify the activities. Furthermore, existing approaches typically do not make their data publicly available, which makes it difficult or impossible to compare HAR approaches. To address these issues, we present wearable HAR (w-HAR), a dataset that contains labeled data of seven activities from 22 users. Our dataset's unique aspect is the integration of data from inertial and wearable stretch sensors, thus providing two modalities of activity information. The wearable stretch sensor data allow us to create variable-length segments and ensure that each segment contains a single activity. We also provide a HAR framework that uses w-HAR to classify the activities. To this end, we first perform a design space exploration to choose a neural network architecture for activity classification. Then, we use two online learning algorithms to adapt the classifier to users whose data are not included at design time. Experiments on the w-HAR dataset show that our framework achieves 95% accuracy, while the online learning algorithms improve the accuracy by as much as 40%.
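The online-adaptation step can be pictured with the sketch below, which incrementally updates a classifier as labelled segments from a new user arrive. It uses scikit-learn's SGDClassifier and random placeholder features purely to show the partial_fit pattern; it is not the paper's neural-network classifier or its two online learning algorithms.

```python
import numpy as np
from sklearn.linear_model import SGDClassifier

# Illustrative online adaptation of an activity classifier to a new user:
# train on design-time data, then update incrementally (partial_fit) on each
# labelled segment from the new user, without retraining from scratch.
# Features and labels are random placeholders; the real framework uses a
# neural network chosen by design space exploration on the w-HAR dataset.

NUM_ACTIVITIES = 7      # w-HAR contains seven activities
NUM_FEATURES = 16       # assumed number of features per segment

rng = np.random.default_rng(0)
clf = SGDClassifier(random_state=0)

# 1) Train on segments from the users available at design time.
X_design = rng.standard_normal((700, NUM_FEATURES))
y_design = rng.integers(0, NUM_ACTIVITIES, 700)
clf.partial_fit(X_design, y_design, classes=np.arange(NUM_ACTIVITIES))

# 2) Adapt online as labelled segments from a new, unseen user arrive.
for _ in range(50):
    x_new = rng.standard_normal((1, NUM_FEATURES))   # one new segment
    y_new = rng.integers(0, NUM_ACTIVITIES, 1)       # its observed label
    clf.partial_fit(x_new, y_new)                    # incremental update
```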


2020
Vol 34 (04), pp. 3962-3969
Author(s): Evrard Garcelon, Mohammad Ghavamzadeh, Alessandro Lazaric, Matteo Pirotta

In many fields, such as digital marketing, healthcare, finance, and robotics, it is common to have a well-tested and reliable baseline policy running in production (e.g., a recommender system). Nonetheless, the baseline policy is often suboptimal. In this case, it is desirable to deploy online learning algorithms (e.g., a multi-armed bandit algorithm) that interact with the system to learn a better or optimal policy, under the constraint that, during the learning process, performance is almost never worse than that of the baseline itself. In this paper, we study the conservative learning problem in the contextual linear bandit setting and introduce a novel algorithm, the Conservative Constrained LinUCB (CLUCB2). We derive regret bounds for CLUCB2 that match existing results and empirically show that it outperforms state-of-the-art conservative bandit algorithms on a number of synthetic and real-world problems. Finally, we consider a more realistic constraint where the performance is verified only at predefined checkpoints (instead of at every step) and show how this relaxed constraint favorably impacts the regret and empirical performance of CLUCB2.
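A hedged sketch of the conservative-exploration pattern: play the LinUCB candidate only when a pessimistic estimate of cumulative performance stays above a (1 - alpha) fraction of the baseline's, otherwise fall back to the baseline arm. The confidence bounds and the budget condition here are simplified stand-ins; CLUCB2's actual constraint and analysis are more involved, and the sketch assumes the baseline's per-round reward is known.

```python
import numpy as np

# Simplified conservative LinUCB sketch: a ridge-regression reward model with
# upper/lower confidence bounds; the bandit's arm is played only if a
# pessimistic cumulative-reward estimate stays above (1 - alpha) times the
# baseline's cumulative reward. Not CLUCB2 itself -- the real constraint and
# confidence widths are more refined.

class ConservativeLinUCB:
    def __init__(self, dim, alpha=0.1, beta=1.0, reg=1.0):
        self.A = reg * np.eye(dim)      # regularized design matrix
        self.b = np.zeros(dim)
        self.alpha, self.beta = alpha, beta
        self.cum_lower = 0.0            # pessimistic reward collected so far
        self.cum_baseline = 0.0         # baseline reward accumulated so far

    def _bounds(self, x):
        theta = np.linalg.solve(self.A, self.b)
        width = self.beta * np.sqrt(x @ np.linalg.solve(self.A, x))
        return theta @ x - width, theta @ x + width    # (lower, upper)

    def choose(self, arm_features, baseline_arm):
        candidate = int(np.argmax([self._bounds(x)[1] for x in arm_features]))
        cand_lower = self._bounds(arm_features[candidate])[0]
        base_lower = self._bounds(arm_features[baseline_arm])[0]
        budget_ok = (self.cum_lower + cand_lower
                     >= (1 - self.alpha) * (self.cum_baseline + base_lower))
        return candidate if budget_ok else baseline_arm   # safe fallback

    def update(self, x, reward, baseline_reward):
        self.A += np.outer(x, x)
        self.b += reward * x
        self.cum_lower += self._bounds(x)[0]
        self.cum_baseline += baseline_reward   # assumed known per round
```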


2020
Vol 17 (2), pp. 172988142090257
Author(s): Dan Xiong, Huimin Lu, Qinghua Yu, Junhao Xiao, Wei Han, ...

Traditional tracking methods achieve high frame rates, but they fail when the object template or model drifts, especially when the object disappears from the camera's field of view. To address this, combining tracking and detection has become increasingly popular for long-term tracking of unknown objects: the detector rarely drifts and can reacquire the object when it reappears. However, online machine learning and multiscale object detection require considerable computing resources and time, so combining tracking and detection sequentially, as in the Tracking-Learning-Detection algorithm, is not ideal. Inspired by parallel tracking and mapping, this article proposes a framework of parallel tracking and detection for unknown object tracking. The object tracking algorithm is split into two separate tasks, tracking and detection, which are processed in two different threads. One thread handles tracking between consecutive frames at a high processing speed. The other thread runs online learning algorithms to construct a discriminative model for object detection. Using our proposed framework, high tracking frame rates and the ability to correct and recover a failed tracker can be combined effectively. Furthermore, our framework provides open interfaces to integrate state-of-the-art object tracking and detection algorithms. We carry out an evaluation of several popular tracking and detection algorithms using the proposed framework. The experimental results show that different tracking and detection algorithms can be integrated and compared effectively within our proposed framework, and that robust and fast long-term object tracking can be realized.
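The two-thread structure described above can be sketched as follows; the tracker and detector objects with track_frame, reinitialize, learn, detect, and disagrees methods are placeholders for whatever algorithms are plugged into the framework's open interfaces, so this shows only the hand-off between the threads, not the paper's implementation.

```python
import queue
import threading

# Structural sketch of parallel tracking and detection: a fast tracking thread
# follows the object frame to frame, while a slower detection thread learns a
# discriminative model and re-initializes the tracker after drift or loss.
# The tracker/detector objects and their methods are placeholders.

frames_for_detection = queue.Queue(maxsize=4)   # tracker -> detector hand-off
corrections = queue.Queue()                     # detector -> tracker hand-off

def tracking_thread(video, tracker):
    for frame in video:
        box = tracker.track_frame(frame)             # fast, runs at camera rate
        if not frames_for_detection.full():
            frames_for_detection.put((frame, box))   # share recent result
        while not corrections.empty():
            tracker.reinitialize(corrections.get())  # recover from failure

def detection_thread(detector):
    while True:
        frame, box = frames_for_detection.get()
        detector.learn(frame, box)                   # update discriminative model
        detection = detector.detect(frame)           # slower, multiscale search
        if detection is not None and detector.disagrees(box, detection):
            corrections.put(detection)               # correct the drifting tracker

# threading.Thread(target=tracking_thread, args=(video, tracker), daemon=True).start()
# threading.Thread(target=detection_thread, args=(detector,), daemon=True).start()
```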


2020
Vol 2020, pp. 1-8
Author(s): Haiqing Yu, Jun Ji, Ping Li, Fengjing Shao, Shunyao Wu, ...

Soft sensor techniques have been widely adopted in the chemical industry to estimate important indices that cannot be measured online by hardware sensors. Unfortunately, due to inherent time variation, small-sample conditions, and the uncertainty caused by drifting raw materials, it is exceedingly difficult to model fed-batch processes such as rubber internal mixing. Meanwhile, traditional global learning algorithms suffer from outdated samples, while online learning algorithms lack practicality because too many labelled samples from the current batch are required to build the soft sensor. In this paper, semi-supervised hybrid local kernel regression (SHLKR) is presented to leverage both historical and online samples to model the soft sensor in a semi-supervised manner using the proposed time-window series. Moreover, recursive formulas are deduced to improve its adaptability and feasibility. Additionally, a rubber Mooney soft sensor for the internal mixing process is implemented using real onsite data to validate the proposed method. The performance of SHLKR is evaluated against classical algorithms, and the contribution of unlabelled samples is discussed.
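For orientation, the sketch below shows plain local (Gaussian) kernel regression over a window of samples, which is the building block SHLKR extends with semi-supervised learning and recursive updates; the feature dimension, bandwidth, and toy data are assumptions, and the semi-supervised and recursive parts of SHLKR are not reproduced.

```python
import numpy as np

# Plain local (Gaussian) kernel regression over a sliding window of samples:
# the Nadaraya-Watson estimate weights nearby samples more heavily, so a
# window mixing historical and current-batch samples adapts to slow drift.
# SHLKR's semi-supervised and recursive extensions are not shown here.

def local_kernel_regression(X_window, y_window, x_query, bandwidth=1.0):
    """Nadaraya-Watson estimate of y at x_query from the windowed samples."""
    dists = np.linalg.norm(X_window - x_query, axis=1)
    weights = np.exp(-(dists ** 2) / (2.0 * bandwidth ** 2))
    return float(weights @ y_window / weights.sum())

# Toy usage: a window of 200 (process variables, Mooney viscosity) pairs.
rng = np.random.default_rng(0)
X_window = rng.standard_normal((200, 5))     # 5 process variables (assumed)
y_window = X_window @ np.array([1.0, 0.5, -0.2, 0.0, 0.3]) + 0.05 * rng.standard_normal(200)
print(local_kernel_regression(X_window, y_window, X_window[0], bandwidth=0.8))
```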


2019
Vol 364, pp. 338-348
Author(s): Guangxia Li, Yulong Shen, Peilin Zhao, Xiao Lu, Jia Liu, ...
