training strategy
Recently Published Documents


TOTAL DOCUMENTS

635
(FIVE YEARS 305)

H-INDEX

19
(FIVE YEARS 7)

2022 ◽  
Vol 40 (1) ◽  
pp. 1-27
Author(s):  
Lei Guo ◽  
Hongzhi Yin ◽  
Tong Chen ◽  
Xiangliang Zhang ◽  
Kai Zheng

Group recommendation aims to recommend items to a group of users. In this work, we study group recommendation in a particular scenario, namely occasional group recommendation, where groups are formed ad hoc and users may just constitute a group for the first time—that is, the historical group-item interaction records are highly limited. Most state-of-the-art works have addressed the challenge by aggregating group members’ personal preferences to learn the group representation. However, the representation learning for a group is most complex beyond the aggregation or fusion of group member representation, as the personal preferences and group preferences may be in different spaces and even orthogonal. In addition, the learned user representation is not accurate due to the sparsity of users’ interaction data. Moreover, the group similarity in terms of common group members has been overlooked, which, however, has the great potential to improve the group representation learning. In this work, we focus on addressing the aforementioned challenges in the group representation learning task, and devise a hierarchical hyperedge embedding-based group recommender, namely HyperGroup. Specifically, we propose to leverage the user-user interactions to alleviate the sparsity issue of user-item interactions, and design a graph neural network-based representation learning network to enhance the learning of individuals’ preferences from their friends’ preferences, which provides a solid foundation for learning groups’ preferences. To exploit the group similarity (i.e., overlapping relationships among groups) to learn a more accurate group representation from highly limited group-item interactions, we connect all groups as a network of overlapping sets (a.k.a. hypergraph), and treat the task of group preference learning as embedding hyperedges (i.e., user sets/groups) in a hypergraph, where an inductive hyperedge embedding method is proposed. To further enhance the group-level preference modeling, we develop a joint training strategy to learn both user-item and group-item interactions in the same process. We conduct extensive experiments on two real-world datasets, and the experimental results demonstrate the superiority of our proposed HyperGroup in comparison to the state-of-the-art baselines.


Author(s):  
Dimitrije Kovac ◽  
Zarko Krkeljas ◽  
Ranel Venter

Abstract Background Improving the quality of functional movements in athletes generally requires additional training targeting specific functional deficiencies. However, well-rounded, traditional strength and conditioning program should also improve player’s movement quality. Therefore, the primary aim of this study was to compare the effect of two different six-week interventions on the functional score of female netball players. Methods In a randomized controlled study, players were divided into control and intervention group. Both groups completed identical six-week strength and conditioning program, with the intervention group also completing additional corrective exercises three sessions per week during the same period. Results The FMS® score was significantly higher in the intervention group after 6-week program (f = 9.85, p = 0.004). However, the differences in total score may be attributed mainly to differences between groups in active straight leg raise (p = 0.004) and trunk stability push-up test (p = 0.02), as other individual tests demonstrated similar time and group effect. Conclusion These results indicate that although FMS® based intervention may improve overall functional movement score, the athletes in both groups have demonstrated similar improvements in most of the individual tests. Hence, a well-rounded strength and conditioning program incorporating athlete-specific exercises based on limitations identified in the functional movement screen, may result in a balanced training strategy and reduce the need for supplementary functional training sessions.


2022 ◽  
Vol 12 ◽  
Author(s):  
Shenda Hong ◽  
Wenrui Zhang ◽  
Chenxi Sun ◽  
Yuxi Zhou ◽  
Hongyan Li

Cardiovascular diseases (CVDs) are one of the most fatal disease groups worldwide. Electrocardiogram (ECG) is a widely used tool for automatically detecting cardiac abnormalities, thereby helping to control and manage CVDs. To encourage more multidisciplinary researches, PhysioNet/Computing in Cardiology Challenge 2020 (Challenge 2020) provided a public platform involving multi-center databases and automatic evaluations for ECG classification tasks. As a result, 41 teams successfully submitted their solutions and were qualified for rankings. Although Challenge 2020 was a success, there has been no in-depth methodological meta-analysis of these solutions, making it difficult for researchers to benefit from the solutions and results. In this study, we aim to systematically review the 41 solutions in terms of data processing, feature engineering, model architecture, and training strategy. For each perspective, we visualize and statistically analyze the effectiveness of the common techniques, and discuss the methodological advantages and disadvantages. Finally, we summarize five practical lessons based on the aforementioned analysis: (1) Data augmentation should be employed and adapted to specific scenarios; (2) Combining different features can improve performance; (3) A hybrid design of different types of deep neural networks (DNNs) is better than using a single type; (4) The use of end-to-end architectures should depend on the task being solved; (5) Multiple models are better than one. We expect that our meta-analysis will help accelerate the research related to ECG classification based on machine-learning models.


2022 ◽  
Vol 11 (1) ◽  
pp. 44
Author(s):  
Hai-Yan Yao ◽  
Wang-Gen Wan ◽  
Xiang Li

Analysis of pedestrians’ motion is important to real-world applications in public scenes. Due to the complex temporal and spatial factors, trajectory prediction is a challenging task. With the development of attention mechanism recently, transformer network has been successfully applied in natural language processing, computer vision, and audio processing. We propose an end-to-end transformer network embedded with random deviation queries for pedestrian trajectory forecasting. The self-correcting scheme can enhance the robustness of the network. Moreover, we present a co-training strategy to improve the training effect. The whole scheme is trained collaboratively by the original loss and classification loss. Therefore, we also achieve more accurate prediction results. Experimental results on several datasets indicate the validity and robustness of the network. We achieve the best performance in individual forecasting and comparable results in social forecasting. Encouragingly, our approach achieves a new state of the art on the Hotel and Zara2 datasets compared with the social-based and individual-based approaches.


Author(s):  
Kaixuan Cui ◽  
Shuchai Su ◽  
Jiawei Cai ◽  
Fengjun Chen

To realize rapid and accurate ripeness detection for walnut on mobile terminals such as mobile phones, we propose a method based on coupling information and lightweight YOLOv4. First, we collected 50 walnuts at each ripeness (Unripe, Mid-ripe, Ripe, Over-ripe) to determine the kernel oil content. Pearson correlation analysis and one-way analysis of variance (ANOVA) prove that the division of walnut ripeness reflects the change in kernel oil content. It is feasible to estimate the kernel oil content by detecting the ripeness of walnut. Next, we achieve ripeness detection based on lightweight YOLOv4. We adopt MobileNetV3 as the backbone feature extractor and adopt depthwise separable convolution to replace the traditional convolution. We design a parallel convolution structure with depthwise convolution stacking (PCSDCS) to reduce parameters and improve feature extraction ability. To enhance the model’s detection ability for walnuts in the growth-intensive areas, we design a Gaussian Soft DIoU non-maximum suppression (GSDIoU-NMS) algorithm. The dataset used for model optimization contains 3600 images, of which 2880 images in the training set, 320 images in the validation set, and 400 images in the test set. We adopt a multi-training strategy based on dynamic learning rate and transfer learning to get training weights. The lightweight YOLOv4 model achieves 94.05%, 90.72%, 88.30%, 76.92 FPS, and 38.14 MB in mean average precision, precision, recall, average detection speed, and weight capacity, respectively. Compared with the Faster R-CNN model, EfficientDet-D1 model, YOLOv3 model, and YOLOv4 model, the lightweight YOLOv4 model improves 8.77%, 4.84%, 5.43%, and 0.06% in mean average precision, 74.60 FPS, 55.60 FPS, 38.83 FPS, and 46.63 FPS in detection speed, respectively. And the lightweight YOLOv4 is 84.4% smaller than the original YOLOv4 model in terms of weight capacity. This paper provides a theoretical reference for the rapid ripeness detection of walnut and exploration for the model’s lightweight.


Sensors ◽  
2021 ◽  
Vol 21 (24) ◽  
pp. 8418
Author(s):  
Xiang Jin ◽  
Wei Lan ◽  
Tianlin Wang ◽  
Pengyao Yu

Path planning technology is significant for planetary rovers that perform exploration missions in unfamiliar environments. In this work, we propose a novel global path planning algorithm, based on the value iteration network (VIN), which is embedded within a differentiable planning module, built on the value iteration (VI) algorithm, and has emerged as an effective method to learn to plan. Despite the capability of learning environment dynamics and performing long-range reasoning, the VIN suffers from several limitations, including sensitivity to initialization and poor performance in large-scale domains. We introduce the double value iteration network (dVIN), which decouples action selection and value estimation in the VI module, using the weighted double estimator method to approximate the maximum expected value, instead of maximizing over the estimated action value. We have devised a simple, yet effective, two-stage training strategy for VI-based models to address the problem of high computational cost and poor performance in large-size domains. We evaluate the dVIN on planning problems in grid-world domains and realistic datasets, generated from terrain images of a moon landscape. We show that our dVIN empirically outperforms the baseline methods and generalize better to large-scale environments.


2021 ◽  
Vol 7 (12) ◽  
pp. 276
Author(s):  
Antonio Galli ◽  
Stefano Marrone ◽  
Gabriele Piantadosi ◽  
Mario Sansone ◽  
Carlo Sansone

The recent spread of Deep Learning (DL) in medical imaging is pushing researchers to explore its suitability for lesion segmentation in Dynamic Contrast-Enhanced Magnetic-Resonance Imaging (DCE-MRI), a complementary imaging procedure increasingly used in breast-cancer analysis. Despite some promising proposed solutions, we argue that a “naive” use of DL may have limited effectiveness as the presence of a contrast agent results in the acquisition of multimodal 4D images requiring thorough processing before training a DL model. We thus propose a pipelined approach where each stage is intended to deal with or to leverage a peculiar characteristic of breast DCE-MRI data: the use of a breast-masking pre-processing to remove non-breast tissues; the use of Three-Time-Points (3TP) slices to effectively highlight contrast agent time course; the application of a motion-correction technique to deal with patient involuntary movements; the leverage of a modified U-Net architecture tailored on the problem; and the introduction of a new “Eras/Epochs” training strategy to handle the unbalanced dataset while performing a strong data augmentation. We compared our pipelined solution against some literature works. The results show that our approach outperforms the competitors by a large margin (+9.13% over our previous solution) while also showing a higher generalization ability.


2021 ◽  
Author(s):  
Ping-Huan Kuo ◽  
Po-Chien Luan ◽  
Yung-Ruen Tseng ◽  
Her-Terng Yau

Abstract Chatter has a direct effect on the precision and life of machine tools and its detection is a crucial issue in all metal machining processes. Traditional methods focus on how to extract discriminative features to help identify chatter. Nowadays, deep learning models have shown an extraordinary ability to extract data features which are their necessary fuel. In this study deep learning models have been substituted for more traditional methods. Chatter data are rare and valuable because the collecting process is extremely difficult. To solve this practical problem an innovative training strategy has been proposed that is combined with a modified convolutional neural network and deep convolutional generative adversarial nets. This improves chatter detection and classification. Convolutional neural networks can be effective chatter classifiers, and adversarial networks can act as generators that produce more data. The convolutional neural networks were trained using original data as well as by forged data produced by the generator. Original training data were collected and preprocessed by the Chen-Lee chaotic system. The adversarial training process used these data to create the generator and the generator could produce enough data to compensate for the lack of training data. The experimental results were compared with without a data generator and data augmentation. The proposed method had an accuracy of 95.3% on leave-one-out cross-validation over ten runs and surpassed other methods and models. The forged data were also compared with original training data as well as data produced by augmentation. The distribution shows that forged data had similar quality and characteristics to the original data. The proposed training strategy provides a high-quality deep learning chatter detection model.


2021 ◽  
Author(s):  
Alexander Aguirre-Perez ◽  
Rajagopal Shyamala Joshya ◽  
Helene Carrere ◽  
Xavier Marie ◽  
Thierry Amand ◽  
...  

Abstract We demonstrate the application of a two stage machine learning algorithm that enables to correlate the electrical signals from a GaAsxN1-x circular polarimeter with the intensity, degree of circular polarization and handedness of an incident light beam. Specifically, we employ a multimodal logistic regression to discriminate the handedness of light and a 6-layer neural network to establish the relationship between the input voltages, the intensity and degree of circular polarization. We have developed a particular neural network training strategy that substantially improves the accuracy of the device. The algorithm was trained and tested on theoretically generated photoconductivity and on photoluminescence experimental results. Even for a small training experimental dataset (70 instances), it is shown that the proposed algorithm correctly predicts linear, right and left circularly polarized light misclassifying less than 1.5% of the cases and attains an accuracy larger than 97% in the vast majority of the predictions (92%) for intensity and degree of circular polarization. These numbers are significantly improved for the larger theoretically generated datasets (4851 instances). The algorithm is versatile enough that it can be easily adjusted to other device configurations where a map needs to be established between the input parameters and the device response. Training and testing data files as well as the algorithm are provided as supplementary material.


Sign in / Sign up

Export Citation Format

Share Document