weight decay
Recently Published Documents


TOTAL DOCUMENTS

97
(FIVE YEARS 39)

H-INDEX

15
(FIVE YEARS 3)

2022 ◽  
Author(s):  
Christopher Graney-Ward ◽  
Biju Issac ◽  
Lida Ketsbaia ◽  
Seibu Mary Jacob

Due to the recent popularity and growth of social media platforms such as Facebook and Twitter, cyberbullying is becoming more and more prevalent. The current research on cyberbullying, and the NLP techniques used to classify this kind of online behaviour, was studied first. This paper discusses experiments with combined Twitter datasets from Maryland and Cornell universities using different classification approaches such as classical machine learning, RNN, CNN, and pretrained transformer-based classifiers. A state-of-the-art (SOTA) solution was achieved by optimising BERTweet on a one-cycle policy with a decoupled weight decay optimiser (AdamW), improving the previous F1-score by up to 8.4% and resulting in 64.8% macro F1. Particle Swarm Optimisation was later used to optimise the ensemble model. The ensemble, developed from the optimised BERTweet model and a collection of models with varying data representations, outperformed the standalone BERTweet model by 0.53%, resulting in 65.33% macro F1 for the TweetEval dataset, and by 0.55% for the combined datasets, resulting in 68.1% macro F1.
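
For readers unfamiliar with the term, the following is a minimal sketch of what "decoupled weight decay" means in an Adam-style update. It is not the authors' training code: the function name `adam_step`, its parameters, and the toy values are illustrative assumptions, and the decoupled branch follows the common formulation in which the decay term is scaled by the learning rate. With `decoupled=False` the decay is folded into the gradient (classic L2 regularization) and is therefore rescaled by Adam's adaptive normalisation; with `decoupled=True` it shrinks the weights directly.

```python
import math

def adam_step(w, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999,
              eps=1e-8, weight_decay=0.0, decoupled=True):
    """One Adam update on plain lists of floats (hypothetical helper).

    decoupled=True  -> AdamW-style: decay is applied directly to the weights.
    decoupled=False -> L2-style: decay is added to the gradient first.
    t is the 1-based step count used for bias correction.
    """
    new_w, new_m, new_v = [], [], []
    for wi, gi, mi, vi in zip(w, grad, m, v):
        if not decoupled:
            # L2 regularization: the penalty gradient passes through
            # Adam's moment estimates like any other gradient.
            gi = gi + weight_decay * wi
        mi = beta1 * mi + (1.0 - beta1) * gi
        vi = beta2 * vi + (1.0 - beta2) * gi * gi
        m_hat = mi / (1.0 - beta1 ** t)   # bias-corrected first moment
        v_hat = vi / (1.0 - beta2 ** t)   # bias-corrected second moment
        step = lr * m_hat / (math.sqrt(v_hat) + eps)
        if decoupled:
            # Decoupled decay: independent of the adaptive scaling above.
            step += lr * weight_decay * wi
        new_w.append(wi - step)
        new_m.append(mi)
        new_v.append(vi)
    return new_w, new_m, new_v

# Toy comparison with a zero loss gradient, so only the decay acts.
w_decoupled, _, _ = adam_step([2.0], [0.0], [0.0], [0.0], t=1,
                              lr=0.1, weight_decay=0.01, decoupled=True)
w_l2, _, _ = adam_step([2.0], [0.0], [0.0], [0.0], t=1,
                       lr=0.1, weight_decay=0.01, decoupled=False)
```

In this toy case the decoupled update shrinks the weight by `lr * weight_decay * w`, whereas the L2 variant takes a step of roughly `lr` because Adam normalises the small penalty gradient. That difference is what makes the two schemes behave differently in practice.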


Entropy ◽  
2021 ◽  
Vol 23 (12) ◽  
pp. 1629
Author(s):  
Ali Unlu ◽  
Laurence Aitchison

We developed Variational Laplace for Bayesian neural networks (BNNs), which exploits a local approximation of the curvature of the likelihood to estimate the ELBO without the need for stochastic sampling of the neural-network weights. The Variational Laplace objective is simple to evaluate, as it is the log-likelihood plus weight-decay, plus a squared-gradient regularizer. Variational Laplace gave better test performance and expected calibration errors than maximum a posteriori inference and standard sampling-based variational inference, despite using the same variational approximate posterior. Finally, we emphasize the care needed in benchmarking standard VI, as there is a risk of stopping before the variance parameters have converged. We show that early-stopping can be avoided by increasing the learning rate for the variance parameters.


2021 ◽  
Vol 8 (Supplement_1) ◽  
pp. S499-S499
Author(s):  
Flávio Henrique Batista de Souza ◽  
Bráulio R G M Couto ◽  
Felipe Leandro Andrade da Conceição ◽  
Gabriel Henrique Silvestre da Silva ◽  
Igor Gonçalves Dias ◽  
...  

Abstract Background: A study focused on surgical site infection (SSI) was performed in patients undergoing cardiac pacemaker implantation surgery. The main objective is to statistically evaluate such incidences and enable a study of the predictive power for SSI of pattern recognition algorithms, in this case the Multilayer Perceptron (MLP). Methods: Data on SSI were collected from five hospitals in the city of Belo Horizonte (more than 3,000,000 inhabitants), between July 2016 and June 2018, by the Hospital Infection Control Committees (CCIH) of the hospitals involved in the research. All data used in the analysis were collected during their routine SSI surveillance procedures. Three procedures were performed: a treatment of the collected database to use intact samples; a statistical analysis of the profile of the hospitals; and an assessment of the predictive power of five types of MLP (Backpropagation Standard, Momentum, Resilient Propagation, Weight Decay, and Quick Propagation) for SSI prediction. MLPs were tested with 3, 5, 7, and 10 hidden layer neurons and a database split for the resampling process (65% and 75% for testing, 35% and 25% for validation). They were compared by measuring the AUC (Area Under the Curve, from 0 to 1) obtained for each configuration. Results: From 1394, 572 records were: 21% of deaths and 2.4% patients had SSI; of the confirmed SSI cases, approximately 64.3% had sites classified as “clean”; length of hospital stay ranged from 0 to 175 days (from 1 to 70 days); the average age was 67 years. For the prediction of SSI, the experiments achieved AUC values from 0.409 to 0.722. Conclusion: Despite the considerable loss rate of more than 65% of the database samples due to the presence of noise, it was possible to obtain a relevant sample for the profile evaluation of Belo Horizonte hospitals. Moreover, for the predictive process, although some configurations reached 0.722. 
To optimize data collection and enable other hospitals to use the SSI prediction tool (available at www.nois.org.br), two mobile applications were developed: one for monitoring the patient in the hospital and the other for monitoring after hospital discharge. Disclosures: All Authors: No reported disclosures


2021 ◽  
pp. 1-11
Author(s):  
Dugang Guo ◽  
Jun Liu ◽  
Xuewei Wang

Plant disease is one of the major threats to food security. Accurate diagnosis of plant diseases can benefit agricultural production. Deep learning models are employed for real-time plant disease diagnostics. In this study, we present an accurate identification method for common diseases of tomatoes based on deep-learning methods. A multi-resolution detector, together with bounding-box generation and assignment, facilitates feature extraction for detection. The use of dropout and the AdamW (adaptive moment estimation with decoupled weight decay) optimizer further mitigates the overfitting problem. Using the collected images of healthy and diseased tomatoes, our detector is trained to identify 10 different diseases. Experimental results showed that the disease identification method proposed in this study could accurately and rapidly identify common diseases of tomato with an average accuracy of 85.03% and a recognition speed of 61 frames per second, which was superior to other models under the same conditions and was beneficial for tomato disease control work.


2021 ◽  
Vol 19 (2) ◽  
pp. 9-15
Author(s):  
Arjun Singh Saud ◽  
Subarna Shakya

Stock price forecasting is a field of interest for many stock investors seeking to earn more profit from stock trading. Nowadays, machine learning researchers are also involved in this research field so that fast, accurate and automatic stock price forecasting can be achieved. This research paper evaluated the GRU network’s performance with weight decay regularization techniques for predicting the price of stocks listed on NEPSE. The three weight decay regularization techniques analyzed in this research work were (1) L1 regularization, (2) L2 regularization, and (3) L1_L2 regularization. In this research work, experiments were run on six randomly selected stocks from NEPSE. From the experimental results, we observed that L2 regularization could outperform the L1 and L1_L2 regularization techniques for all six stocks. The average MSE obtained with L2 regularization was 4.12% to 33.52% lower than the average MSE obtained with L1 regularization, and it was 10.92% to 37.1% lower than the average MSE obtained with L1_L2 regularization. Thus, we concluded that L2 regularization is the best choice among weight regularization techniques for stock price prediction.
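
The three penalties compared in this abstract can be sketched as follows. This is an illustration only, not the paper's implementation: the function names and the coefficients `lam`, `lam1`, and `lam2` are assumptions, and the definitions follow the widely used convention of a coefficient times the sum of absolute weights (L1), a coefficient times the sum of squared weights (L2), and the sum of both (L1_L2). In training, the chosen penalty would be added to the data loss, for example the MSE.

```python
def l1_penalty(weights, lam):
    """L1 penalty: lam * sum(|w|). Tends to push weights to exactly zero."""
    return lam * sum(abs(w) for w in weights)

def l2_penalty(weights, lam):
    """L2 penalty: lam * sum(w^2). Shrinks large weights more strongly."""
    return lam * sum(w * w for w in weights)

def l1_l2_penalty(weights, lam1, lam2):
    """Combined penalty: the L1 and L2 terms added together."""
    return l1_penalty(weights, lam1) + l2_penalty(weights, lam2)

# Hypothetical weight vector and coefficients, for illustration only.
weights = [1.0, -2.0, 0.5]
p1 = l1_penalty(weights, 0.01)            # 0.01 * (1 + 2 + 0.5)
p2 = l2_penalty(weights, 0.01)            # 0.01 * (1 + 4 + 0.25)
p12 = l1_l2_penalty(weights, 0.01, 0.01)  # sum of the two above
```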


2021 ◽  
pp. 1-34
Author(s):  
Runhao Jiang ◽  
Jie Zhang ◽  
Rui Yan ◽  
Huajin Tang

Learning new concepts rapidly from a few examples is an open issue in spike-based machine learning. This few-shot learning imposes substantial challenges to the current learning methodologies of spiking neuron networks (SNNs) due to the lack of task-related priori knowledge. The recent learning-to-learn (L2L) approach allows SNNs to acquire priori knowledge through example-level learning and task-level optimization. However, an existing L2L-based framework does not target the neural dynamics (i.e., neuronal and synaptic parameter changes) on different timescales. This diversity of temporal dynamics is an important attribute in spike-based learning, which facilitates the networks to rapidly acquire knowledge from very few examples and gradually integrate this knowledge. In this work, we consider the neural dynamics on various timescales and provide a multi-timescale optimization (MTSO) framework for SNNs. This framework introduces an adaptive-gated LSTM to accommodate two different timescales of neural dynamics: short-term learning and long-term evolution. Short-term learning is a fast knowledge acquisition process achieved by a novel surrogate gradient online learning (SGOL) algorithm, where the LSTM guides gradient updating of SNN on a short timescale through an adaptive learning rate and weight decay gating. The long-term evolution aims to slowly integrate acquired knowledge and form, which can be achieved by optimizing the LSTM guidance process to tune SNN parameters on a long timescale. Experimental results demonstrate that the collaborative optimization of multi-timescale neural dynamics can make SNNs achieve promising performance for the few-shot learning tasks.


2021 ◽  
Vol 38 (3) ◽  
pp. 903-909
Author(s):  
Veeranjaneyulu Naralasetti ◽  
Reshmi Khadherbhi Shaik ◽  
Gayatri Katepalli ◽  
Jyostna Devi Bodapati

Chest X-rays are widely used and approved for the diagnosis of various diseases such as pneumonia. Manual screening of these X-ray images by a technician or radiologist requires expertise and is time-consuming. To address this, we propose an automated approach for the diagnosis of pneumonia that assists doctors in spotting infected areas in the X-ray images. We propose a deep Convolutional Neural Network (CNN) model for efficiently detecting the presence of pneumonia in the X-ray images. The proposed CNN is designed with 5 convolution blocks followed by 4 fully connected layers. To boost the performance of the model, we incorporate batch normalization, dynamic dropout, learning rate decay, and L2 regularization (weight decay), along with the Adam optimizer and the binary cross-entropy loss function, while training the model using the backpropagation algorithm. The proposed model is validated on two publicly accessible benchmark datasets, and the experimental studies conducted on these datasets indicate that the proposed model is efficient. The suggested CNN architecture with the specified hyperparameters allows the model to outperform several existing models by achieving accuracies of 97.73% and 91.17% for the binary and multi-class pneumonia classification tasks, respectively.

