scholarly journals Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting

Author(s):  
Sayna Ebrahimi ◽  
Suzanne Petryk ◽  
Akash Gokul ◽  
William Gan ◽  
Joseph Gonzalez ◽  
...  

The goal of continual learning (CL) is to learn a sequence of tasks without suffering from the phenomenon of catastrophic forgetting. Previous work has shown that leveraging memory in the form of a replay buffer can reduce performance degradation on prior tasks. We hypothesize that forgetting can be further reduced when the model is encouraged to remember the evidence for previously made decisions. As a first step towards exploring this hypothesis, we propose a simple novel training paradigm, called Remembering for the Right Reasons (RRR), that additionally stores visual model explanations for each example in the buffer and ensures the model has “the right reasons” for its predictions by encouraging its explanations to remain consistent with those used to make decisions at training time. Without this constraint, there is a drift in explanations and increase in forgetting as conventional continual learning algorithms learn new tasks. We demonstrate how RRR can be easily added to any memory or regularization-based approach and results in reduced forgetting, and more importantly, improved model explanations. We have evaluated our approach in the standard and few-shot settings and observed a consistent improvement across various CL approaches using different architectures and techniques to generate model explanations and demonstrated our approach showing a promising connection between explainability and continual learning. Our code is available at \url{https://github.com/SaynaEbrahimi/Remembering-for-the-Right-Reasons}

2020 ◽  
Author(s):  
Abeer Saleh ◽  
Talal Hamoud

Abstract Person recognition based on gait model and motion print is indeed a challenging and novel task due to its usages and to the critical issues of human pose variation, human body occlusion, camera view variation, etc. In this project, a deep convolution neural network (CNN) was modified and adapted for person recognition with image augmentation technique. CNN is best algorithm of deep learning algorithms. Adaptation aims to get best values for CNN parameters to get best CNN model. In Addition to the CNN parameters, the design of CNN model itself was adapted to get best model design; number of layers and normalization between them. After choosing best parameters and best design, Image augmentation was used to increase train dataset with many copies of the image to boost the number of different images that will be used to train Deep learning algorithms. The tests were achieved using known dataset (Market dataset). The dataset contains sequential pictures of people in different gait status. The image in CNN model as matrix is extracted to many images or matrices, so dataset size may be bigger by hundred times to make the problem a big data problem, in this project Results show that adaptation has improved the accuracy of person recognition using gait model, that is represented in many successive images for the same person. In addition, dataset contains images of person carrying things. The improved model of CNN is robust to image dimensions (quality and resolution) and to carried things by persons.


2008 ◽  
Vol 55 ◽  
pp. 183-202 ◽  
Author(s):  
John W. Nielsen-Gammon ◽  
David A. Gold

Abstract Advances in computer power, new forecasting challenges, and new diagnostic techniques have brought about changes in the way atmospheric development and vertical motion are diagnosed in an operational setting. Many of these changes, such as improved model skill, model resolution, and ensemble forecasting, have arguably been detrimental to the ability of forecasters to understand and respond to the evolving atmosphere. The use of nondivergent wind in place of geostrophic wind would be a step in the right direction, but the advantages of potential vorticity suggest that its widespread adoption as a diagnostic tool on the west side of the Atlantic is overdue. Ertel potential vorticity (PV), when scaled to be compatible with pseudopotential vorticity, is generally similar to pseudopotential vorticity, so forecasters accustomed to quasigeostrophic reasoning through the height tendency equation can transfer some of their intuition into the Ertel-PV framework. Indeed, many of the differences between pseudopotential vorticity and Ertel potential vorticity are consequences of the choice of definition of quasigeostrophic PV and are not fundamental to the quasigeostrophic system. Thus, at its core, PV thinking is consistent with commonly used quasigeostrophic diagnostic techniques.


2022 ◽  
pp. medethics-2021-107678
Author(s):  
Conor Toale ◽  
Marie Morris ◽  
Dara O Kavanagh

A deontological approach to surgical ethics advocates that patients have the right to receive the best care that can be provided. The ‘learning curve’ in surgical skill is an observable and measurable phenomenon. Surgical training may therefore carry risk to patients. This can occur directly, through inadvertent harm, or indirectly through theatre inefficiency and associated costs. Trainee surgeon operating, however, is necessary from a utilitarian perspective, with potential risk balanced by the greater societal need to train future independent surgeons.New technology means that the surgical learning curve could take place, at least in part, outside of the operating theatre. Simulation-based deliberate practice could be used to obtain a predetermined level of proficiency in a safe environment, followed by simulation-based assessment of operative competence. Such an approach would require an overhaul of the current training paradigm and significant investment in simulator technology. This may increasingly be viewed as necessary in light of well-discussed pressures on surgical trainees and trainers.This article discusses the obligations to trainees, trainers and training bodies raised by simulation technology, and outlines the current arguments both against and in favour of a simulation-based training-to-proficiency model in surgery. The significant changes to the current training paradigm that would be required to implement such a model are also discussed.


2018 ◽  
Vol 7 (1) ◽  
pp. 352-360
Author(s):  
Andressa Paganini ◽  
Elaine Caroline Boscatto ◽  
Adriano Slongo

O bolão 16 é uma modalidade esportiva onde o objetivo é derrubar nove pinos posicionados ao final de uma pista de madeira com uma bola maciça. Exige técnica, inteligência e concentração. Tendo em vista a escassez de estudos na literatura e a necessidade de aprofundar conhecimentos neste esporte específico, foi realizada a presente pesquisa para investigar a incidência de lesões e fatores relacionados às mesmas em atletas do sexo feminino, praticantes da modalidade na cidade de Caçador/SC. Trata-se de estudo transversal, descritivo, com análise quanti-qualitativa dos dados. Foi aplicado um questionário de formulário próprio para investigar informações pessoais, índice de massa corporal, tempo de jogo, dores e lesões articulares, hábitos relacionados à prática da modalidade e de outras atividades físicas. Participaram do estudo 11 atletas, com idades entre 18 a 56 anos e média de IMC 27,25 kg/m2. O tempo de prática variou de 10 meses a 27 anos. A maioria das participantes utiliza o braço direito para o lançamento e a perna esquerda para o apoio ao final do arremesso, demonstrando um movimento harmônico. No entanto, possuem alto índice de lesões/dores articulares, tendo a articulação mais acometida por dores a do joelho, seguida de quadril. Sobre o hábito de aquecimento e alongamento, todas as atletas afirmaram que o realizam; contudo foi verificado que a metade das atletas tem como prática de atividade física exclusiva o bolão, sendo necessário o incentivo a práticas preventivas voltadas ao esporte, além de orientação para a técnica adequada evitando desgaste físico entre outras consequências.Palavras-chave: Esportes. Traumatismos em atletas. Articulações. ABSTRACT: The ninepin it’s a sports mode where play mens and womens, with the objetive is drop the nine pin in the end of wooden track, with the solid ball. It requires technique, intelligence and concentration. Given the scarcity of studies in the literature and the need to deepen knowledge in this specific sport, the present research was carried out to investigate the incidence of injuries and related factors in female athletes, practitioners of the sport in the city of Caçador in Santa Catarina state. This is a cross-sectional, descriptive study with quantitative qualitative data analysis. A questionnaire was used to investigate personal information, body mass index, playing time, pain and joint injuries, habits related to the practice of the sport and other physical activities. Eleven athletes, aged 18 to 56 years, with a mean BMI of 60,075 pounds ft.sq, participated of this study. Regarding the training time, it ranged from 10 months to 27 years. Most athletes use the right arm for the throw and the left leg for support at the end of the pitch, demonstrating a harmonious movement. However, they have a high index of lesions/joint pain, and the joint is more affected by pain in the knee, followed by hip pain. On the warm-up and stretching habit, all athletes stated that they perform it. However, it was verified that half of the athletes have a practice of exclusive physical activity the ninepin, being necessary the incentive to preventive practices oriented to the sport, besides orientation for the appropriate technique avoiding physical wear among other consequences.Keywords: Sports. Athletic injuries. Joint.


2020 ◽  
Author(s):  
Muhammad Haseeb Arshad ◽  
M. A. Abido

This paper serves as an overview for sequential learning algorithms for single hidden layer neural nets. Cite as: M. H. Arshad, M. A. Abido. An Overview of Sequential Learning Algorithms for Single Hidden Layer Networks: Current Issues & Future Trends. Abstract: In this paper, a brief survey of the commonly used sequential-learning algorithms used with single hidden layer feed-forward neural networks is presented. A glimpse at the different kinds that are available in the literature up until now, how they have developed throughout the years, and their relative execution is summarized. Most important things to take note of during the designing phase of neural networks are its complexity, computational efficiency, maximum training time, and ability to generalize the under-study problem. The comparison of different sequential learning algorithms in regard to these merits for single hidden layer neural networks is drawn.


Author(s):  
Prince Nathan S

Abstract: Travelling Salesmen problem is a very popular problem in the world of computer programming. It deals with the optimization of algorithms and an ever changing scenario as it gets more and more complex as the number of variables goes on increasing. The solutions which exist for this problem are optimal for a small and definite number of cases. One cannot take into consideration of the various factors which are included when this specific problem is tried to be solved for the real world where things change continuously. There is a need to adapt to these changes and find optimized solutions as the application goes on. The ability to adapt to any kind of data, whether static or ever-changing, understand and solve it is a quality that is shown by Machine Learning algorithms. As advances in Machine Learning take place, there has been quite a good amount of research for how to solve NP-hard problems using Machine Learning. This reportis a survey to understand what types of machine algorithms can be used to solve with TSP. Different types of approaches like Ant Colony Optimization and Q-learning are explored and compared. Ant Colony Optimization uses the concept of ants following pheromone levels which lets them know where the most amount of food is. This is widely used for TSP problems where the path is with the most pheromone is chosen. Q-Learning is supposed to use the concept of awarding an agent when taking the right action for a state it is in and compounding those specific rewards. This is very much based on the exploiting concept where the agent keeps on learning onits own to maximize its own reward. This can be used for TSP where an agentwill be rewarded for having a short path and will be rewarded more if the path chosen is the shortest. Keywords: LINEAR REGRESSION, LASSO REGRESSION, RIDGE REGRESSION, DECISION TREE REGRESSOR, MACHINE LEARNING, HYPERPARAMETER TUNING, DATA ANALYSIS


Sensors ◽  
2019 ◽  
Vol 19 (3) ◽  
pp. 718 ◽  
Author(s):  
Baohua Qiang ◽  
Shihao Zhang ◽  
Yongsong Zhan ◽  
Wu Xie ◽  
Tian Zhao

In recent years, increasing human data comes from image sensors. In this paper, a novel approach combining convolutional pose machines (CPMs) with GoogLeNet is proposed for human pose estimation using image sensor data. The first stage of the CPMs directly generates a response map of each human skeleton’s key points from images, in which we introduce some layers from the GoogLeNet. On the one hand, the improved model uses deeper network layers and more complex network structures to enhance the ability of low level feature extraction. On the other hand, the improved model applies a fine-tuning strategy, which benefits the estimation accuracy. Moreover, we introduce the inception structure to greatly reduce parameters of the model, which reduces the convergence time significantly. Extensive experiments on several datasets show that the improved model outperforms most mainstream models in accuracy and training time. The prediction efficiency of the improved model is improved by 1.023 times compared with the CPMs. At the same time, the training time of the improved model is reduced 3.414 times. This paper presents a new idea for future research.


2020 ◽  
Vol 31 (3) ◽  
pp. 750-760 ◽  
Author(s):  
Luca Montana ◽  
François Rousseu ◽  
Dany Garant ◽  
Marco Festa-Bianchet

Abstract In polygynous species, male reproductive success is predicted to be monopolized by a few dominant males. This prediction is often not supported, suggesting that ecological and alternative mating tactics influence siring success. The spatiotemporal distribution of individuals and the number of males competing for each receptive female are often overlooked because they are difficult to monitor in wild animals. We examined how spatial overlap of female–male pairs, the time spent by a male on the breeding site, number of competitors, and morphological traits influence siring probability in eastern gray kangaroos (Macropus giganteus). We compared home range overlap for 12 208 dam–male pairs and 295 known dam–sire pairs to define local competitive groups and to estimate every male’s opportunity to sire the young of each female. We compared models considering morphological traits relative to the entire population or to local competitive groups. Including local competition improved model performance because it estimated the intensity of competition and compared each male’s morphological traits to those of its competitive group. Regardless of size, males can increase their probability to sire a young by increasing their mating opportunity relative to the mother. We underline the importance of considering spatial structure to assess the intensity of competition in species where males cannot equally access all females in a population. The estimation of mating opportunity and intensity of local competition improves our understanding of how morphological traits affect siring success when each mating event involves a different set of competing males, a characteristic of most wild species.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Mona Bokharaei Nia ◽  
Mohammadali Afshar Kazemi ◽  
Changiz Valmohammadi ◽  
Ghanbar Abbaspour

PurposeThe increase in the number of healthcare wearable (Internet of Things) IoT options is making it difficult for individuals, healthcare experts and physicians to find the right smart device that best matches their requirements or treatments. The purpose of this research is to propose a framework for a recommender system to advise on the best device for the patient using machine learning algorithms and social media sentiment analysis. This approach will provide great value for patients, doctors, medical centers, and hospitals to enable them to provide the best advice and guidance in allocating the device for that particular time in the treatment process.Design/methodology/approachThis data-driven approach comprises multiple stages that lead to classifying the diseases that a patient is currently facing or is at risk of facing by using and comparing the results of various machine learning algorithms. Hereupon, the proposed recommender framework aggregates the specifications of wearable IoT devices along with the image of the wearable product, which is the extracted user perception shared on social media after applying sentiment analysis. Lastly, a proposed computation with the use of a genetic algorithm was used to compute all the collected data and to recommend the wearable IoT device recommendation for a patient.FindingsThe proposed conceptual framework illustrates how health record data, diseases, wearable devices, social media sentiment analysis and machine learning algorithms are interrelated to recommend the relevant wearable IoT devices for each patient. With the consultation of 15 physicians, each a specialist in their area, the proof-of-concept implementation result shows an accuracy rate of up to 95% using 17 settings of machine learning algorithms over multiple disease-detection stages. Social media sentiment analysis was computed at 76% accuracy. To reach the final optimized result for each patient, the proposed formula using a Genetic Algorithm has been tested and its results presented.Research limitations/implicationsThe research data were limited to recommendations for the best wearable devices for five types of patient diseases. The authors could not compare the results of this research with other studies because of the novelty of the proposed framework and, as such, the lack of available relevant research.Practical implicationsThe emerging trend of wearable IoT devices is having a significant impact on the lifestyle of people. The interest in healthcare and well-being is a major driver of this growth. This framework can help in accelerating the transformation of smart hospitals and can assist doctors in finding and suggesting the right wearable IoT for their patients smartly and efficiently during treatment for various diseases. Furthermore, wearable device manufacturers can also use the outcome of the proposed platform to develop personalized wearable devices for patients in the future.Originality/valueIn this study, by considering patient health, disease-detection algorithm, wearable and IoT social media sentiment analysis, and healthcare wearable device dataset, we were able to propose and test a framework for the intelligent recommendation of wearable and IoT devices helping healthcare professionals and patients find wearable devices with a better understanding of their demands and experiences.


2020 ◽  
Vol 12 (18) ◽  
pp. 7642 ◽  
Author(s):  
Michael J. Ryoba ◽  
Shaojian Qu ◽  
Ying Ji ◽  
Deqiang Qu

Only a small percentage of crowdfunding projects succeed in securing funds, the fact of which puts the sustainability of crowdfunding platforms at risk. Researchers have examined the influences of phased aspects of communication, drawn from updates and comments, on success of crowdfunding campaigns, but in most cases they have focused on the combined effects of the aspects. This paper investigated campaign success contribution of various combinations of phased communication aspects from updates and comments, the best of which can help creators to successfully manage campaigns by focusing on the important communication aspects. Metaheuristic and machine learning algorithms were used to search and evaluate the best combination of phased communication aspects for predicting success using Kickstarter dataset. The study found that the number of updates in phase one, the polarity of comments in phase two, readability of updates and polarity of comments in phase three, and the polarity of comments in phase five are the most important communication aspects in predicting campaign success. Moreover, the success prediction accuracy with the aspects identified after phasing is more than the baseline model without phasing. Our findings can help crowdfunding actors to focus on the important communication aspects leading to improved likelihood of success.


Sign in / Sign up

Export Citation Format

Share Document