PiLiMoT: A Modified Combination of LoLiMoT and PLN Learning Algorithms for Local Linear Neurofuzzy Modeling

Locally linear model tree (LoLiMoT) and piecewise linear network (PLN) learning algorithms are two approaches in local linear neurofuzzy modeling. While both methods belong to the class of growing tree learning algorithms, they use different logics. PLN learning relies on training data, it needs rich training data set and no division test, so it is much faster than LoLiMoT, but it may create adjacent neurons that may lead to singularity in regression matrix. On the other hand, LoLiMoT almost always leads to acceptable output error, but it often needs more rules. In this paper, to exploit the complimentary performance of both algorithms piecewise linear model tree (PiLiMoT) learning algorithm is introduced. In essence, PiLiMoT is a combination of LoLiMoT and PLN learning. The initially proposed algorithm is improved by adding the ability to merge previously divided local linear models, and utilizing a simulated annealing stochastic decision process to select a local model for splitting. Comparing to LoLiMoT and PLN learning, our proposed improved learning algorithm shows the ability to construct models with less number of rules at comparable modeling errors. Algorithms are compared through a case study of nonlinear function approximation. Obtained results demonstrate the advantages of combined modified method.

Download Full-text

Piecewise Linear Model Tree: A modified combination of two learning algorithms for neuro-fuzzy models

10.1109/cacsd-cca-isic.2006.4776974 ◽

2006 ◽

Author(s):

Atiye Sarabi Jamab ◽

Babak N. Araabi

Keyword(s):

Linear Model ◽

Piecewise Linear ◽

Learning Algorithms ◽

Fuzzy Models ◽

Piecewise Linear Model ◽

Model Tree ◽

Neuro Fuzzy

Download Full-text

Piecewise Linear Model Tree: A Modified Combination of two Learning Algorithms for Neuro-Fuzzy Models

2006 IEEE International Conference on Control Applications ◽

10.1109/cca.2006.286200 ◽

2006 ◽

Author(s):

Atiye Jamab ◽

Babak Araabi

Keyword(s):

Linear Model ◽

Piecewise Linear ◽

Learning Algorithms ◽

Fuzzy Models ◽

Piecewise Linear Model ◽

Model Tree ◽

Neuro Fuzzy

Download Full-text

Information-Theoretic Generalization Bounds for Meta-Learning and Applications

Entropy ◽

10.3390/e23010126 ◽

2021 ◽

Vol 23 (1) ◽

pp. 126

Author(s):

Sharu Theresa Jose ◽

Osvaldo Simeone

Keyword(s):

Learning Algorithm ◽

Broad Class ◽

Performance Measure ◽

Training Data ◽

Learning To Learn ◽

Data Set ◽

Information Theoretic ◽

Meta Learning ◽

Task Training ◽

Test Sets

Meta-learning, or “learning to learn”, refers to techniques that infer an inductive bias from data corresponding to multiple related tasks with the goal of improving the sample efficiency for new, previously unobserved, tasks. A key performance measure for meta-learning is the meta-generalization gap, that is, the difference between the average loss measured on the meta-training data and on a new, randomly selected task. This paper presents novel information-theoretic upper bounds on the meta-generalization gap. Two broad classes of meta-learning algorithms are considered that use either separate within-task training and test sets, like model agnostic meta-learning (MAML), or joint within-task training and test sets, like reptile. Extending the existing work for conventional learning, an upper bound on the meta-generalization gap is derived for the former class that depends on the mutual information (MI) between the output of the meta-learning algorithm and its input meta-training data. For the latter, the derived bound includes an additional MI between the output of the per-task learning procedure and corresponding data set to capture within-task uncertainty. Tighter bounds are then developed for the two classes via novel individual task MI (ITMI) bounds. Applications of the derived bounds are finally discussed, including a broad class of noisy iterative algorithms for meta-learning.

Download Full-text

Towards global empirical upscaling of FLUXNET eddy covariance observations: validation of a model tree ensemble approach using a biosphere model

Biogeosciences Discussions ◽

10.5194/bgd-6-5271-2009 ◽

2009 ◽

Vol 6 (3) ◽

pp. 5271-5304 ◽

Cited By ~ 22

Author(s):

M. Jung ◽

M. Reichstein ◽

A. Bondeau

Keyword(s):

Eddy Covariance ◽

Learning Algorithm ◽

Gross Primary Production ◽

Global Network ◽

Data Set ◽

Model Trees ◽

Ensemble Approach ◽

Model Tree ◽

Biosphere Model ◽

Variance Explained

Abstract. Global, spatially and temporally explicit estimates of carbon and water fluxes derived from empirical up-scaling eddy covariance measurements would constitute a new and possibly powerful data stream to study the variability of the global terrestrial carbon and water cycle. This paper introduces and validates a machine learning approach dedicated to the upscaling of observations from the current global network of eddy covariance towers (FLUXNET). We present a new model TRee Induction ALgorithm (TRIAL) that performs hierarchical stratification of the data set into units where particular multiple regressions for a target variable hold. We propose an ensemble approach (Evolving tRees with RandOm gRowth, ERROR) where the base learning algorithm is perturbed in order to gain a diverse sequence of different model trees which evolves over time. We evaluate the efficiency of the model tree ensemble approach using an artificial data set derived from the the Lund-Potsdam-Jena managed Land (LPJmL) biosphere model. We aim at reproducing global monthly gross primary production as simulated by LPJmL from 1998–2005 using only locations and months where high quality FLUXNET data exist for the training of the model trees. The model trees are trained with the LPJmL land cover and meteorological input data, climate data, and the fraction of absorbed photosynthetic active radiation simulated by LPJmL. Given that we know the "true result" in the form of global LPJmL simulations we can effectively study the performance of the model tree ensemble upscaling and associated problems of extrapolation capacity. We show that the model tree ensemble is able to explain 92% of the variability of the global LPJmL GPP simulations. The mean spatial pattern and the seasonal variability of GPP that constitute the largest sources of variance are very well reproduced (96% and 94% of variance explained respectively) while the monthly interannual anomalies which occupy much less variance are less well matched (41% of variance explained). We demonstrate the substantially improved accuracy of the model tree ensemble over individual model trees in particular for the monthly anomalies and for situations of extrapolation. We estimate that roughly one fifth of the domain is subject to extrapolation while the model tree ensemble is still able to reproduce 73% of the LPJmL GPP variability here. This paper presents for the first time a benchmark for a global FLUXNET upscaling approach that will be employed in future studies. Although the real world FLUXNET upscaling is more complicated than for a noise free and reduced complexity biosphere model as presented here, our results show that an empirical upscaling from the current FLUXNET network with a model tree ensemble is feasible and able to extract global patterns of carbon flux variability.

Download Full-text

Exploring a Local Linear Model Tree Approach to Car-Following

Computer-Aided Civil and Infrastructure Engineering ◽

10.1111/mice.12011 ◽

2013 ◽

Vol 28 (8) ◽

pp. 581-593 ◽

Cited By ~ 9

Author(s):

Kayvan Aghabayk ◽

Nafiseh Forouzideh ◽

William Young

Keyword(s):

Linear Model ◽

Car Following ◽

Model Tree ◽

Local Linear ◽

Tree Approach

Download Full-text

Assessment of Alternative Ways to Integrate Weather Predictions in Photovoltaic Generation Forecasting.

10.5194/egusphere-egu21-16091 ◽

2021 ◽

Author(s):

Kevin Bellinguer ◽

Robin Girard ◽

Guillaume Bontron ◽

Georges Kariniotakis

Keyword(s):

Linear Model ◽

Relevant Information ◽

Training Data ◽

Short Term ◽

Data Set ◽

Photovoltaic Generation ◽

Weather Information ◽

Weather Situation ◽

Conditioned Learning ◽

Pv Generation

<div> In recent years, the share of photovoltaic (PV) power in Europe has grown: the installed capacity increased from around 10 GW in 2008 to nearly 119 GW in 2018 [1]. Due to the intermittent nature of PV generation, new challenges arise regarding economic profitability and the safe operation of the power network. To overcome these issues, a special effort is made to develop efficient PV generation forecasting tools. &#160; For short-term PV production forecasting, past production observations are typically the main drivers. In addition, spatio-temporal (ST) inputs such as Satellite-Derived Surface Irradiance (SDSI) provide relevant information regarding the weather situation in the vicinity of the farm. Moreover, the literature shows us that Numerical Weather Predictions (NWPs) provide relevant information regarding weather trends. &#160; NWPs can be integrated in the forecasting process in two different ways. The most straightforward approach considers NWPs as explanatory input variables to the forecasting models. Thus, the atmosphere dynamics are directly carried by the NWPs. The alternative considers NWPs as state variables: weather information is used to filter the training data set to obtain a coherent subset of PV production observations measured under similar weather conditions as the PV production to be predicted. This approach is based on analog methods and makes the weather dynamics to be implicitly contained in the PV production observations. This conditioned learning approach permits to perform local regressions and is adaptive in the sense that the model training is conditioned to the weather situation. The specialized literature focuses on spot NWPs which permits to find situations that evolve in the same way but does not preserve ST patterns. In this context, the addition of SDSI features cannot make the most of the conditioning process. Ref. [3] proposes to use geopotential fields, which are wind drivers, as analog predictors. &#160; In this work, we propose the following contributions to the state of the art: We investigate the influence of spot NWPs on the performances of an auto-regressive (AR) and a random forest models according to the two above-mentioned approaches: either as additional explanatory features and/or as analog features. The analogy score proposed by [2] is used to find similar weather situations, then the model is trained over the associated PV production observations. The results highlight that the linear model performs better with the conditioned approach while the non-linear model obtains better performances when fed with explanatory features. Then, the similarity score is extended to gridded NWPs data through the use of a principal component analysis. This method allows to condition the learning to large-scale weather information. A comparison between spot and gridded NWPs conditioned approaches applied with AR model highlights that gridded NWPs improves the influence of SDSI over forecasting performances. &#160; The proposed approaches are evaluated using 9 PV plants in France and for a testing period of 12 months. &#160; References [1]&#160;&#160;&#160;&#160;&#160; IRENA - https://www.irena.org/Statistics/Download-Data [2]&#160;&#160;&#160;&#160;&#160; Alessandrini, Delle Monache, et al. An analog ensemble for short-term probabilistic solar power forecast. Applied Energy, 2015. https://doi.org/10.1016/j.apenergy.2015.08.011 [3]&#160;&#160;&#160;&#160;&#160; Bellinguer, Girard, Bontron, Kariniotakis. Short-term Forecasting of Photovoltaic Generation based on Conditioned Learning of Geopotential Fields. 2020, UPEC. https://doi.org/10.1109/UPEC49904.2020.9209858 </div>

Download Full-text

Water Quality Prediction Using Statistical Tool and Machine Learning Algorithm

Waste Management ◽

10.4018/978-1-7998-1210-4.ch029 ◽

2020 ◽

pp. 609-623

Author(s):

Arun Kumar Beerala ◽

Gobinath R. ◽

Shyamala G. ◽

Siribommala Manvitha

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

Training Data ◽

Machine Learning Techniques ◽

Statistical Tool ◽

Data Set ◽

Water Quality Prediction ◽

Living Things ◽

Sampling Locations ◽

Different Seasons

Water is the most valuable natural resource for all living things and the ecosystem. The quality of groundwater is changed due to change in ecosystem, industrialisation, and urbanisation, etc. In the study, 60 samples were taken and analysed for various physio-chemical parameters. The sampling locations were located using global positioning system (GPS) and were taken for two consecutive years for two different seasons, monsoon (Nov-Dec) and post-monsoon (Jan-Mar). In 2016-2017 and 2017-2018 pH, EC, and TDS were obtained in the field. Hardness and Chloride are determined using titration method. Nitrate and Sulphate were determined using Spectrophotometer. Machine learning techniques were used to train the data set and to predict the unknown values. The dominant elements of groundwater are as follows: Ca2, Mg2 for cation and Cl-, SO42, NO3− for anions. The regression value for the training data set was found to be 0.90596, and for the entire network, it was found to be 0.81729. The best performance was observed as 0.0022605 at epoch 223.

Download Full-text

Lost in Space: Geolocation in Event Data

Political Science Research and Methods ◽

10.1017/psrm.2018.23 ◽

2018 ◽

Vol 7 (04) ◽

pp. 871-888 ◽

Cited By ~ 6

Author(s):

Sophie J. Lee ◽

Howard Liu ◽

Michael D. Ward

Keyword(s):

Learning Algorithm ◽

Text Processing ◽

Contextual Information ◽

Training Data ◽

Supervised Machine Learning ◽

Model Parameters ◽

Event Data ◽

Data Set ◽

N Gram ◽

Automated Text Processing

Improving geolocation accuracy in text data has long been a goal of automated text processing. We depart from the conventional method and introduce a two-stage supervised machine-learning algorithm that evaluates each location mention to be either correct or incorrect. We extract contextual information from texts, i.e., N-gram patterns for location words, mention frequency, and the context of sentences containing location words. We then estimate model parameters using a training data set and use this model to predict whether a location word in the test data set accurately represents the location of an event. We demonstrate these steps by constructing customized geolocation event data at the subnational level using news articles collected from around the world. The results show that the proposed algorithm outperforms existing geocoders even in a case added post hoc to test the generality of the developed algorithm.

Download Full-text

Identification of Leukemia Subtypes from Microscopic Images Using Convolutional Neural Network

Diagnostics ◽

10.3390/diagnostics9030104 ◽

2019 ◽

Vol 9 (3) ◽

pp. 104 ◽

Cited By ~ 11

Author(s):

Ahmed ◽

Yigit ◽

Isik ◽

Alpkocak

Keyword(s):

Machine Learning ◽

Data Augmentation ◽

Nearest Neighbor ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Training Data ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Set ◽

Leukemia Data

Leukemia is a fatal cancer and has two main types: Acute and chronic. Each type has two more subtypes: Lymphoid and myeloid. Hence, in total, there are four subtypes of leukemia. This study proposes a new approach for diagnosis of all subtypes of leukemia from microscopic blood cell images using convolutional neural networks (CNN), which requires a large training data set. Therefore, we also investigated the effects of data augmentation for an increasing number of training samples synthetically. We used two publicly available leukemia data sources: ALL-IDB and ASH Image Bank. Next, we applied seven different image transformation techniques as data augmentation. We designed a CNN architecture capable of recognizing all subtypes of leukemia. Besides, we also explored other well-known machine learning algorithms such as naive Bayes, support vector machine, k-nearest neighbor, and decision tree. To evaluate our approach, we set up a set of experiments and used 5-fold cross-validation. The results we obtained from experiments showed that our CNN model performance has 88.25% and 81.74% accuracy, in leukemia versus healthy and multiclass classification of all subtypes, respectively. Finally, we also showed that the CNN model has a better performance than other wellknown machine learning algorithms.

Download Full-text