Updates to LUCI: A New Fitting Paradigm Using Mixture Density Networks

Abstract LUCI is an general-purpose spectral line-fitting pipeline which natively integrates machine learning algorithms to initialize fit functions. LUCI currently uses point-estimates obtained from a convolutional neural network (CNN) to inform optimization algorithms; this methodology has shown great promise by reducing computation time and reducing the chance of falling into a local minimum using convex optimization methods. In this update to LUCI, we expand upon the CNN developed in Rhea et al. so that it outputs Gaussian posterior distributions of the fit parameters of interest (the velocity and broadening) rather than simple point-estimates. Moreover, these posteriors are then used to inform the priors in a Bayesian inference scheme, either emcee or dynesty. The code is publicly available at crhea93:LUCI (https://github.com/crhea93/LUCI).

Download Full-text

PyTorch Operations Based Approach for Computing Local Binary Patterns

U Porto Journal of Engineering ◽

10.24840/2183-6493_007.004_0005 ◽

2021 ◽

Vol 7 (4) ◽

pp. 61-69

Author(s):

Devrim Akgun

Keyword(s):

Machine Learning ◽

Computer Vision ◽

Feature Extraction ◽

Computation Time ◽

Local Binary Patterns ◽

General Purpose ◽

Machine Learning Algorithms ◽

Experimental Measurements ◽

Computational Resources ◽

Learning Frameworks

Advances in machine learning frameworks like PyTorch provides users with various machine learning algorithms together with general purpose operations. PyTorch framework provides Numpy like functions and makes it practical to use computational resources for accelerating computations. Also users may define their custom layers or operations for feature extraction algorithms based on the tensor operations. In this paper, Local Binary Patterns (LBP) which is one of the important feature extraction approaches in computer vision were realized using tensor operations of PyTorch framework. The algorithm was written both using Python code with standard libraries and tensor operations of PyTorch in Python. According to experimental measurements which were realized for various batches of images, the algorithm based on tensor operations considerably reduced the computation time and provides significant accelerations over Python implementation with standard libraries.

Download Full-text

Machine Learning for Design Optimization of Electromagnetic Devices: Recent Developments and Future Directions

Applied Sciences ◽

10.3390/app11041627 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1627

Author(s):

Yanbin Li ◽

Gang Lei ◽

Gerd Bramerdorfer ◽

Sheng Peng ◽

Xiaodong Sun ◽

...

Keyword(s):

Machine Learning ◽

Design Optimization ◽

Optimization Methods ◽

Machine Learning Algorithms ◽

Cloud Services ◽

Robust Design Optimization ◽

Support Vector ◽

Future Directions ◽

Electromagnetic Devices ◽

Recent Developments

This paper reviews the recent developments of design optimization methods for electromagnetic devices, with a focus on machine learning methods. First, the recent advances in multi-objective, multidisciplinary, multilevel, topology, fuzzy, and robust design optimization of electromagnetic devices are overviewed. Second, a review is presented to the performance prediction and design optimization of electromagnetic devices based on the machine learning algorithms, including artificial neural network, support vector machine, extreme learning machine, random forest, and deep learning. Last, to meet modern requirements of high manufacturing/production quality and lifetime reliability, several promising topics, including the application of cloud services and digital twin, are discussed as future directions for design optimization of electromagnetic devices.

Download Full-text

Bayesian Metric Multidimensional Scaling

Political Analysis ◽

10.1093/pan/mps039 ◽

2013 ◽

Vol 21 (1) ◽

pp. 125-140 ◽

Cited By ~ 14

Author(s):

Ryan Bakker ◽

Keith T. Poole

Keyword(s):

Monte Carlo ◽

Markov Chain ◽

Bayesian Methods ◽

Optimal Solution ◽

Full Rank ◽

Mcmc Methods ◽

Ratio Scale ◽

Posterior Distributions ◽

Point Estimates ◽

Inflection Points

In this article, we show how to apply Bayesian methods to noisy ratio scale distances for both the classical similarities problem as well as the unfolding problem. Bayesian methods produce essentially the same point estimates as the classical methods, but are superior in that they provide more accurate measures of uncertainty in the data. Identification is nontrivial for this class of problems because a configuration of points that reproduces the distances is identified only up to a choice of origin, angles of rotation, and sign flips on the dimensions. We prove that fixing the origin and rotation is sufficient to identify a configuration in the sense that the corresponding maxima/minima are inflection points with full-rank Hessians. However, an unavoidable result is multiple posterior distributions that are mirror images of one another. This poses a problem for Markov chain Monte Carlo (MCMC) methods. The approach we take is to find the optimal solution using standard optimizers. The configuration of points from the optimizers is then used to isolate a single Bayesian posterior that can then be easily analyzed with standard MCMC methods.

Download Full-text

Bootstrap Domain-Specific Sentiment Classifiers from Unlabeled Corpora

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00020 ◽

2018 ◽

Vol 6 ◽

pp. 269-285 ◽

Cited By ~ 3

Author(s):

Andrius Mudinas ◽

Dell Zhang ◽

Mark Levene

Keyword(s):

Supervised Learning ◽

Learning Algorithms ◽

General Purpose ◽

Machine Learning Algorithms ◽

Sentiment Classification ◽

Two Phase ◽

Transductive Learning ◽

Domain Specific ◽

Sentiment Lexicon ◽

Supervised Learning Algorithms

There is often the need to perform sentiment classification in a particular domain where no labeled document is available. Although we could make use of a general-purpose off-the-shelf sentiment classifier or a pre-built one for a different domain, the effectiveness would be inferior. In this paper, we explore the possibility of building domain-specific sentiment classifiers with unlabeled documents only. Our investigation indicates that in the word embeddings learned from the unlabeled corpus of a given domain, the distributed word representations (vectors) for opposite sentiments form distinct clusters, though those clusters are not transferable across domains. Exploiting such a clustering structure, we are able to utilize machine learning algorithms to induce a quality domain-specific sentiment lexicon from just a few typical sentiment words (“seeds”). An important finding is that simple linear model based supervised learning algorithms (such as linear SVM) can actually work better than more sophisticated semi-supervised/transductive learning algorithms which represent the state-of-the-art technique for sentiment lexicon induction. The induced lexicon could be applied directly in a lexicon-based method for sentiment classification, but a higher performance could be achieved through a two-phase bootstrapping method which uses the induced lexicon to assign positive/negative sentiment scores to unlabeled documents first, a nd t hen u ses those documents found to have clear sentiment signals as pseudo-labeled examples to train a document sentiment classifier v ia supervised learning algorithms (such as LSTM). On several benchmark datasets for document sentiment classification, our end-to-end pipelined approach which is overall unsupervised (except for a tiny set of seed words) outperforms existing unsupervised approaches and achieves an accuracy comparable to that of fully supervised approaches.

Download Full-text

Better Network Modeling For Link Prediction In Protein-Protein Interaction Networks

10.21203/rs.3.rs-939985/v1 ◽

2021 ◽

Author(s):

Ho Yin Yuen ◽

Jesper Jansson

Keyword(s):

Protein Interaction ◽

Link Prediction ◽

Network Modeling ◽

Computation Time ◽

General Purpose ◽

Computational Techniques ◽

Full Potential ◽

Protein Protein Interaction ◽

Protein Protein Interaction Networks ◽

High Throughput Experiments

Abstract Background: Protein-protein interaction (PPI) data is an important type of data used in functional genomics. However, inaccuracies in high-throughput experiments often result in incomplete PPI data. Computational techniques are thus used to infer missing data and to evaluate confidence scores, with link prediction being one such approach that uses the structure of the network of PPIs known so far to find good candidates for missing PPIs. Recently, a new idea called the L3 principle introduced biological motivation into PPI link predictions, yielding predictors that are superior to general-purpose link predictors for complex networks. However, the previously developed L3 principle-based link predictors are only an approximate implementation of the L3 principle. As such, not only is the full potential of the L3 principle not realized, they may even lead to candidate PPIs that otherwise fit the L3 principle being penalized. Result: In this article, we propose a formulation of link predictors without approximation that we call ExactL3 (L3E) by addressing missing elements within L3 predictors in the perspective of network modeling. Through statistical and biological metrics, we show that in general, L3E predictors perform better than the previously proposed methods on seven datasets across two organisms (human and yeast) using a reasonable amount of computation time. In addition to L3E being able to rank the PPIs more accurately, we also found that L3-based predictors, including L3E, predicted a different pool of real PPIs than the general-purpose link predictors. This suggests that different types of PPIs can be predicted based on different topological assumptions and that even better PPI link predictors may be obtained in the future by improved network modeling.

Download Full-text

A Quick Presentation of Evolutionary Computation

Soft Computing Applications for Database Technologies ◽

10.4018/978-1-60566-814-7.ch002 ◽

2010 ◽

pp. 22-38

Author(s):

Pierre Collet

Keyword(s):

Evolutionary Computation ◽

Random Search ◽

Optimization Technique ◽

Parallel Architecture ◽

Optimization Methods ◽

General Purpose ◽

Old Field ◽

Multiple Data ◽

Intractable Problems ◽

The 1960S

Evolutionary computation is an old field of computer science, that started in the 1960s nearly simultaneously in different parts of the world. It is an optimization technique that mimics the principles of Darwinian evolution in order to find good solutions to intractable problems faster than a random search. Artificial Evolution is only one among many stochastic optimization methods, but recently developed hardware (General Purpose Graphic Processing Units or GPGPU) gives it a tremendous edge over all the other algorithms, because its inherently parallel nature can directly benefit from the difficult to use Single Instruction Multiple Data parallel architecture of these cheap, yet very powerful cards.

Download Full-text

Occam’s Razor for Big Data? On Detecting Quality in Large Unstructured Datasets

Applied Sciences ◽

10.3390/app9153065 ◽

2019 ◽

Vol 9 (15) ◽

pp. 3065 ◽

Cited By ~ 2

Author(s):

Dresp-Langley ◽

Ekseth ◽

Fesl ◽

Gohshi ◽

Kurz ◽

...

Keyword(s):

Big Data ◽

Data Science ◽

Computation Time ◽

Video Data ◽

Machine Learning Algorithms ◽

Sensor Data ◽

Science Data ◽

Occam’S Razor ◽

Principle Of Parsimony ◽

Occam's Razor

Detecting quality in large unstructured datasets requires capacities far beyond the limits of human perception and communicability and, as a result, there is an emerging trend towards increasingly complex analytic solutions in data science to cope with this problem. This new trend towards analytic complexity represents a severe challenge for the principle of parsimony (Occam’s razor) in science. This review article combines insight from various domains such as physics, computational science, data engineering, and cognitive science to review the specific properties of big data. Problems for detecting data quality without losing the principle of parsimony are then highlighted on the basis of specific examples. Computational building block approaches for data clustering can help to deal with large unstructured datasets in minimized computation time, and meaning can be extracted rapidly from large sets of unstructured image or video data parsimoniously through relatively simple unsupervised machine learning algorithms. Why we still massively lack in expertise for exploiting big data wisely to extract relevant information for specific tasks, recognize patterns and generate new information, or simply store and further process large amounts of sensor data is then reviewed, and examples illustrating why we need subjective views and pragmatic methods to analyze big data contents are brought forward. The review concludes on how cultural differences between East and West are likely to affect the course of big data analytics, and the development of increasingly autonomous artificial intelligence (AI) aimed at coping with the big data deluge in the near future.

Download Full-text

Detecting Emotions in English and Arabic Tweets

Information ◽

10.3390/info10030098 ◽

2019 ◽

Vol 10 (3) ◽

pp. 98 ◽

Cited By ~ 4

Author(s):

Tariq Ahmad ◽

Allan Ramsay ◽

Hanady Ahmed

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Deep Neural Networks ◽

State Of The Art ◽

Learning Algorithms ◽

General Purpose ◽

Machine Learning Algorithms ◽

Current State ◽

Optimal Thresholds ◽

Alternative Approach

Assigning sentiment labels to documents is, at first sight, a standard multi-label classification task. Many approaches have been used for this task, but the current state-of-the-art solutions use deep neural networks (DNNs). As such, it seems likely that standard machine learning algorithms, such as these, will provide an effective approach. We describe an alternative approach, involving the use of probabilities to construct a weighted lexicon of sentiment terms, then modifying the lexicon and calculating optimal thresholds for each class. We show that this approach outperforms the use of DNNs and other standard algorithms. We believe that DNNs are not a universal panacea and that paying attention to the nature of the data that you are trying to learn from can be more important than trying out ever more powerful general purpose machine learning algorithms.

Download Full-text

Development of a Common Platform for Testing Metamodel Based Design Optimization Methods

Volume 3B: 39th Design Automation Conference ◽

10.1115/detc2013-12664 ◽

2013 ◽

Author(s):

Adel A. Younis ◽

George H. Cheng ◽

G. Gary Wang ◽

Zuomin Dong

Keyword(s):

Design Optimization ◽

Optimization Problems ◽

Optimization Algorithms ◽

Computation Time ◽

Optimization Methods ◽

Global Optimum ◽

Test Problems ◽

Test Bed ◽

Test Procedures ◽

Local Optima

Metamodel based design optimization (MBDO) algorithms have attracted considerable interests in recent years due to their special capability in dealing with complex optimization problems with computationally expensive objective and constraint functions and local optima. Conventional unimodal-based optimization algorithms and stochastic global optimization algorithms either miss the global optimum frequently or require unacceptable computation time. In this work, a generic testbed/platform for evaluating various MBDO algorithms has been introduced. The purpose of the platform is to facilitate quantitative comparison of different MBDO algorithms using standard test problems, test procedures, and test outputs, as well as to improve the efficiency of new algorithm testing and improvement. The platform consists of a comprehensive test function database that contains about 100 benchmark functions and engineering problems. The testbed accepts any optimization algorithm to be tested, and only requires minor modifications to meet the test-bed requirements. The testbed is useful in comparing the performance of competing algorithms through execution of same problems. It allows researchers and practitioners to test and choose the most suitable optimization tool for their specific needs. It also helps to increase confidence and reliability of the newly developed MBDO tools. Many new MBDO algorithms, including Mode Pursuing Sampling (MPS), Pareto Set Pursuing (PSP), and Space Exploration and Unimodal Region Elimination (SEUMRE), were tested in this work to demonstrate its functionality and benefits.

Download Full-text

New BSP/CGM algorithms for spanning trees

The International Journal of High Performance Computing Applications ◽

10.1177/1094342018803672 ◽

2018 ◽

Vol 33 (3) ◽

pp. 444-461

Author(s):

Jucele França de Alencar Vasconcellos ◽

Edson Norberto Cáceres ◽

Henrique Mongelli ◽

Siang Wun Song ◽

Frank Dehne ◽

...

Keyword(s):

Parallel Algorithms ◽

Parallel Machines ◽

Spanning Trees ◽

Graphics Processing Unit ◽

Computation Time ◽

General Purpose ◽

Coarse Grained ◽

Processing Unit ◽

List Ranking ◽

Bulk Synchronous Parallel

Computing a spanning tree (ST) and a minimum ST (MST) of a graph are fundamental problems in graph theory and arise as a subproblem in many applications. In this article, we propose parallel algorithms to these problems. One of the steps of previous parallel MST algorithms relies on the heavy use of parallel list ranking which, though efficient in theory, is very time-consuming in practice. Using a different approach with a graph decomposition, we devised new parallel algorithms that do not make use of the list ranking procedure. We proved that our algorithms are correct, and for a graph [Formula: see text], [Formula: see text], and [Formula: see text], the algorithms can be executed on a Bulk Synchronous Parallel/Coarse Grained Multicomputer (BSP/CGM) model using [Formula: see text] communications rounds with [Formula: see text] computation time for each round. To show that our algorithms have good performance on real parallel machines, we have implemented them on graphics processing unit. The obtained speedups are competitive and showed that the BSP/CGM model is suitable for designing general purpose parallel algorithms.

Download Full-text