The structure of intrinsic complexity of learning

AbstractLimiting identification of r.e. indexes for r.e. languages (from a presentation of elements of the language) and limiting identification of programs for computable functions (from a graph of the function) have served as models for investigating the boundaries of learnability. Recently, a new approach to the study of “intrinsic” complexity of identification in the limit has been proposed. This approach, instead of dealing with the resource requirements of the learning algorithm, uses the notion of reducibility from recursion theory to compare and to capture the intuitive difficulty of learning various classes of concepts. Freivalds, Kinber, and Smith have studied this approach for function identification and Jain and Sharma have studied it for language identification.The present paper explores the structure of these reducibilities in the context of language identification. It is shown that there is an infinite hierarchy of language classes that represent learning problems of increasing difficulty. It is also shown that the language classes in this hierarchy are incomparable, under the reductions introduced, to the collection of pattern languages.Richness of the structure of intrinsic complexity is demonstrated by proving that any finite, acyclic, directed graph can be embedded in the reducibility structure. However, it is also established that this structure is not dense. The question of embedding any infinite, acyclic, directed graph is open.

Download Full-text

A Weight Moving Average Based Alternate Decoupled Learning Algorithm for Long-Tailed Language Identification

10.21437/interspeech.2021-776 ◽

2021 ◽

Author(s):

Hui Wang ◽

Lin Liu ◽

Yan Song ◽

Lei Fang ◽

Ian McLoughlin ◽

...

Keyword(s):

Learning Algorithm ◽

Moving Average ◽

Language Identification

Download Full-text

Optimal prosodic feature extraction and classification in parametric excitation source information for Indian language identification using neural network based Q-learning algorithm

International Journal of Speech Technology ◽

10.1007/s10772-018-09582-6 ◽

2018 ◽

Vol 22 (1) ◽

pp. 67-77 ◽

Cited By ~ 2

Author(s):

Himanish Shekhar Das ◽

Pinki Roy

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Parametric Excitation ◽

Learning Algorithm ◽

Language Identification ◽

Excitation Source ◽

Source Information ◽

Indian Language ◽

Q Learning ◽

Prosodic Feature

Download Full-text

The path set polytope of an acyclic, directed graph with an application to machine sequencing

Networks ◽

10.1002/net.3230190510 ◽

1989 ◽

Vol 19 (5) ◽

pp. 607-614 ◽

Cited By ~ 3

Author(s):

John H. Vande Vate

Keyword(s):

Directed Graph ◽

Acyclic Directed Graph

Download Full-text

An Improved Robust Fuzzy Algorithm for Unsupervised Learning

Journal of Intelligent Systems ◽

10.1515/jisys-2018-0030 ◽

2018 ◽

Vol 29 (1) ◽

pp. 1028-1042 ◽

Cited By ~ 1

Author(s):

Amina Dik ◽

Khalid Jebari ◽

Aziz Ettouhami

Keyword(s):

Unsupervised Learning ◽

Learning Algorithm ◽

Processing Method ◽

Learning Problems ◽

Second Phase ◽

Benchmark Datasets ◽

Noise Clustering ◽

Cluster A ◽

Number Of Classes ◽

Fuzzy Learning

Abstract This paper presents a robust, dynamic, and unsupervised fuzzy learning algorithm (RDUFL) that aims to cluster a set of data samples with the ability to detect outliers and assign the numbers of clusters automatically. It consists of three main stages. The first (1) stage is a pre-processing method in which possible outliers are determined and quarantined using a concept of proximity degree. The second (2) stage is a learning method, which consists in auto-detecting the number of classes with their prototypes for a dynamic threshold. This threshold is automatically determined based on the similarity among the detected prototypes that are updated at the exploration of a new data. The last (3) stage treats quarantined samples detected from the first stage to determine whether they belong to some class defined in the second phase. The effectiveness of this method is assessed on eight real medical benchmark datasets in comparison to known unsupervised learning methods, namely, the fuzzy c-means (FCM), possibilistic c-means (PCM), and noise clustering (NC). The obtained accuracy of our scheme is very promising for unsupervised learning problems.

Download Full-text

Acyclic Directed Graph

Encyclopedia of GIS ◽

10.1007/978-3-319-17885-1_100035 ◽

2017 ◽

pp. 48-48

Keyword(s):

Directed Graph ◽

Acyclic Directed Graph

Download Full-text

Improving Locality-Aware Scheduling with Acyclic Directed Graph Partitioning

Parallel Processing and Applied Mathematics - Lecture Notes in Computer Science ◽

10.1007/978-3-030-43229-4_19 ◽

2020 ◽

pp. 211-223

Author(s):

M. Yusuf Özkaya ◽

Anne Benoit ◽

Ümit V. Çatalyürek

Keyword(s):

Directed Graph ◽

Graph Partitioning ◽

Acyclic Directed Graph

Download Full-text

Image De-Noising Using Deep Learning

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.641-642.1287 ◽

2014 ◽

Vol 641-642 ◽

pp. 1287-1290

Author(s):

Lan Zhang ◽

Yu Feng Nie ◽

Zhen Hai Wang

Keyword(s):

Neural Network ◽

Deep Learning ◽

Input Data ◽

Deep Neural Network ◽

Learning Algorithm ◽

State Of The Art ◽

Large Data ◽

Image Database ◽

Learning Problems ◽

Deep Learning Algorithm

Deep neural network as a part of deep learning algorithm is a state-of-the-art approach to find higher level representations of input data which has been introduced to many practical and challenging learning problems successfully. The primary goal of deep learning is to use large data to help solving a given task on machine learning. We propose an methodology for image de-noising project defined by this model and conduct training a large image database to get the experimental output. The result shows the robustness and efficient our our algorithm.

Download Full-text

Building a Chinese AMR Bank with Concept and Relation Alignments

Linguistic Issues in Language Technology ◽

10.33011/lilt.v18i.1429 ◽

2019 ◽

Vol 18 (1) ◽

Author(s):

Bin Li ◽

Yuan Wen ◽

Li Song ◽

Weiguang Qu ◽

Nianwen Xue

Keyword(s):

Quantitative Analysis ◽

Directed Graph ◽

Annotation Tool ◽

Systematic Treatment ◽

Tree Graphs ◽

Discourse Relations ◽

Significant Change ◽

Meaning Representation ◽

Acyclic Directed Graph

Abstract Meaning Representation (AMR) is a meaning representation framework in which the meaning of a full sentence is represented as a single-rooted, acyclic, directed graph. In this article, we describe an on-going project to build a Chinese AMR (CAMR) corpus, which currently includes 10,149 sentences from the newsgroup and weblog portion of the Chinese TreeBank (CTB). We describe the annotation specifications for the CAMR corpus, which follow the annotation principles of English AMR but make adaptations where needed to accommodate the linguistic facts of Chinese. The CAMR specifications also include a systematic treatment of sentence-internal discourse relations. One significant change we have made to the AMR annotation methodology is the inclusion of the alignment between word tokens in the sentence and the concepts/relations in the CAMR annotation to make it easier for automatic parsers to model the correspondence between a sentence and its meaning representation. We develop an annotation tool for CAMR, and the inter-agreement as measured by the Smatch score between the two annotators is 0.83, indicating reliable annotation. We also present some quantitative analysis of the CAMR corpus. 46.71% of the AMRs of the sentences are non-tree graphs. Moreover, the AMR of 88.95% of the sentences has concepts inferred from the context of the sentence but do not correspond to a specific word.

Download Full-text

Online Multitask Relative Similarity Learning

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/253 ◽

2017 ◽

Cited By ~ 2

Author(s):

Shuji Hao ◽

Peilin Zhao ◽

Yong Liu ◽

Steven C. H. Hoi ◽

Chunyan Miao

Keyword(s):

Real World ◽

Learning Algorithm ◽

Learning Problems ◽

Similarity Function ◽

Learning Approaches ◽

Similarity Learning ◽

Real World Data ◽

Real World Datasets ◽

Online Learning Algorithm ◽

Relative Similarity

Relative similarity learning~(RSL) aims to learn similarity functions from data with relative constraints. Most previous algorithms developed for RSL are batch-based learning approaches which suffer from poor scalability when dealing with real-world data arriving sequentially. These methods are often designed to learn a single similarity function for a specific task. Therefore, they may be sub-optimal to solve multiple task learning problems. To overcome these limitations, we propose a scalable RSL framework named OMTRSL (Online Multi-Task Relative Similarity Learning). Specifically, we first develop a simple yet effective online learning algorithm for multi-task relative similarity learning. Then, we also propose an active learning algorithm to save the labeling cost. The proposed algorithms not only enjoy theoretical guarantee, but also show high efficacy and efficiency in extensive experiments on real-world datasets.

Download Full-text

Federated Ensemble Regression Using Classification

Discovery Science - Lecture Notes in Computer Science ◽

10.1007/978-3-030-61527-7_22 ◽

2020 ◽

pp. 325-339

Author(s):

Oghenejokpeme I. Orhobor ◽

Larisa N. Soldatova ◽

Ross D. King

Keyword(s):

Ensemble Learning ◽

Predictive Accuracy ◽

Learning Algorithm ◽

Learning Problems ◽

Multiple Models ◽

Base Case ◽

Ensemble Learning Algorithm ◽

Regression Problems ◽

Learning Set ◽

Improved Performance

Abstract Ensemble learning has been shown to significantly improve predictive accuracy in a variety of machine learning problems. For a given predictive task, the goal of ensemble learning is to improve predictive accuracy by combining the predictive power of multiple models. In this paper, we present an ensemble learning algorithm for regression problems which leverages the distribution of the samples in a learning set to achieve improved performance. We apply the proposed algorithm to a problem in precision medicine where the goal is to predict drug perturbation effects on genes in cancer cell lines. The proposed approach significantly outperforms the base case.

Download Full-text