label data
Recently Published Documents

TOTAL DOCUMENTS

167

(FIVE YEARS 76)

H-INDEX

(FIVE YEARS 3)

Latest Documents Most Cited Documents Contributed Authors Related Sources Related Keywords

Addressing class imbalance in multi-label data

10.12681/eadd/50783 ◽

2021 ◽

Author(s):

Μπίν Λίου

Keyword(s):

Class Imbalance ◽

Label Data

Στα δεδομένα πολλαπλών ετικετών, κάθε παράδειγμα είναι συσχετισμένο με πολλαπλές δυαδικές μεταβλητές εξόδους (ετικέτες), το οποίο επιτρέπει αναπαραστάσεις με πλουσιότερη σημασιολογία σε σύγκριση με τα δυαδικά ή πολλαπλών κλάσεων δεδομένα. Κατά τη διάρκεια των τελευταίων δύο δεκαετιών, η μάθηση πολλαπλών ετικετών έχει σταδιακά προσελκύσει την προσοχή της ερευνητικής κοινότητας, και έχει ευρέως εφαρμοστεί σε ποικίλα προβλήματα, συμπεριλαμβανομένης της κατηγοριοποίησης κειμένων, της επισημείωσης πολυμέσων και της βιοπληροφορικής. Η ανισορροπία κλάσεων είναι ένα εγγενές χαρακτηριστικό των δεδομένων πολλαπλών ετικε\-τών, το οποίο εισάγει μια κρίσιμη πρόκληση για τις μεθόδους μάθησης πολλαπλών ετικετών. Στα δεδομένα πολλαπλών ετικετών, κάθε παράδειγμα συνήθως συσχετίζεται με μερικές ετικέτες που επιλέγονται από ένα σύνολο ετικετών μεγάλης κλίμακας, καθιστώντας των χώρο των ετικετών εξόδου αραιό και πολλές φορές μεροληπτικό. Αν και έχουν προταθεί πολλές μέθοδοι για την αντιμετώπιση της ανισορροπίας των δεδομένων πολλαπλών ετικετών, υπάρχει ακόμα περιθώριο περαιτέρω βελτίωσης της ακρίβειας πρόβλεψης, ειδικά για ετικέτες με μεγάλη ανισορροπία και για δύσκολα παραδείγματα. Επιπρόσθετα, μια σημαντική και ουσιαστική εφαρμογή μάθησης πολλαπλών ετικετών στον τομέα της βιοπληροφορικής είναι η πρόβλεψη της αλληλεπίδρασης φαρμάκου-στόχου (DTI) μέσω υπολογιστικών μεθόδων, η οποία επιταχύνει την κουραστική επικύρωση πειράματος in-vitro και παίζει κύριο ρόλο στη διαδικασία ανακάλυψης φαρμάκων. Ωστόσο, εκτός από την ανισορροπία κλάσεων, η πρόβλεψη DTI αντιμετωπίζει και άλλα προβλήματα, όπως η διαθεσιμότητα διμερών πλευρικών πληροφοριών, η ύπαρξη ελλιπών αλληλεπιδράσεων, και η ανάγκη για μάθηση χωρίς παραδείγματα. Τα προβλήματα αυτά καθιστούν την πρόβλεψη DTI πιο δύσκολη από τη συμβατική μάθηση πολλαπλών ετικετών. Ο γενικός στόχος αυτής της διατριβής είναι να αναπτύξει νέες τεχνικές που χειρίζονται πιο αποτελεσματικά την ανισορροπία κλάσεων των δεδομένων πολλαπλών ετικετών. Οι συνεισφορές αυτής της διατριβής αφορούν την αντιμετώπιση προκλήσεων που προκαλούνται από την ανισορροπία των δεδομένων πολλαπλών ετικετών σε τέσσερις διαφορετικές διαστάσεις. 1) Προτείνεται μια σειρά συνόλων ταξινομητών αλυσίδας που συνδυάζει την υποδειγματοληψία με σκοπό τη μείωση της ανισορροπίας κάθε ετικέτας. Τα μοντέλα αυτά όχι μόνο μοντελοποιούν συσχετίσεις ετικετών υψηλής τάξης, αλλά προωθούν επίσης την εκμετάλλευση παραδειγμάτων πλειοψηφικών κλάσεων για την πρόβλεψη ετικετών με υψηλή ανισορροπία. 2) Παρουσιάζονται τεχνικές διπλής δειγματοληψίας για δεδομένα πολλαπλών ετικετών. Αυτές προσαρμόζουν τα παραδείγματα εκπαίδευσης με βάση την τοπική ανισορροπία των ετικετών, η οποία παρέχει περισσότερη πληροφορία από ότι η καθολική ανισορροπία των ετικετών ολόκληρου του συνόλου δεδομένων. 3) Προτείνεται ο συνδυασμός μοντέλων που βασίζονται στη γειτνίαση για την πρόβλεψη DTI, ο οποίος ανακτά ελλιπείς αλληλεπιδράσεις για να αυξήσει τα δεδομένα με λίγες αλληλεπιδράσεις, και αξιοποιεί ποικίλες στρατηγικές δειγματοληψίας για κάθε ξεχωριστό μοντέλο με σκοπό τη βελτίωση της σταθερότητας και ανθεκτικότητας της πρόβλεψης. 4) Αναπτύσσονται τρεις τεχνικές πρόβλεψης DTI βάσει παραγοντοποίησης πινάκων, οι οποίες βελτιστοποιούν άμεσα μετρικές που υπολογίζουν την περιοχή κάτω της καμπύλης που χρησιμοποιούνται ευρέως στο πλαίσιο ανισορροπίας κλάσεων. Επίσης, ενσωματώνουν μια διαδικασία συνδυασμού ομοιοτήτων με βάση την τοπική συνέπεια αλληλεπίδρασης για τον χειρισμό πολλαπλών ομοιοτήτων εισόδου.

Download Full-text

Interactive Part Segmentation Using Edge Images

Applied Sciences ◽

10.3390/app112110106 ◽

2021 ◽

Vol 11 (21) ◽

pp. 10106

Author(s):

Ju-Young Oh ◽

Jung-Min Park

Keyword(s):

Deep Learning ◽

Object Segmentation ◽

User Interaction ◽

Target Object ◽

Color Images ◽

Training Data ◽

Label Data ◽

Part Segmentation ◽

Increasing Demand ◽

Segmentation Models

As more and more fields utilize deep learning, there is an increasing demand to make suitable training data for each field. The existing interactive object segmentation models can easily make the mask label data because these can accurately segment the area of the target object through user interaction. However, it is difficult to accurately segment the target part in the object using the existing models. We propose a method to increase the accuracy of part segmentation by using the proposed interactive object segmentation model trained only with edge images instead of color images. The results evaluated with the PASCAL VOC Part dataset show that the proposed method can accurately segment the target part compared to the existing interactive object segmentation model and the semantic part-segmentation model.

Download Full-text

Prediction of the chemical context for Buchwald-Hartwig coupling reactions

10.33774/chemrxiv-2021-87hqt ◽

2021 ◽

Author(s):

Samuel Genheden ◽

Agnes Mårdh ◽

Gustav Lahti ◽

Ola Engkvist ◽

Simon Olsson ◽

...

Keyword(s):

Machine Learning ◽

Historical Data ◽

Temporal Characteristic ◽

Coupling Reactions ◽

Learning Models ◽

Careful Planning ◽

Label Data ◽

Reaction Data ◽

The Individual ◽

Machine Learning Models

We present machine learning models for predicting the chemical context for Buchwald-Hartwig coupling reactions. Using reaction data from in-house electronic lab notebooks, we train two models: one based on single-label data and one based on multi-label data. Both models show excellent top-3 accuracy around 90%, which suggests strong predictivity. There seems to be an advantage of including multi-label data because the multi-label model shows higher accuracy and better sensitivity for the individual contexts than the single-label model. Although the models are performant, we also show that such models need to be re-trained periodically. There is a strong temporal characteristic to the usage of different contexts. Therefore, a model trained on historical data will decrease in usefulness with time as newer and better contexts emerge and replace older ones. We hypothesize that these significant transitions in the context-use will likely affect any model predicting chemical contexts trained on historical data. Consequently, training such models warrants careful planning of what data is used for training and how often the model needs to be re-trained.

Download Full-text

Examining Swarm Intelligence-based Feature Selection for Multi-Label Classification

Journal of Soft Computing and Data Mining ◽

10.30880/jscdm.2021.02.02.006 ◽

2021 ◽

Vol 2 (2) ◽

Author(s):

Awder Mohammed Ahmed ◽

◽

Adnan Mohsin Abdulazeez ◽

Keyword(s):

Feature Selection ◽

Swarm Intelligence ◽

Real World ◽

Computational Cost ◽

Computing Methods ◽

Open Problems ◽

Label Data ◽

Filter Methods ◽

Wide Range ◽

Classification Tasks

Multi-label classification addresses the issues that more than one class label assigns to each instance. Many real-world multi-label classification tasks are high-dimensional due to digital technologies, leading to reduced performance of traditional multi-label classifiers. Feature selection is a common and successful approach to tackling this problem by retaining relevant features and eliminating redundant ones to reduce dimensionality. There is several feature selection that is successfully applied in multi-label learning. Most of those features are wrapper methods that employ a multi-label classifier in their processes. They run a classifier in each step, which requires a high computational cost, and thus they suffer from scalability issues. Filter methods are introduced to evaluate the feature subsets using information-theoretic mechanisms instead of running classifiers to deal with this issue. Most of the existing researches and review papers dealing with feature selection in single-label data. While, recently multi-label classification has a wide range of real-world applications such as image classification, emotion analysis, text mining, and bioinformatics. Moreover, researchers have recently focused on applying swarm intelligence methods in selecting prominent features of multi-label data. To the best of our knowledge, there is no review paper that reviews swarm intelligence-based methods for multi-label feature selection. Thus, in this paper, we provide a comprehensive review of different swarm intelligence and evolutionary computing methods of feature selection presented for multi-label classification tasks. To this end, in this review, we have investigated most of the well-known and state-of-the-art methods and categorize them based on different perspectives. We then provided the main characteristics of the existing multi-label feature selection techniques and compared them analytically. We also introduce benchmarks, evaluation measures, and standard datasets to facilitate research in this field. Moreover, we performed some experiments to compare existing works, and at the end of this survey, some challenges, issues, and open problems of this field are introduced to be considered by researchers in the future.

Download Full-text

Segmentation applying TAG type label data and Transformer

10.23919/iccas52745.2021.9650042 ◽

2021 ◽

Author(s):

Choi Keonghun ◽

Jong-Eun Ha

Keyword(s):

Label Data

Download Full-text

Kernelized Supervised Laplacian Eigenmap for Visualization and Classification of Multi-Label Data

Pattern Recognition ◽

10.1016/j.patcog.2021.108399 ◽

2021 ◽

pp. 108399

Author(s):

Mariko Tai ◽

Mineichi Kudo ◽

Akira Tanaka ◽

Hideyuki Imai ◽

Keigo Kimura

Keyword(s):

Label Data

Download Full-text

Improved QSAR model for cholestasis built with FDA drug label data

Toxicology Letters ◽

10.1016/s0378-4274(21)00631-7 ◽

2021 ◽

Vol 350 ◽

pp. S165

Author(s):

M. Girireddy ◽

R. Saiakhov ◽

S. Chakravarti

Keyword(s):

Qsar Model ◽

Drug Label ◽

Label Data

Download Full-text

MURAL LINEAR FEATURE ENHANCEMENT ALGORITHM COMBINING SEMI-SUPERVISED IMAGE SEGMENTATION AND DIMENSIONALITY REDUCTION

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlvi-m-1-2021-423-2021 ◽

2021 ◽

Vol XLVI-M-1-2021 ◽

pp. 423-427

Author(s):

S. Lyu ◽

J. Mao ◽

M. Hou

Keyword(s):

Principal Component Analysis ◽

Image Segmentation ◽

Dimensionality Reduction ◽

Spatial Information ◽

Principal Component ◽

Hyperspectral Images ◽

Spectral Information ◽

Label Data ◽

Feature Enhancement ◽

Linear Feature

Abstract. Due to the influence of natural and human factors, the linear features in the murals are partially blurred, which brings great challenges to the digital preservation and virtual restoration of cultural heritage. Taking the advantages of non-invasive measurement as well as the rich image and spectral information of hyperspectral technology, we proposed a linear feature enhancement method by combining semi-supervised superpixel segmentation with block dimension reduction. The main research work includes: (1) The true color composite image was segmented to obtain the label data by using the local spatial information of the superpixel image and the global feature information extracted by fuzzy c-means (FCM) clustering.(2) According to the label data, the preprocessed hyperspectral data were divided into homogeneous regions, whose dimensionality was reduced by principal component analysis (PCA) and kernel principal component analysis (KPCA). (3) The principal component images with the largest gradient after dimensionality reduction were respectively selected and normalized. The optimal principal component images normalized by the block PCA and block KPCA dimensionality reduction algorithms are superimposed to produce the linear feature enhancement images of murals. The hyperspectral images of some murals in Qutan Temple, Qinghai Province, China were used to verify the method. The results show that the spatial information and the spectral information of different pattern areas in the hyperspectral image can be fully used by combining the superpixel FCM image segmentation algorithm with the dimensionality reduction algorithm. of. It can highlight the linear information in the hyperspectral images of fades murals.

Download Full-text

First-Order Efficient General-Purpose Clean-Label Data Poisoning

IEEE INFOCOM 2021 - IEEE Conference on Computer Communications ◽

10.1109/infocom42981.2021.9488730 ◽

2021 ◽

Author(s):

Tianhang Zheng ◽

Baochun Li

Keyword(s):

General Purpose ◽

Label Data ◽

First Order

Download Full-text

PENDEKATAN BERBASIS KECERDASAN BUATAN DENGAN METODE NAÏVE BAYES UNTUK WEBSITE BAZNAS

Jurnal Teknologi Informasi Universitas Lambung Mangkurat (JTIULM) ◽

10.20527/jtiulm.v6i1.68 ◽

2021 ◽

Vol 6 (1) ◽

pp. 1-8

Author(s):

Endi Gunawan ◽

Johan Wahyudi ◽

Yuslena Sari

Keyword(s):

Naive Bayes ◽

Confusion Matrix ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Label Data

Badan amil zakat nasional (BAZNAS) adalah badan resmi nasional dalam menyalurkan zakat. Zakat adalah kewajiban umat Islam. Hal tersebut dijelaskan dalam kitab suci Alquran. Zakat perlu disalurkan dengan adil. Banyaknya data penerima zakat menyebabkan penyaluran sering terkendala. Dengan klasifikasi zakat antara fakir dan miskin akan menjadi solusi untuk kendala data yang banyak. Teknik kecerdasan buatan yang digunakan pada penelitian ini adalah Naïve Bayes Classifier. Ini adalah teknik klasifikasi berdasarkan Teorema Bayes dengan asumsi independensi antar prediktor. Secara sederhana, pengklasifikasian Naive Bayes mengasumsikan bahwa keberadaan fitur tertentu di kelas tidak terkait dengan keberadaan fitur lainnya. Label data penerima zakat ini ada dua, yaitu fakir dan miskin. Kecerdasan buatan sebagai metode dalam pembuatan prototype sistem pada website BAZNAS dalam penentuan penerima zakat. Pengukuran performa metode naïve bayes classifier menggunakan confusion matrix. Hasil dari metode naïve bayes classifier dalam mengklasifikasi penerima zakat cukup baik yaitu 96%.

Download Full-text

label dataRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Addressing class imbalance in multi-label data

Interactive Part Segmentation Using Edge Images

Prediction of the chemical context for Buchwald-Hartwig coupling reactions

Examining Swarm Intelligence-based Feature Selection for Multi-Label Classification

Segmentation applying TAG type label data and Transformer

Kernelized Supervised Laplacian Eigenmap for Visualization and Classification of Multi-Label Data

Improved QSAR model for cholestasis built with FDA drug label data

MURAL LINEAR FEATURE ENHANCEMENT ALGORITHM COMBINING SEMI-SUPERVISED IMAGE SEGMENTATION AND DIMENSIONALITY REDUCTION

First-Order Efficient General-Purpose Clean-Label Data Poisoning

PENDEKATAN BERBASIS KECERDASAN BUATAN DENGAN METODE NAÏVE BAYES UNTUK WEBSITE BAZNAS

label data
Recently Published Documents