BAND: A Benchmark Dataset for Bangla News Audio Classification

2021 ◽  
Author(s):  
Md. Rafi Ur Rashid ◽  
Mahim Mahbub ◽  
Muhammad Abdullah Adnan

2019 ◽  
Author(s):  
Mohammad Rezaei ◽  
Yanjun Li ◽  
Xiaolin Li ◽  
Chenglong Li

Introduction: The ability to discriminate among ligands binding to the same protein target in terms of their relative binding affinity lies at the heart of structure-based drug design. Any improvement in the accuracy and reliability of binding affinity prediction methods decreases the discrepancy between experimental and computational results.
Objectives: The primary objectives were to identify the most relevant features affecting binding affinity prediction, to minimize manual feature engineering, and to improve the reliability of binding affinity prediction using efficient deep learning models with tuned hyperparameters.
Methods: The binding site of each target protein was represented as a grid box around its bound ligand. Both binary and distance-dependent occupancies were examined to model how an atom affects its neighboring voxels in this grid. A combination of features including ANOLEA, ligand elements, and Arpeggio atom types was used to represent the input. An efficient convolutional neural network (CNN) architecture, DeepAtom, was developed, trained, and tested on the PDBbind v2016 dataset. Additionally, an extended benchmark dataset was compiled to train and evaluate the models.
Results: The best DeepAtom model showed improved accuracy in binding affinity prediction on the PDBbind core subset (Pearson's R = 0.83), outperforming recent state-of-the-art models in this field. In addition, when the DeepAtom model was trained on our proposed benchmark dataset, it yielded a higher correlation than the baseline, confirming the value of our model.
Conclusions: The promising results for the predicted binding affinities are expected to pave the way for embedding deep learning models in virtual screening and rational drug design.
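To make the grid-box representation described in the Methods concrete, a minimal voxelization sketch follows. The grid dimensions, resolution, channel assignment, and the Gaussian form used for the distance-dependent occupancy are illustrative assumptions, not the actual DeepAtom settings.

```python
import numpy as np

def voxelize_binding_site(atom_coords, atom_channels, n_channels,
                          grid_size=20.0, resolution=1.0, sigma=1.0):
    """Map atoms around a bound ligand onto a cubic grid of feature channels.

    atom_coords:   (N, 3) array of atom positions, already centered on the ligand.
    atom_channels: length-N list of channel indices (e.g. element / atom-type features).
    Returns a 4-D tensor of shape (n_channels, D, D, D).
    """
    dim = int(grid_size / resolution)
    grid = np.zeros((n_channels, dim, dim, dim), dtype=np.float32)
    # Voxel centers span [-grid_size/2, grid_size/2) along each axis.
    centers = (np.arange(dim) + 0.5) * resolution - grid_size / 2.0

    for pos, ch in zip(atom_coords, atom_channels):
        # Distance-dependent occupancy: each atom contributes a Gaussian-like
        # weight to nearby voxels instead of a single binary hit.
        dx2 = (centers - pos[0]) ** 2
        dy2 = (centers - pos[1]) ** 2
        dz2 = (centers - pos[2]) ** 2
        d2 = dx2[:, None, None] + dy2[None, :, None] + dz2[None, None, :]
        grid[ch] += np.exp(-d2 / (2.0 * sigma ** 2))

    return grid
```

A binary occupancy would simply replace the Gaussian weight with a 0/1 test on whether the atom falls inside a voxel.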


2017 ◽  
Author(s):  
Sukanya Sonowal ◽  
Tushar Sandhan ◽  
Inkyu Choi ◽  
Nam Soo Kim

Author(s):  
D.J.T. Boas ◽  
S. Poltaretskyi ◽  
J.-Y. Ramel ◽  
J. Chaoui ◽  
J. Berhouet ◽  
...  

2021 ◽  
Vol 30 ◽  
pp. 2003-2015
Author(s):  
Xinda Liu ◽  
Weiqing Min ◽  
Shuhuan Mei ◽  
Lili Wang ◽  
Shuqiang Jiang

Electronics ◽  
2021 ◽  
Vol 10 (15) ◽  
pp. 1807
Author(s):  
Sascha Grollmisch ◽  
Estefanía Cano

Including unlabeled data in the training process of neural networks using Semi-Supervised Learning (SSL) has shown impressive results in the image domain, where state-of-the-art results were obtained with only a fraction of the labeled data. What recent SSL methods have in common is that they rely strongly on the augmentation of unannotated data. This remains largely unexplored for audio data. In this work, SSL using the state-of-the-art FixMatch approach is evaluated on three audio classification tasks, covering music, industrial sounds, and acoustic scenes. The performance of FixMatch is compared to Convolutional Neural Networks (CNN) trained from scratch, Transfer Learning, and SSL using the Mean Teacher approach. Additionally, a simple yet effective approach for selecting suitable augmentation methods for FixMatch is introduced. FixMatch with the proposed modifications always outperformed Mean Teacher and the CNNs trained from scratch. For the industrial sounds and music datasets, the CNN baseline performance using the full dataset was reached with less than 5% of the initial training data, demonstrating the potential of recent SSL methods for audio data. Transfer Learning outperformed FixMatch only for the most challenging dataset, acoustic scene classification, showing that there is still room for improvement.
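For readers unfamiliar with FixMatch, a minimal sketch of one training step is given below, assuming a PyTorch-style classifier and task-specific weak/strong augmentation functions for audio features (e.g. mel spectrograms). The confidence threshold and loss weighting are illustrative defaults, not the tuned values from the paper.

```python
import torch
import torch.nn.functional as F

def fixmatch_step(model, x_labeled, y_labeled, x_unlabeled,
                  weak_augment, strong_augment,
                  threshold=0.95, lambda_u=1.0):
    """One FixMatch training step (sketch): supervised loss plus a
    consistency loss on confidently pseudo-labeled unlabeled data."""
    # Supervised loss on the weakly augmented labeled batch.
    logits_l = model(weak_augment(x_labeled))
    loss_l = F.cross_entropy(logits_l, y_labeled)

    # Pseudo-labels from weakly augmented unlabeled data, no gradient.
    with torch.no_grad():
        probs = torch.softmax(model(weak_augment(x_unlabeled)), dim=-1)
        max_probs, pseudo_labels = probs.max(dim=-1)
        mask = (max_probs >= threshold).float()  # keep only confident predictions

    # Consistency loss: strongly augmented views must match the pseudo-labels.
    logits_u = model(strong_augment(x_unlabeled))
    loss_u = (F.cross_entropy(logits_u, pseudo_labels, reduction="none") * mask).mean()

    return loss_l + lambda_u * loss_u
```

The choice of weak and strong augmentations is exactly the point the abstract highlights: for audio, which transformations qualify as "weak" or "strong" is far less settled than in the image domain.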


2021 ◽  
Vol 11 (11) ◽  
pp. 4880
Author(s):  
Abigail Copiaco ◽  
Christian Ritz ◽  
Nidhal Abdulaziz ◽  
Stefano Fasciani

Recent methodologies for audio classification frequently involve cepstral and spectral features applied to single-channel recordings of acoustic scenes and events. Further, the concept of transfer learning has been widely used over the years and has proven to be an efficient alternative to training neural networks from scratch. The lower time and resource requirements when using pre-trained models allow for more versatility in developing classification approaches. However, information on classification performance when using different features for multi-channel recordings is often limited. Furthermore, pre-trained networks are initially trained on larger databases and are often unnecessarily large. This poses a challenge when developing systems for devices with limited computational resources, such as mobile or embedded devices. This paper presents a detailed study of the most prominent and widely used cepstral and spectral features for multi-channel audio applications. Accordingly, we propose the use of spectro-temporal features. Additionally, the paper details the development of a compact version of the AlexNet model for computationally limited platforms through studies of performance under various architectural and parameter modifications of the original network. The aim is to minimize the network size while maintaining the series network architecture and preserving the classification accuracy. Considering that other state-of-the-art compact networks rely on complex directed acyclic graphs, a series architecture offers an advantage in customizability. Experimentation was carried out in MATLAB, using a database we generated for this task, which consists of four-channel synthetic recordings of both sound events and scenes. The top-performing methodology achieved a weighted F1-score of 87.92% for scalogram features classified via the modified AlexNet-33 network, which has a size of 14.33 MB. The original AlexNet returned 86.24% at a size of 222.71 MB.
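A rough sketch of what a compact, strictly series (non-DAG) network for four-channel spectro-temporal inputs could look like is given below; the layer sizes are illustrative and do not reproduce the exact AlexNet-33 configuration reported in the paper.

```python
import torch.nn as nn

class CompactAudioCNN(nn.Module):
    """An AlexNet-style series network, shrunk for computationally limited
    platforms, taking 4-channel spectro-temporal inputs (e.g. scalograms)."""

    def __init__(self, n_classes, in_channels=4):
        super().__init__()
        # Plain series of conv/pool blocks: no skip connections or branches,
        # which keeps the architecture easy to modify layer by layer.
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=7, stride=2, padding=3),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(3, stride=2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(3, stride=2),
            nn.Conv2d(64, 96, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d((4, 4)),
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Dropout(0.5),
            nn.Linear(96 * 4 * 4, 256),
            nn.ReLU(inplace=True),
            nn.Linear(256, n_classes),
        )

    def forward(self, x):
        return self.classifier(self.features(x))
```

Keeping the network as a single series of layers, as the abstract argues, makes it straightforward to prune channels or drop whole blocks when trading accuracy against model size.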


2021 ◽  
Vol 179 ◽  
pp. 108-120
Author(s):  
Weixiao Gao ◽  
Liangliang Nan ◽  
Bas Boom ◽  
Hugo Ledoux