Adaptive Model Scheduling for Resource-efficient Data Labeling

Mu Yuan; Lan Zhang; Xiang-Yang Li; Lin-Zhuo Yang; Hui Xiong

doi:10.1145/3494559

Adaptive Model Scheduling for Resource-efficient Data Labeling

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3494559 ◽

2022 ◽

Vol 16 (4) ◽

pp. 1-22

Author(s):

Mu Yuan ◽

Lan Zhang ◽

Xiang-Yang Li ◽

Lin-Zhuo Yang ◽

Hui Xiong

Keyword(s):

Deep Learning ◽

Heuristic Algorithms ◽

Data Item ◽

Adaptive Model ◽

Learning Models ◽

The People ◽

Novel Approach ◽

Machine Learning Model ◽

Efficient Data ◽

Model Execution

Labeling data (e.g., labeling the people, objects, actions, and scene in images) comprehensively and efficiently is a widely needed but challenging task. Numerous models were proposed to label various data and many approaches were designed to enhance the ability of deep learning models or accelerate them. Unfortunately, a single machine-learning model is not powerful enough to extract various semantic information from data. Given certain applications, such as image retrieval platforms and photo album management apps, it is often required to execute a collection of models to obtain sufficient labels. With limited computing resources and stringent delay, given a data stream and a collection of applicable resource-hungry deep-learning models, we design a novel approach to adaptively schedule a subset of these models to execute on each data item, aiming to maximize the value of the model output (e.g., the number of high-confidence labels). Achieving this lofty goal is nontrivial since a model’s output on any data item is content-dependent and unknown until we execute it. To tackle this, we propose an Adaptive Model Scheduling framework, consisting of (1) a deep reinforcement learning-based approach to predict the value of unexecuted models by mining semantic relationship among diverse models, and (2) two heuristic algorithms to adaptively schedule the model execution order under a deadline or deadline-memory constraints, respectively. The proposed framework does not require any prior knowledge of the data, which works as a powerful complement to existing model optimization technologies. We conduct extensive evaluations on five diverse image datasets and 30 popular image labeling models to demonstrate the effectiveness of our design: our design could save around 53% execution time without loss of any valuable labels.

Download Full-text

Fake news detection using deep learning models: A novel approach

Transactions on Emerging Telecommunications Technologies ◽

10.1002/ett.3767 ◽

2019 ◽

Vol 31 (2) ◽

Cited By ~ 1

Author(s):

Sachin Kumar ◽

Rohan Asthana ◽

Shashwat Upadhyay ◽

Nidhi Upreti ◽

Mohammad Akbar

Keyword(s):

Deep Learning ◽

Fake News ◽

Learning Models ◽

Novel Approach

Download Full-text

Deep-Cov19-Hate: A Textual-Based Novel Approach for Automatic Detection of Hate Speech in Online Social Networks throughout COVID-19 with Shallow and Deep Learning Models

Tehnicki vjesnik - Technical Gazette ◽

10.17559/tv-20210708143535 ◽

2022 ◽

Vol 29 (1) ◽

Keyword(s):

Social Networks ◽

Deep Learning ◽

Online Social Networks ◽

Hate Speech ◽

Automatic Detection ◽

Learning Models ◽

Novel Approach

Download Full-text

A Novel Approach of Ensembling the Transfer Learning Methods for Rice Plant Disease Detection and Classification

Webology ◽

10.14704/web/v18i2/web18331 ◽

2021 ◽

Vol 18 (2) ◽

pp. 439-448

Author(s):

Parameswar Kanuparthi ◽

Vaibhav Bejgam ◽

V. Madhu Viswanatham

Keyword(s):

Transfer Learning ◽

Rice Plant ◽

Machine Learning Algorithms ◽

Majority Voting ◽

Brown Spot ◽

Generalization Error ◽

Learning Models ◽

The People ◽

Novel Approach ◽

Crop Type

Agriculture, the primary sector of Indian economy. It contributes around 18 percent of overall GDP (Gross Domestic Product). More than fifty percent of Indians belong to an agricultural background. There is a necessary to rapidly increase the agriculture production in India due to the vast increasing of population. The significant crop type for most of the people in India is rice but it was one of the crops that has been mostly affected by the cause of diseases in majority of the cases. This results in reduced yield that lead to loss for farmers. The major challenges faced while cultivating the rice crops is getting infected by the diseases due to the various effects that include environmental conditions, pesticides used and natural disasters. Early detection of rice diseases will eventually help farmers to get out from disasters and help in better yield. In this paper, we are proposing a new method of ensembling the transfer learning models to detect the rice plant and classify the diseases using images. Using this model, the three most common rice crop diseases are detected such as Brown spot, Leaf smut and Bacterial leaf blight. Generally, transfer learning uses pre-trained models and gives better accuracy for the image datasets. Also, ensembling of machine learning algorithms (combining two or more ML algorithms) will help in reducing the generalization error and also makes the model more robust. Ensemble learning is becoming trendier as it reduces generalization error as well as makes the model more robust. The ensembling technique that was used in the paper is majority voting. Here we are proposing a novel model that ensembles three transfer learning models which are InceptionV3, MobileNetV2 and DenseNet121 with an accuracy of 96.42%.

Download Full-text

A novel approach based on combining deep learning models with statistical methods for COVID-19 time series forecasting

Neural Computing and Applications ◽

10.1007/s00521-021-06548-9 ◽

2021 ◽

Author(s):

Hossein Abbasimehr ◽

Reza Paki ◽

Aram Bahrini

Keyword(s):

Time Series ◽

Deep Learning ◽

Statistical Methods ◽

Time Series Forecasting ◽

Learning Models ◽

Novel Approach

Download Full-text

Efficient Pneumonia Detection in Chest Xray Images Using Deep Transfer Learning

Diagnostics ◽

10.3390/diagnostics10060417 ◽

2020 ◽

Vol 10 (6) ◽

pp. 417 ◽

Cited By ~ 5

Author(s):

Mohammad Farukh Hashmi ◽

Satyarth Katiyar ◽

Avinash G Keskar ◽

Neeraj Dhanraj Bokde ◽

Zong Woo Geem

Keyword(s):

Deep Learning ◽

Transfer Learning ◽

Data Augmentation ◽

Medical Center ◽

Training Dataset ◽

Test Accuracy ◽

X Rays ◽

Learning Models ◽

Novel Approach ◽

Unseen Data

Pneumonia causes the death of around 700,000 children every year and affects 7% of the global population. Chest X-rays are primarily used for the diagnosis of this disease. However, even for a trained radiologist, it is a challenging task to examine chest X-rays. There is a need to improve the diagnosis accuracy. In this work, an efficient model for the detection of pneumonia trained on digital chest X-ray images is proposed, which could aid the radiologists in their decision making process. A novel approach based on a weighted classifier is introduced, which combines the weighted predictions from the state-of-the-art deep learning models such as ResNet18, Xception, InceptionV3, DenseNet121, and MobileNetV3 in an optimal way. This approach is a supervised learning approach in which the network predicts the result based on the quality of the dataset used. Transfer learning is used to fine-tune the deep learning models to obtain higher training and validation accuracy. Partial data augmentation techniques are employed to increase the training dataset in a balanced way. The proposed weighted classifier is able to outperform all the individual models. Finally, the model is evaluated, not only in terms of test accuracy, but also in the AUC score. The final proposed weighted classifier model is able to achieve a test accuracy of 98.43% and an AUC score of 99.76 on the unseen data from the Guangzhou Women and Children’s Medical Center pneumonia dataset. Hence, the proposed model can be used for a quick diagnosis of pneumonia and can aid the radiologists in the diagnosis process.

Download Full-text

Deep Learning Applications in Medical Imaging

Deep Learning Applications in Medical Imaging - Advances in Medical Technologies and Clinical Practice ◽

10.4018/978-1-7998-5071-7.ch008 ◽

2021 ◽

pp. 178-208

Author(s):

S. Sasikala ◽

S. J. Subhashini ◽

P. Alli ◽

J. Jane Rubel Angelina

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Deep Learning ◽

Medical Imaging ◽

Learning Model ◽

Learning Models ◽

Driving System ◽

Machine Learning Model ◽

Car Driving ◽

Machine Learning Models

Machine learning is a technique of parsing data, learning from that data, and then applying what has been learned to make informed decisions. Deep learning is actually a subset of machine learning. It technically is machine learning and functions in the same way, but it has different capabilities. The main difference between deep and machine learning is, machine learning models become well progressively, but the model still needs some guidance. If a machine learning model returns an inaccurate prediction, then the programmer needs to fix that problem explicitly, but in the case of deep learning, the model does it by itself. Automatic car driving system is a good example of deep learning. On other hand, Artificial Intelligence is a different thing from machine learning and deep learning. Deep learning and machine learning both are the subsets of AI.

Download Full-text

SongExplorer: A deep learning workflow for discovery and segmentation of animal acoustic communication signals

10.1101/2021.03.26.437280 ◽

2021 ◽

Author(s):

Benjamin J. Arthur ◽

Yun Ding ◽

Medhini Sosale ◽

Faduma Khalif ◽

Elizabeth Kim ◽

...

Keyword(s):

Deep Learning ◽

Heuristic Algorithms ◽

Learning Algorithm ◽

Communication Signals ◽

Deep Learning Algorithm ◽

Machine Learning Model ◽

Automated Algorithms ◽

Low Dimensional ◽

Similar Accuracy ◽

Song Types

AbstractMany animals produce distinct sounds or substrate-borne vibrations, but these signals have proved challenging to segment with automated algorithms. We have developed SongExplorer, a web-browser based interface wrapped around a deep-learning algorithm that supports an interactive workflow for (1) discovery of animal sounds, (2) manual annotation, (3) supervised training of a deep convolutional neural network, and (4) automated segmentation of recordings. Raw data can be explored by simultaneously examining song events, both individually and in the context of the entire recording, watching synced video, and listening to song. We provide a simple way to visualize many song events from large datasets within an interactive low-dimensional visualization, which facilitates detection and correction of incorrectly labelled song events. The machine learning model we implemented displays higher accuracy than existing heuristic algorithms and similar accuracy as two expert human annotators. We show that SongExplorer allows rapid detection of all song types from new species and of novel song types in previously well-studied species.

Download Full-text

Using deep learning models for learning semantic text similarity of Arabic questions

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v11i4.pp3519-3528 ◽

2021 ◽

Vol 11 (4) ◽

pp. 3519

Author(s):

Mahmoud Hammad ◽

Mohammed Al-Smadi ◽

Qanita Bani Baker ◽

Sa’ad A. Al-Zboon

Keyword(s):

Deep Learning ◽

Question Answering ◽

Supervised Machine Learning ◽

Learning Models ◽

Text Similarity ◽

Baseline Model ◽

Life Problems ◽

Machine Learning Model ◽

Recurrent Architecture ◽

The Right

Question-answering platforms serve millions of users seeking knowledge and solutions for their daily life problems. However, many knowledge seekers are facing the challenge to find the right answer among similar answered questions and writer’s responding to asked questions feel like they need to repeat answers many times for similar questions. This research aims at tackling the problem of learning the semantic text similarity among different asked questions by using deep learning. Three models are implemented to address the aforementioned problem: i) a supervised-machine learning model using XGBoost trained with pre-defined features, ii) an adapted Siamese-based deep learning recurrent architecture trained with pre-defined features, and iii) a Pre-trained deep bidirectional transformer based on BERT model. Proposed models were evaluated using a reference Arabic dataset from the mawdoo3.com company. Evaluation results show that the BERT-based model outperforms the other two models with an F1=92.99%, whereas the Siamese-based model comes in the second place with F1=89.048%, and finally, the XGBoost as a baseline model achieved the lowest result of F1=86.086%.

Download Full-text

Adversarial Attacks on Neural Networks for Graph Data

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/872 ◽

2019 ◽

Cited By ~ 24

Author(s):

Daniel Zügner ◽

Amir Akbarnejad ◽

Stephan Günnemann

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Test Time ◽

Graph Structure ◽

Classification Models ◽

Learning Models ◽

Important Data ◽

Machine Learning Model ◽

Attributed Graphs ◽

Node Classification

Deep learning models for graphs have achieved strong performance for the task of node classification. Despite their proliferation, currently there is no study of their robustness to adversarial attacks. Yet, in domains where they are likely to be used, e.g. the web, adversaries are common. Can deep learning models for graphs be easily fooled? In this extended abstract we summarize the key findings and contributions of our work, in which we introduce the first study of adversarial attacks on attributed graphs, specifically focusing on models exploiting ideas of graph convolutions. In addition to attacks at test time, we tackle the more challenging class of poisoning/causative attacks, which focus on the training phase of a machine learning model. We generate adversarial perturbations targeting the node's features and the graph structure, thus, taking the dependencies between instances in account. Moreover, we ensure that the perturbations remain unnoticeable by preserving important data characteristics. To cope with the underlying discrete domain we propose an efficient algorithm Nettack exploiting incremental computations. Our experimental study shows that accuracy of node classification significantly drops even when performing only few perturbations. Even more, our attacks are transferable: the learned attacks generalize to other state-of-the-art node classification models and unsupervised approaches, and likewise are successful given only limited knowledge about the graph.

Download Full-text

Constructing Automatic Classification Models for Chinese-language Chief Complaint (Preprint)

10.2196/preprints.32228 ◽

2021 ◽

Author(s):

Si Shen

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Automatic Classification ◽

Chief Complaint ◽

Classification Task ◽

Performance Difference ◽

Learning Models ◽

Text Data ◽

Machine Learning Model ◽

Expansion Model

BACKGROUND Chief complaint is the initial, general, and written description of a patient’s symptoms provided during the hospital intake process. By improving the automatic classification of chief complaint text, the quality and efficiency of patients’ hospital visits can be improved. OBJECTIVE Using chief complaint data in Chinese from the Information Centre of Jiangsu Commission Health, we built models for automatically detecting the correct treating department and then conducted various tests on those models using machine learning and deep learning. METHODS The study tested and compared the performances of the traditional machine learning model of SVM with deep learning models of Bi-LSTM, Bi-LSTM-CRF, At-Bi-LSTM-CRF and Bi-GRU-CRF on the chief complaint text data mainly. It is mainly based on Chinese character expansion model train and test in all traditional machine learning and deep learning models. RESULTS We found that the Bi-LSTM performed better at the chief complaint classification task than the SVM and that the performance difference between the deep learning models constructed is not obvious. The F scores of Bi-LSTM, Bi-LSTM-CRF, At-Bi-LSTM-CRF and Bi-GRU-CRF model built for the experiment effectively reach 88.10, 87.91, 88.14 and 87.98. CONCLUSIONS We found that the Bi-LSTM performed better at the chief complaint classification task than the SVM and that the performance difference between the deep learning models constructed is not obvious. The F scores of Bi-LSTM, Bi-LSTM-CRF, At-Bi-LSTM-CRF and Bi-GRU-CRF model built for the experiment effectively reach 88.10, 87.91, 88.14 and 87.98.

Download Full-text