Multi-Task Learning with Neural Networks for Voice Query Understanding on an Entertainment Platform

Multi-task learning (MTL) allows deep neural networks to learn from related tasks by sharing parameters with other networks. In practice, however, MTL involves searching an enormous space of possible parameter sharing architectures to find (a) the layers or subspaces that benefit from sharing, (b) the appropriate amount of sharing, and (c) the appropriate relative weights of the different task losses. Recent work has addressed each of the above problems in isolation. In this work we present an approach that learns a latent multi-task architecture that jointly addresses (a)–(c). We present experiments on synthetic data and data from OntoNotes 5.0, including four different tasks and seven different domains. Our extension consistently outperforms previous approaches to learning latent architectures for multi-task problems and achieves up to 15% average error reductions over common approaches to MTL.

Download Full-text

Evaluation of network traffic prediction based on neural networks with multi-task learning and multiresolution decomposition

2011 IEEE 7th International Conference on Intelligent Computer Communication and Processing ◽

10.1109/iccp.2011.6047849 ◽

2011 ◽

Cited By ~ 14

Author(s):

Melinda Barabas ◽

Georgeta Boanea ◽

Andrei B. Rus ◽

Virgil Dobrota ◽

Jordi Domingo-Pascual

Keyword(s):

Neural Networks ◽

Network Traffic ◽

Traffic Prediction ◽

Task Learning ◽

Multiresolution Decomposition

Download Full-text

Real-Time Human Pose Estimation via Cascaded Neural Networks Embedded with Multi-task Learning

Computer Analysis of Images and Patterns - Lecture Notes in Computer Science ◽

10.1007/978-3-319-64698-5_21 ◽

2017 ◽

pp. 241-252

Author(s):

Satoshi Tanabe ◽

Ryosuke Yamanaka ◽

Mitsuru Tomono ◽

Makiko Ito ◽

Teruo Ishihara

Keyword(s):

Neural Networks ◽

Real Time ◽

Pose Estimation ◽

Human Pose Estimation ◽

Task Learning ◽

Human Pose

Download Full-text

Multi-Task Learning for Multi-Dimensional Regression: Application to Luminescence Sensing

Applied Sciences ◽

10.3390/app9224748 ◽

2019 ◽

Vol 9 (22) ◽

pp. 4748 ◽

Cited By ~ 1

Author(s):

Umberto Michelucci ◽

Francesca Venturini

Keyword(s):

Neural Networks ◽

Functional Dependence ◽

Feed Forward ◽

Multiple Parameters ◽

Task Learning ◽

Non Linear ◽

Input Dataset ◽

Feed Forward Neural Networks ◽

Regression Problems ◽

Multiple Variables

The classical approach to non-linear regression in physics is to take a mathematical model describing the functional dependence of the dependent variable from a set of independent variables, and then using non-linear fitting algorithms, extract the parameters used in the modeling. Particularly challenging are real systems, characterized by several additional influencing factors related to specific components, like electronics or optical parts. In such cases, to make the model reproduce the data, empirically determined terms are built in the models to compensate for the difficulty of modeling things that are, by construction, difficult to model. A new approach to solve this issue is to use neural networks, particularly feed-forward architectures with a sufficient number of hidden layers and an appropriate number of output neurons, each responsible for predicting the desired variables. Unfortunately, feed-forward neural networks (FFNNs) usually perform less efficiently when applied to multi-dimensional regression problems, that is when they are required to predict simultaneously multiple variables that depend from the input dataset in fundamentally different ways. To address this problem, we propose multi-task learning (MTL) architectures. These are characterized by multiple branches of task-specific layers, which have as input the output of a common set of layers. To demonstrate the power of this approach for multi-dimensional regression, the method is applied to luminescence sensing. Here, the MTL architecture allows predicting multiple parameters, the oxygen concentration and temperature, from a single set of measurements.

Download Full-text

Empirical evaluation of multi-task learning in deep neural networks for natural language processing

Neural Computing and Applications ◽

10.1007/s00521-020-05268-w ◽

2020 ◽

Author(s):

Jianquan Li ◽

Xiaokang Liu ◽

Wenpeng Yin ◽

Min Yang ◽

Liqun Ma ◽

...

Keyword(s):

Neural Networks ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Deep Neural Networks ◽

Empirical Evaluation ◽

Task Learning

Download Full-text

Multi-task learning for the prediction of wind power ramp events with deep neural networks

Neural Networks ◽

10.1016/j.neunet.2019.12.017 ◽

2020 ◽

Vol 123 ◽

pp. 401-411 ◽

Cited By ~ 9

Author(s):

M. Dorado-Moreno ◽

N. Navarin ◽

P.A. Gutiérrez ◽

L. Prieto ◽

A. Sperduti ◽

...

Keyword(s):

Neural Networks ◽

Wind Power ◽

Deep Neural Networks ◽

Task Learning ◽

Ramp Events ◽

Wind Power Ramp Events

Download Full-text

Dataset-aware multi-task learning approaches for biomedical named entity recognition

Bioinformatics ◽

10.1093/bioinformatics/btaa515 ◽

2020 ◽

Vol 36 (15) ◽

pp. 4331-4338

Author(s):

Mei Zuo ◽

Yang Zhang

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

State Of The Art ◽

Named Entity Recognition ◽

Entity Recognition ◽

Quality Data ◽

Supplementary Information ◽

Named Entity ◽

Task Learning ◽

Biomedical Named Entity Recognition

Abstract Motivation Named entity recognition is a critical and fundamental task for biomedical text mining. Recently, researchers have focused on exploiting deep neural networks for biomedical named entity recognition (Bio-NER). The performance of deep neural networks on a single dataset mostly depends on data quality and quantity while high-quality data tends to be limited in size. To alleviate task-specific data limitation, some studies explored the multi-task learning (MTL) for Bio-NER and achieved state-of-the-art performance. However, these MTL methods did not make full use of information from various datasets of Bio-NER. The performance of state-of-the-art MTL method was significantly limited by the number of training datasets. Results We propose two dataset-aware MTL approaches for Bio-NER which jointly train all models for numerous Bio-NER datasets, thus each of these models could discriminatively exploit information from all of related training datasets. Both of our two approaches achieve substantially better performance compared with the state-of-the-art MTL method on 14 out of 15 Bio-NER datasets. Furthermore, we implemented our approaches by incorporating Bio-NER and biomedical part-of-speech (POS) tagging datasets. The results verify Bio-NER and POS can significantly enhance one another. Availability and implementation Our source code is available at https://github.com/zmmzGitHub/MTL-BC-LBC-BioNER and all datasets are publicly available at https://github.com/cambridgeltl/MTL-Bioinformatics-2016. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text