International Journal of Multimedia Information Retrieval

Abstract: Optical Music Recognition (OMR) and Automatic Music Transcription (AMT) are the research fields that aim at obtaining a structured digital representation from sheet music images and acoustic recordings, respectively. While these fields have traditionally evolved independently, the fact that both tasks may share the same output representation raises the question of whether they could be combined synergistically to exploit the individual transcription advantages offered by each modality. To evaluate this hypothesis, this paper presents a multimodal framework that combines the predictions of two neural end-to-end OMR and AMT systems using a local alignment approach. We assess several experimental scenarios with monophonic music pieces to evaluate our approach under different conditions of the individual transcription systems. In general, the multimodal framework clearly outperforms the single recognition modalities, attaining a relative improvement close to 40% in the best case. Our initial premise is therefore validated, opening avenues for further research in multimodal OMR-AMT transcription.

AMS-CNN: Attentive multi-stream CNN for video-based crowd counting

International Journal of Multimedia Information Retrieval ◽

10.1007/s13735-021-00220-7 ◽

2021 ◽

Author(s):

Santosh Kumar Tripathy ◽

Rajeev Srivastava

Keyword(s):

Crowd Counting

Towards a high robust neural network via feature matching

International Journal of Multimedia Information Retrieval ◽

10.1007/s13735-021-00219-0 ◽

2021 ◽

Author(s):

Jian Li ◽

Yanming Guo ◽

Songyang Lao ◽

Yulun Wu ◽

Liang Bai ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Neural Networks ◽

Feature Matching ◽

Feature Vector ◽

State Of The Art ◽

Model Performance ◽

Image Features ◽

Classification Systems ◽

Adversarial Attack

Abstract: Image classification systems have been found vulnerable to adversarial attacks, which are imperceptible to humans but can easily fool deep neural networks. Recent research indicates that regularizing the network by introducing randomness can greatly improve the model’s robustness against adversarial attacks, but the randomness module normally involves complex calculations and numerous additional parameters, and seriously degrades model performance on clean data. In this paper, we propose a feature matching module to regularize the network. Specifically, our model learns a feature vector for each category and imposes additional restrictions on image features. The similarity between image features and category features is then used as the basis for classification. Our method introduces no additional network parameters compared with the undefended model and can be easily integrated into any neural network. Experiments on the CIFAR10 and SVHN datasets show that our proposed module effectively improves accuracy on both clean and perturbed data in comparison with state-of-the-art defense methods, and outperforms the L2P method by 6.3% and 24% on clean and perturbed data, respectively, using the ResNet-V2(18) architecture.

Multi-class imbalanced image classification using conditioned GANs

International Journal of Multimedia Information Retrieval ◽

10.1007/s13735-021-00213-6 ◽

2021 ◽

Author(s):

M R Pavan Kumar ◽

Prabhu Jayagopal

Keyword(s):

Image Classification

A review on deep learning in medical image analysis

International Journal of Multimedia Information Retrieval ◽

10.1007/s13735-021-00218-1 ◽

2021 ◽

Author(s):

S. Suganyadevi ◽

V. Seethalakshmi ◽

K. Balasamy

Keyword(s):

Image Analysis ◽

Deep Learning ◽

Medical Image ◽

Medical Image Analysis

Alleviating the cold-start playlist continuation in music recommendation using latent semantic indexing

International Journal of Multimedia Information Retrieval ◽

10.1007/s13735-021-00214-5 ◽

2021 ◽

Author(s):

Ali Yürekli ◽

Cihan Kaleli ◽

Alper Bilge

Keyword(s):

Cold Start ◽

Latent Semantic Indexing ◽

Semantic Indexing ◽

Music Recommendation

Editorial: web of science and scopus impact in IJMIR

International Journal of Multimedia Information Retrieval ◽

10.1007/s13735-021-00217-2 ◽

2021 ◽

Vol 10 (3) ◽

pp. 141-141

Author(s):

Michael Lew

Keyword(s):

Web Of Science

International Journal of Multimedia Information Retrieval
Latest Publications

Published By Springer-Verlag

Enhancing the performance of 3D auto-correlation gradient features in depth action classification

A fast and robust affine-invariant method for shape registration under partial occlusion

Correction to: Different techniques for Alzheimer’s disease classification using brain images: a study

Multimodal image and audio music transcription

AMS-CNN: Attentive multi-stream CNN for video-based crowd counting

Towards a high robust neural network via feature matching

Multi-class imbalanced image classification using conditioned GANs

A review on deep learning in medical image analysis

Alleviating the cold-start playlist continuation in music recommendation using latent semantic indexing

Editorial: web of science and scopus impact in IJMIR
