Unsupervised Geo-Demographic Classification of City-Area Using Multimodal Multimedia Data

Detection and Classification of Haze Affected Images Using CNN Approach

2021 ◽

Vol 12 (2) ◽

Author(s):

Senthil Kumar K Pa, et al.

Keyword(s):

Data Transmission ◽

Pattern Detection ◽

Multimedia Data ◽

Video Sequences ◽

Classification Approach ◽

Shearlet Transform ◽

Multimedia Data Transmission ◽

Indoor And Outdoor

Detecting and classifying haze-affected images is important for real-time multimedia data transmission and reception in remote mode, because it helps improve the quality of the received images or video sequences. In this paper, a Convolutional Neural Network (CNN) classification approach is used with the Shearlet Transform to detect and segment haze-affected images. The image to be tested for haze patterns is preprocessed and then decomposed with the shearlet transform. Features are computed from the decomposed shearlet coefficients and then classified by the deep learning CNN to identify haze-affected images. The proposed haze classification method is tested on both indoor and outdoor environmental images.

Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters

Deep Learning and Neural Networks ◽

10.4018/978-1-7998-0414-7.ch017 ◽

2020 ◽

pp. 274-294

Author(s):

Yilin Yan ◽

Min Chen ◽

Saad Sadiq ◽

Mei-Ling Shyu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Imbalanced Data ◽

Network Models ◽

Multimedia Data ◽

Neural Network Models ◽

Minority Class ◽

Imbalanced Data Classification

The classification of imbalanced datasets has recently attracted significant attention due to its implications in several real-world use cases. Classifiers developed on datasets with skewed distributions tend to favor the majority classes and are biased against the minority class. Despite extensive research interest, imbalanced data classification remains a challenge in data mining research, especially for multimedia data. Our attempt to overcome this hurdle is to develop a convolutional neural network (CNN) based deep learning solution integrated with a bootstrapping technique. Because convolutional neural networks are computationally very expensive, especially when coupled with big training datasets, we propose to extract features from pre-trained convolutional neural network models and feed those features to another fully connected neural network. A Spark implementation shows promising performance of our model in handling big datasets with respect to feasibility and scalability.
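
The abstract does not say which bootstrapping scheme the authors use. As a rough illustration only, the sketch below assumes the simplest variant: every class is resampled with replacement until it matches the largest class, so that a downstream classifier sees balanced batches. The function name `bootstrap_balance` and the toy feature vectors are hypothetical and stand in for features taken from a pre-trained CNN.

```python
import random
from collections import Counter


def bootstrap_balance(samples, labels, seed=0):
    """Resample each class with replacement up to the size of the largest class.

    Assumption: this is plain random oversampling, not necessarily the
    bootstrapping procedure used in the paper.
    """
    rng = random.Random(seed)

    # Group the samples by class label.
    by_class = {}
    for x, y in zip(samples, labels):
        by_class.setdefault(y, []).append(x)

    target = max(len(items) for items in by_class.values())

    out_samples, out_labels = [], []
    for y, items in by_class.items():
        # Keep every original item, then draw the shortfall with replacement.
        extra = [rng.choice(items) for _ in range(target - len(items))]
        for x in items + extra:
            out_samples.append(x)
            out_labels.append(y)
    return out_samples, out_labels


# Toy stand-ins for CNN feature vectors: four majority items, one minority item.
features = [[0.1], [0.2], [0.3], [0.4], [0.9]]
labels = ["majority"] * 4 + ["minority"]

balanced_x, balanced_y = bootstrap_balance(features, labels)
counts = Counter(balanced_y)
print(counts)
```

After balancing, both classes contribute the same number of items; the minority items are repeats of the single original, which is the usual trade-off of oversampling by replacement.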

Domain and Intelligence Based Multimedia Question Answering System

International Journal of Evaluation and Research in Education (IJERE) ◽

10.11591/ijere.v5i3.4544 ◽

2016 ◽

Vol 5 (3) ◽

pp. 227

Author(s):

Krishnamoorthi Magesh Kumar ◽

P. Valarmathie

Keyword(s):

Question Answering ◽

Multimedia Data ◽

Multimedia System ◽

The Past ◽

Question Answering Systems ◽

Query Generation ◽

Bayesian Ranking ◽

Text Images ◽

Take All

Multimedia question answering systems have become very popular over the past few years. They allow users to share their thoughts by answering a given question or to obtain information from a set of answered questions. However, existing QA systems support only textual answers, which are not very instructive for many users. The users' discussion can be enhanced by adding suitable multimedia data: multimedia answers offer intuitive information through suitable images, voice and video. The system covers classification of questions and answers, query generation, and multimedia data selection and presentation. It takes all kinds of media, such as text, images, audio and videos, and combines them with a textual answer. It also automatically collects information from the user to improve the answer, and it ranks answers to select the best one. By processing a huge set of QA pairs and adding them to a database, the approach finds multimedia answers for users by matching their questions with those in the database. The effectiveness of the multimedia system is determined by the ranking of text, image, audio and video in users' answers. An answer given by a user is processed by a semantic match algorithm, and the best answers can be viewed through a Naive Bayesian ranking system.

AD or Non-AD: A Deep Learning Approach to Detect Advertisements from Magazines

Entropy ◽

10.3390/e20120982 ◽

2018 ◽

Vol 20 (12) ◽

pp. 982 ◽

Cited By ~ 3

Author(s):

Khaled Almgren ◽

Murali Krishnan ◽

Fatima Aljanobi ◽

Jeongkyu Lee

Keyword(s):

Deep Learning ◽

Real World ◽

Marketing Strategies ◽

Multimedia Data ◽

Visual Features ◽

Image Detection ◽

Real World Application ◽

Real World Applications ◽

Scanned Images

The processing and analysis of multimedia data has become a popular research topic due to the evolution of deep learning. Deep learning has played an important role in addressing many challenging problems, such as computer vision, image recognition, and image detection, which can be useful in many real-world applications. In this study, we analyzed visual features of images to detect advertising images from scanned images of various magazines. The aim is to identify key features of advertising images and to apply them to real-world applications. The proposed work will eventually help improve marketing strategies, which requires the classification of advertising images from magazines. We employed convolutional neural networks to classify scanned images as either advertisements or non-advertisements (i.e., articles). The results show that the proposed approach outperforms other classifiers and the related work in terms of accuracy.

Machine Learning-Based Supervised Classification of Point Clouds Using Multiscale Geometric Features

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10030187 ◽

2021 ◽

Vol 10 (3) ◽

pp. 187

Author(s):

Muhammed Enes Atik ◽

Zaide Duran ◽

Dursun Zafer Seker

Keyword(s):

Machine Learning ◽

Point Cloud ◽

Supervised Classification ◽

Point Clouds ◽

Support Vector ◽

Geometric Features ◽

Mathematical Tool ◽

City Area ◽

3D Point Clouds

3D scene classification has become an important research field in photogrammetry, remote sensing, computer vision and robotics with the widespread usage of 3D point clouds. Point cloud classification, also called semantic labeling, semantic segmentation, or semantic classification of point clouds, is a challenging topic. Machine learning, on the other hand, is a powerful mathematical tool used to classify 3D point clouds whose content can be significantly complex. In this study, the classification performance of different machine learning algorithms at multiple scales was evaluated. The feature spaces of the points in the point cloud were created using geometric features generated from the eigenvalues of the covariance matrix. Eight supervised classification algorithms were tested in four different areas from three datasets (the Dublin City dataset, Vaihingen dataset and Oakland3D dataset). The algorithms were evaluated in terms of overall accuracy, precision, recall, F1 score and process time. The best overall results for the four test areas were obtained with different algorithms: 93.12% for Dublin City Area 1 with Random Forest, 92.78% for Dublin City Area 2 with a Multilayer Perceptron, 79.71% for Vaihingen with Support Vector Machines, and 97.30% for Oakland3D with Linear Discriminant Analysis.
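
The abstract does not list which eigenvalue-based features were used. As a hedged illustration, the sketch below computes three features that are commonly derived from the sorted eigenvalues λ1 ≥ λ2 ≥ λ3 of a neighborhood's covariance matrix: linearity, planarity and sphericity. The function name `eigen_features` and the two synthetic neighborhoods are assumptions made for this example, and NumPy is assumed to be available.

```python
import numpy as np


def eigen_features(points):
    """Return (linearity, planarity, sphericity) for an (N, 3) point neighborhood.

    Assumption: these three features are a common choice in the literature,
    not necessarily the exact feature set used in the paper.
    """
    pts = np.asarray(points, dtype=float)
    cov = np.cov(pts, rowvar=False)  # 3x3 covariance of the x, y, z columns

    # eigvalsh returns eigenvalues of a symmetric matrix in ascending order;
    # clip tiny negative round-off values and reverse to get l1 >= l2 >= l3.
    l1, l2, l3 = np.clip(np.linalg.eigvalsh(cov), 0.0, None)[::-1]

    if l1 == 0.0:  # all points coincide, no geometry to describe
        return 0.0, 0.0, 0.0
    return (l1 - l2) / l1, (l2 - l3) / l1, l3 / l1


# Synthetic neighborhoods: points along the x axis, and a flat grid in the xy plane.
line = [[float(t), 0.0, 0.0] for t in range(10)]
plane = [[float(x), float(y), 0.0] for x in range(5) for y in range(5)]

line_feats = eigen_features(line)
plane_feats = eigen_features(plane)
print(line_feats, plane_feats)
```

A multiscale feature space, in the sense of the abstract, could be built by repeating this computation for neighborhoods of several radii around each point and concatenating the results.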

Where not to live: a geo-demographic classification of mortality for England and Wales, 1981–2000

Health & Place ◽

10.1016/j.healthplace.2005.08.012 ◽

2006 ◽

Vol 12 (4) ◽

pp. 557-569 ◽

Cited By ~ 19

Author(s):

Nicola J. Shelton ◽

Mark H. Birkin ◽

Danny Dorling

Keyword(s):

England And Wales ◽

Demographic Classification

A Caption Text Detection Method from Images/Videos for Efficient Indexing and Retrieval of Multimedia Data

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001415550034 ◽

2015 ◽

Vol 29 (01) ◽

pp. 1555003 ◽

Cited By ~ 6

Author(s):

Samabia Tehsin ◽

Asif Masood ◽

Sumaira Kausar ◽

Yunous Javed

Keyword(s):

Feature Vector ◽

Extraction Process ◽

Text Detection ◽

Multimedia Data ◽

Support Vector ◽

Feature Vectors ◽

Text Extraction ◽

Indexing And Retrieval ◽

Detection And Localization

Textual information embedded in multimedia can provide a vital tool for indexing and retrieval. The text extraction process has many inherent problems due to variation in font size, color, background and resolution. Text detection and localization are the most challenging phases of the text extraction process, and the extraction results depend heavily on these phases. This paper focuses on text localization because of its fundamental importance. Two effective feature vectors are introduced for the classification of text and nontext objects. The first feature vector is represented by the Radon transform of text candidate objects. The second feature vector is derived from a detailed geometrical analysis of text contents. The union of the two feature vectors is used to classify text and nontext objects with a support vector machine (SVM). Text detection and localization results are evaluated on two publicly available datasets, namely ICDAR 2013 and IPC-Artificial text. Moreover, the results are compared with state-of-the-art techniques, and the comparison demonstrates the superiority of the presented research.

Digital Image Watermarking: An Overview

Oriental journal of computer science and technology ◽

10.13005/ojcst/901.02 ◽

2016 ◽

Vol 9 (1) ◽

pp. 07-11 ◽

Cited By ~ 1

Author(s):

H. B Kumar

Keyword(s):

Information Hiding ◽

Image Watermarking ◽

Original Data ◽

Spatial Domain ◽

Multimedia Data ◽

Internet Technology ◽

Transform Domain ◽

Crucial Information ◽

Transform Coefficients

Multimedia security is an extremely significant concern for internet technology because of the ease of duplication, distribution and manipulation of multimedia data. Digital watermarking is a field of information hiding that hides crucial information in the original data to protect against illegal duplication and distribution of multimedia data. Image watermarking techniques may be divided on the basis of domain, such as spatial domain or transform domain, or on the basis of wavelets. Spatial domain techniques work directly on the pixels, and frequency domain techniques work on the transform coefficients of the image. This paper presents the classification of watermarking, the stages in watermarking, watermarking approaches and their applications.
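
To make "working directly on the pixels" concrete, the sketch below shows least-significant-bit (LSB) substitution, one well-known spatial-domain scheme. The abstract does not name LSB specifically, so treat it as an illustrative choice; the function names and the eight-pixel grayscale "image" are hypothetical.

```python
def embed_lsb(pixels, bits):
    """Write one watermark bit into the least significant bit of each pixel."""
    if len(bits) > len(pixels):
        raise ValueError("watermark is longer than the cover image")
    marked = list(pixels)
    for i, bit in enumerate(bits):
        # Clear the lowest bit, then set it to the watermark bit.
        marked[i] = (marked[i] & ~1) | bit
    return marked


def extract_lsb(pixels, n_bits):
    """Read the watermark back from the lowest bit of the first n_bits pixels."""
    return [p & 1 for p in pixels[:n_bits]]


# Hypothetical 8-bit grayscale pixel values and an 8-bit watermark.
cover = [120, 121, 200, 37, 64, 255, 0, 18]
watermark = [1, 0, 1, 1, 0, 0, 1, 0]

marked = embed_lsb(cover, watermark)
recovered = extract_lsb(marked, len(watermark))
print(marked, recovered)
```

Each pixel changes by at most one gray level, which keeps the mark visually unobtrusive but also makes it fragile: any operation that disturbs the low-order bits, such as lossy compression, can destroy it. Transform-domain schemes instead embed the mark in transform coefficients.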

Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters

International Journal of Multimedia Data Engineering and Management ◽

10.4018/ijmdem.2017010101 ◽

2017 ◽

Vol 8 (1) ◽

pp. 1-20 ◽

Cited By ~ 15

Author(s):

Yilin Yan ◽

Min Chen ◽

Saad Sadiq ◽

Mei-Ling Shyu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Imbalanced Data ◽

Network Models ◽

Multimedia Data ◽

Neural Network Models ◽

Minority Class ◽

Imbalanced Data Classification

The classification of imbalanced datasets has recently attracted significant attention due to its implications in several real-world use cases. Classifiers developed on datasets with skewed distributions tend to favor the majority classes and are biased against the minority class. Despite extensive research interest, imbalanced data classification remains a challenge in data mining research, especially for multimedia data. Our attempt to overcome this hurdle is to develop a convolutional neural network (CNN) based deep learning solution integrated with a bootstrapping technique. Because convolutional neural networks are computationally very expensive, especially when coupled with big training datasets, we propose to extract features from pre-trained convolutional neural network models and feed those features to another fully connected neural network. A Spark implementation shows promising performance of our model in handling big datasets with respect to feasibility and scalability.

Demographic classification of the municipality of the Bryansk-Belgorod region of the Russian borderland

Proceedings of the International conference “InterCarto/InterGIS” ◽

10.35595/2414-9179-2019-1-25-151-162 ◽

2019 ◽

Vol 25 (1) ◽

pp. 151-162

Author(s):

Aleksandr Igonin

Keyword(s):

Demographic Classification
