Automatic Multiscale-based Peak Detection on Short Time Energy and Spectral Centroid Feature Extraction for Conversational Speech Segmentation

In this paper we present a new system to classify TV programs into predefined categories based on the analysis of their audio and video contents. This is very useful in intelligent display and storage systems that can select channels and record or skip contents according to the consumer's preference. Distinguishable patterns exist in different categories of TV programs in terms of human faces and audio. In this paper four categories divided into news, cartoon, variety and sport are of interest. News and variety have differences between frames less than sport and cartoon. For audio feature, we apply short time energy, zero crossing, spectral centroid and short time Fourier transform for feature extraction. For face feature, in the first step, Haar like feature is employed for face detection and eigenface is then applied for feature extraction. Then, neural network is implemented for classification. From experimental results, classification rate of 95% accuracy which is better than the other paper is achievable.

Download Full-text

Penerapan Algoritma Levenberg-Marquadt dan Backpropagation Neural Network Untuk Klasifikasi Suara Manusia

Jurnal Buana Informatika ◽

10.24002/jbi.v4i1.327 ◽

2012 ◽

Vol 4 (1) ◽

Author(s):

David David

Keyword(s):

Neural Network ◽

Speech Processing ◽

Backpropagation Neural Network ◽

Zero Crossing ◽

Spectral Centroid ◽

Human Voice ◽

Voice Signal ◽

Zero Crossing Rate ◽

Short Time ◽

Short Time Energy

Abstract. Voice recognition technology is currently experiencing growth, especially in the case of speech processing. Speech processing is a way to extract the desired information from a voice signal. This study discusses the classification of human voice system male and female. Extract the characteristics of the voice signal in each frame time domain and frequency domain is to help simplify and speed calculations. The features for voice or other audio between Short Time Energy, Zero Crossing Rate, Spectral Centroid, and others. Test results show that the classification system the human voice using the backpropagation neural network and Levenberg-Marquadt algorithm to change matrix weight is very good because of the complexity and rapid calculation which is not too high. Database voice sample of 40 voices with the test data as much as 5 votes. The output of the system is the result of the classification that has been identified with a similarity value>=0.5 for male and <0.5 as a female. Testing using artificial neural network produced an average success rate in voice classification amounted to 91%.Keywords: Feature Extraction, Classification, Backpropagation, Levenberg-Marquadt Algorithm, Human VoiceÂ Abstrak. Teknologi pengenalan suara saat ini telah mengalami perkembangan terutama dalam hal speech processing. Speech processing merupakan suatu cara untuk mengekstrak informasi yang diinginkan dari sebuah sinyal suara. Penelitian ini membahas sistem klasifikasi suara manusia male dan female. Mengekstrak ciri dari sinyal suara setiap frame pada kawasan waktu dan kawasan frekuensi sangat membantu untukÂ menyederhanakan dan mempercepat perhitungan. Adapun fitur-fitur untuk suara atau audio antara lain Short Time Energy, Zero Crossing Rate, Spectral Centroid dan lain-lain. Hasil pengujian sistem menunjukkan bahwa klasifikasi suara manusia dengan menggunakan jaringan saraf tiruan backpropagation dan algoritma Levenberg-Marquadt untuk perubahan matriks bobot, sangat baik dan cepat karena kompleksitas perhitungan yang tidak terlalu tinggi. Database sample suara sebanyak 40 buah dengan data test sebanyak 5 suara. Output dari sistem adalah hasil klasifikasi yang telah dikenali dengan nilai kemiripan >= 0,5 sebagai pria dan < 0,5 sebagai wanita. Pengujian dengan menggunakan jaringan saraf tiruan dihasilkan rata-rata tingkat keberhasilan dalam klasifikasi suara adalah sebesar 91 %.Kata Kunci: Feature Extraction, Klasifikasi, Backpropagation, Algoritma Levenberg-Marquadt, Suara Manusia

Download Full-text

Inferring Long-Term Demand of Newly Established Stations for Expansion Areas in Bike Sharing System

Applied Sciences ◽

10.3390/app11156748 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6748

Author(s):

Hsun-Ping Hsieh ◽

Fandel Lin ◽

Jiawei Jiang ◽

Tzu-Ying Kuo ◽

Yu-En Chang

Keyword(s):

New York ◽

Feature Extraction ◽

Real World ◽

Extraction Methods ◽

Real World Data ◽

Urban Dynamics ◽

Bike Sharing ◽

The Government ◽

Short Time

Research on flourishing public bike-sharing systems has been widely discussed in recent years. In these studies, many existing works focus on accurately predicting individual stations in a short time. This work, therefore, aims to predict long-term bike rental/drop-off demands at given bike station locations in the expansion areas. The real-world bike stations are mainly built-in batches for expansion areas. To address the problem, we propose LDA (Long-Term Demand Advisor), a framework to estimate the long-term characteristics of newly established stations. In LDA, several engineering strategies are proposed to extract discriminative and representative features for long-term demands. Moreover, for original and newly established stations, we propose several feature extraction methods and an algorithm to model the correlations between urban dynamics and long-term demands. Our work is the first to address the long-term demand of new stations, providing the government with a tool to pre-evaluate the bike flow of new stations before deployment; this can avoid wasting resources such as personnel expense or budget. We evaluate real-world data from New York City’s bike-sharing system, and show that our LDA framework outperforms baseline approaches.

Download Full-text

Short-Time Fourier Transform Covariance and Selection, A Feature Extraction Method for Binary Motor Imagery Classification

10.1109/rcar52367.2021.9517461 ◽

2021 ◽

Author(s):

Yue Ma ◽

Liangsheng Zheng ◽

Zhengkun Yi ◽

Yang Xiao ◽

Can Wang ◽

...

Keyword(s):

Feature Extraction ◽

Fourier Transform ◽

Motor Imagery ◽

Extraction Method ◽

Short Time Fourier Transform ◽

Feature Extraction Method ◽

Short Time

Download Full-text

Extracting Features from Time Series

Fundamentals of Clinical Data Science ◽

10.1007/978-3-319-99713-1_7 ◽

2018 ◽

pp. 85-100 ◽

Cited By ~ 1

Author(s):

Christian Herff ◽

Dean J. Krusienski

Keyword(s):

Time Series ◽

Feature Extraction ◽

Clinical Data ◽

Noise Filtering ◽

Time Intervals ◽

Time Points ◽

Biomedical Systems ◽

Patient Weight ◽

Short Time

AbstractClinical data is often collected and processed as time series: a sequence of data indexed by successive time points. Such time series can be from sources that are sampled over short time intervals to represent continuous biophysical wave-(one word waveforms) forms such as the voltage measurements representing the electrocardiogram, to measurements that are sampled daily, weekly, yearly, etc. such as patient weight, blood triglyceride levels, etc. When analyzing clinical data or designing biomedical systems for measurements, interventions, or diagnostic aids, it is important to represent the information contained within such time series in a more compact or meaningful form (e.g., noise filtering), amenable to interpretation by a human or computer. This process is known as feature extraction. This chapter will discuss some fundamental techniques for extracting features from time series representing general forms of clinical data.

Download Full-text

A comparative study of speech segmentation and feature extraction on the recognition of different dialects

IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028) ◽

10.1109/icsmc.1999.814149 ◽

2003 ◽

Cited By ~ 3

Author(s):

B.N.L. Li ◽

J.N.K. Liu

Keyword(s):

Feature Extraction ◽

Comparative Study ◽

Speech Segmentation

Download Full-text

Sensitivity and positive prediction accuracy analysis for r peak detection in ECG feature extraction

2017 2nd International Conference for Convergence in Technology (I2CT) ◽

10.1109/i2ct.2017.8226216 ◽

2017 ◽

Author(s):

Pratik D. Sherathia ◽

V. P. Patel

Keyword(s):

Feature Extraction ◽

Prediction Accuracy ◽

Peak Detection ◽

Accuracy Analysis ◽

Positive Prediction

Download Full-text

A Novel Artificial Intelligence Technique for Analysis of Real-Time Electro-Cardiogram Signal for the Prediction of Early Cardiac Ailment Onset

Handbook of Research on Advancements of Artificial Intelligence in Healthcare Engineering - Advances in Healthcare Information Systems and Administration ◽

10.4018/978-1-7998-2120-5.ch003 ◽

2020 ◽

pp. 42-66

Author(s):

Dinesh Bhatia ◽

Animesh Mishra

Keyword(s):

Feature Extraction ◽

Fourier Transform ◽

Fractional Fourier Transform ◽

Mapping Technique ◽

Self Organizing Maps ◽

Map Generation ◽

Unsupervised Training ◽

Cardio Vascular ◽

Short Time

The role of ECG analysis in the diagnosis of cardio-vascular ailments has been significant in recent times. Although effective, the present computational algorithms lack accuracy, and no technique till date is capable of predicting the onset of a CVD condition with precision. In this chapter, the authors attempt to formulate a novel mapping technique based on feature extraction using fractional Fourier transform (FrFT) and map generation using self-organizing maps (SOM). FrFT feature extraction from the ECG data has been performed in a manner reminiscent of short time Fourier transform (STFT). Results show capability to generate maps from the isolated ECG wavetrains with better prediction capability to ascertain the onset of CVDs, which is not possible using conventional algorithms. Promising results provide the ability to visualize the data in a time evolution manner with the help of maps and histograms to predict onset of different CVD conditions and the ability to generate the required output with unsupervised training helping in greater generalization than previous reported techniques.

Download Full-text

Über Energien von Drahtexplosionsstoßwellen / Energies of Shock Waves Produced bv Wire Explosions

Zeitschrift für Naturforschung A ◽

10.1515/zna-1973-0118 ◽

1973 ◽

Vol 28 (1) ◽

pp. 105-109 ◽

Cited By ~ 1

Author(s):

H. Jäger ◽

R. Schöfer

Keyword(s):

Shock Wave ◽

Shock Waves ◽

Energy Input ◽

Discharge Circuit ◽

Expansion Velocity ◽

Input Condition ◽

Wire Material ◽

Short Time ◽

The Waves ◽

Short Time Energy

For shock waves produced by special wire explosions the short time energy input condition of the theories of Lin, Sakurai and Vlases-Jones is fairly good fulfilled. In these cases the shock wave energies can be easily determined from the expansion velocity of the waves. Variation of the parameters of the discharge circuit show, how these parameters should be chosen in order to get a maximum transfer of energy either to the shock waves or to the wire material.

Download Full-text

Feature Extraction of Impulse Faults for Vibration Signals Based on Sparse Non-Negative Tensor Factorization

Applied Sciences ◽

10.3390/app9183642 ◽

2019 ◽

Vol 9 (18) ◽

pp. 3642

Author(s):

Lin Liang ◽

Haobin Wen ◽

Fei Liu ◽

Guang Li ◽

Maolin Li

Keyword(s):

Feature Extraction ◽

Fourier Transform ◽

Mechanical Equipment ◽

Tensor Factorization ◽

Short Time Fourier Transform ◽

Vibration Signals ◽

Frequency Distributions ◽

Time Frequency ◽

Value Decomposition ◽

Short Time

The incipient damages of mechanical equipment excite weak impulse vibration, which is hidden, almost unobservable, in the collected signal, making fault detection and failure prevention at the inchoate stage rather challenging. Traditional feature extraction techniques, such as bandpass filtering and time-frequency analysis, are suitable for matrix processing but challenged by the higher-order data. To tackle these problems, a novel method of impulse feature extraction for vibration signals, based on sparse non-negative tensor factorization is presented in this paper. Primarily, the phase space reconstruction and the short time Fourier transform are successively employed to convert the original signal into time-frequency distributions, which are further arranged into a three-way tensor to obtain a time-frequency multi-aspect array. The tensor is decomposed by sparse non-negative tensor factorization via hierarchical alternating least squares algorithm, after which the latent components are reconstructed from the factors by the inverse short time Fourier transform and eventually help extract the impulse feature through envelope analysis. For performance verification, the experimental analysis on the bearing datasets and the swashplate piston pump has confirmed the effectiveness of the proposed method. Comparisons to the traditional methods, including maximum correlated kurtosis deconvolution, singular value decomposition, and maximum spectrum kurtosis, also suggest its better performance of feature extraction.

Download Full-text