Authenticity investigation of digital audio recorded as MP3 files

2014 ◽  
Vol 283 ◽  
pp. 54-67
Author(s):  
Rafał Korycki ◽  

In the work, the problem of detecting discontinuities in lossily compressed audio recordings was outlined and new methods that can be used to examine the authenticity of digital audio records were presented. The described solutions are based on statistical analysis of the data, calculated on the basis of the value of MDCT coefficients. Designated vectors, consisting of 228 features, were used as the training sequences of two machinę learning algorithms under the supervision of the linear discriminant analysis (LDA) and the support vector machinę (SVM). Detection of multiple compression was both used to detect modification of the recording as well as to reveal traces of montage in digital audio recordings. The effectiveness of the algorithms for the detection of discontinuities was tested on the database of recorded musie consisting of nearly one million MP3 files, specially prepared forthis purpose. The results of the research were discussed in the context of the influence of parameters of the compression on the abiiity to detect interference in digital audio recordings.

Author(s):  
S. R. Mani Sekhar ◽  
G. M. Siddesh

Machine learning is one of the important areas in the field of computer science. It helps to provide an optimized solution for the real-world problems by using past knowledge or previous experience data. There are different types of machine learning algorithms present in computer science. This chapter provides the overview of some selected machine learning algorithms such as linear regression, linear discriminant analysis, support vector machine, naive Bayes classifier, neural networks, and decision trees. Each of these methods is illustrated in detail with an example and R code, which in turn assists the reader to generate their own solutions for the given problems.


2015 ◽  
Vol 287 ◽  
pp. 84-94
Author(s):  
Rafał Korycki ◽  

The paper outlines the problem of identification of MP3 encoders and recording devices based on the analysis of audio recordings subjected to lossy date compression. The proposed method can be used as a support for other solutions used to detect double compression and discontinuities. The approach is based on the statistical analysis of the variables obtained directly from the MP3 data stream and constitute an inherent element of compression performance. Designated vectors consisting of 46 features were used as training sequences of linear discriminant analysis (LDA), one of the most popular supervised machine learning algorithms. The effectiveness of this algorithms for the identification of MP3 encoders and recording equipment was tested on a music database consisting of nearly one million MP3 files, specially prepared for this purpose. The results of the research were discussed in the context of the influence of compression parameters on the ability to detect falsification in digital audio recordings.


2020 ◽  
Vol 32 (02) ◽  
pp. 2050010
Author(s):  
Fatma EL-Zahraa M. Labib ◽  
Islam A. Fouad ◽  
Mai S. Mabrouk ◽  
Amr A. Sharawy

A brain–computer interface (BCI) can be used for people with severe physical disabilities such as ALS or amyotrophic lateral sclerosis. BCI can allow these individuals to communicate again by creating a new communication channel directly from the brain to an output device. BCI technology can allow paralyzed people to share their intent with others, and thereby demonstrate that direct communication from the brain to the external world is possible and that it might serve useful functions. BCI systems include machine learning algorithms (MLAs). Their performance depends on the feature extraction and classification techniques employed. In this paper, we propose a system to exploit the P300 signal in the brain, a positive deflection in event-related potentials. The P300 signal can be incorporated into a spelling device. There are two benefits behind this kind of research. First of all, this work presents the research status and the advantages of communication via a BCI system, especially the P300 BCI system for disordered people, and the related literature review is presented. Secondly, the paper discusses the performance of different machine learning algorithms. Two different datasets are presented: the first dataset 2004 and the second dataset 2019. A preprocessing step is introduced to the subjects in both datasets first to extract the important features before applying the proposed machine learning methods: linear discriminant analysis (LDA I and LDA II), support vector machine (SVM I, SVM II, SVM III, and SVM IV), linear regression (LREG), Bayesian linear discriminant analysis (BLDA), and twin support vector machine (TSVM). By comparing the performance of the different machine learning systems, in the first dataset it is found that BLDA and SVMIV classifiers yield the highest performance for both subjects “A” and “B”. BLDA yields 98% and 66% for 15th and 5th sequences, respectively, whereas SVMIV yields 98% and 54.4% for 15th and 5th sequences, respectively. While in the second dataset, it is obvious that BLDA classifier yields the highest performance for both subjects “1” and “2”, it achieves 90.115%. The paper summarizes the P300 BCI system for the two introduced datasets. It discusses the proposed system, compares the classification methods performances, and considers some aspects for the future work to be handled. The results show high accuracy and less computational time which makes the system more applicable for online applications.


Author(s):  
S. R. Mani Sekhar ◽  
G. M. Siddesh

Machine learning is one of the important areas in the field of computer science. It helps to provide an optimized solution for the real-world problems by using past knowledge or previous experience data. There are different types of machine learning algorithms present in computer science. This chapter provides the overview of some selected machine learning algorithms such as linear regression, linear discriminant analysis, support vector machine, naive Bayes classifier, neural networks, and decision trees. Each of these methods is illustrated in detail with an example and R code, which in turn assists the reader to generate their own solutions for the given problems.


2021 ◽  
Vol 186 (Supplement_1) ◽  
pp. 445-451
Author(s):  
Yifei Sun ◽  
Navid Rashedi ◽  
Vikrant Vaze ◽  
Parikshit Shah ◽  
Ryan Halter ◽  
...  

ABSTRACT Introduction Early prediction of the acute hypotensive episode (AHE) in critically ill patients has the potential to improve outcomes. In this study, we apply different machine learning algorithms to the MIMIC III Physionet dataset, containing more than 60,000 real-world intensive care unit records, to test commonly used machine learning technologies and compare their performances. Materials and Methods Five classification methods including K-nearest neighbor, logistic regression, support vector machine, random forest, and a deep learning method called long short-term memory are applied to predict an AHE 30 minutes in advance. An analysis comparing model performance when including versus excluding invasive features was conducted. To further study the pattern of the underlying mean arterial pressure (MAP), we apply a regression method to predict the continuous MAP values using linear regression over the next 60 minutes. Results Support vector machine yields the best performance in terms of recall (84%). Including the invasive features in the classification improves the performance significantly with both recall and precision increasing by more than 20 percentage points. We were able to predict the MAP with a root mean square error (a frequently used measure of the differences between the predicted values and the observed values) of 10 mmHg 60 minutes in the future. After converting continuous MAP predictions into AHE binary predictions, we achieve a 91% recall and 68% precision. In addition to predicting AHE, the MAP predictions provide clinically useful information regarding the timing and severity of the AHE occurrence. Conclusion We were able to predict AHE with precision and recall above 80% 30 minutes in advance with the large real-world dataset. The prediction of regression model can provide a more fine-grained, interpretable signal to practitioners. Model performance is improved by the inclusion of invasive features in predicting AHE, when compared to predicting the AHE based on only the available, restricted set of noninvasive technologies. This demonstrates the importance of exploring more noninvasive technologies for AHE prediction.


Author(s):  
Pratyush Kaware

In this paper a cost-effective sensor has been implemented to read finger bend signals, by attaching the sensor to a finger, so as to classify them based on the degree of bent as well as the joint about which the finger was being bent. This was done by testing with various machine learning algorithms to get the most accurate and consistent classifier. Finally, we found that Support Vector Machine was the best algorithm suited to classify our data, using we were able predict live state of a finger, i.e., the degree of bent and the joints involved. The live voltage values from the sensor were transmitted using a NodeMCU micro-controller which were converted to digital and uploaded on a database for analysis.


Author(s):  
Sheela Rani P ◽  
Dhivya S ◽  
Dharshini Priya M ◽  
Dharmila Chowdary A

Machine learning is a new analysis discipline that uses knowledge to boost learning, optimizing the training method and developing the atmosphere within which learning happens. There square measure 2 sorts of machine learning approaches like supervised and unsupervised approach that square measure accustomed extract the knowledge that helps the decision-makers in future to require correct intervention. This paper introduces an issue that influences students' tutorial performance prediction model that uses a supervised variety of machine learning algorithms like support vector machine , KNN(k-nearest neighbors), Naïve Bayes and supplying regression and logistic regression. The results supported by various algorithms are compared and it is shown that the support vector machine and Naïve Bayes performs well by achieving improved accuracy as compared to other algorithms. The final prediction model during this paper may have fairly high prediction accuracy .The objective is not just to predict future performance of students but also provide the best technique for finding the most impactful features that influence student’s while studying.


Author(s):  
Vidyashree M S

Abstract: Blood Cancer cells forming a tissue is called lymphoma. Thus, disease decreases the cells to fight against the infection or cancer blood cells. Blood cancer is also categorized in too many types. The two main categories of blood cancer are Acute Lymphocytic Lymphoma and Acute Myeloid Lymphoma. In this project proposes a approach that robotic detects and segments the nucleolus from white blood cells in the microscopic Blood images. Here in this project, we have used the two Machine learning algorithms that are k-means algorithm, Support vector machine algorithm. K-mean algorithm is use for segmentation and clustering. Support vector machine algorithm is used for classification. Keywords: k-means, Support vector machine, Lymphoma, Acute Lymphocytic Lymphoma, Machine Learning


2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Yao Huimin

With the development of cloud computing and distributed cluster technology, the concept of big data has been expanded and extended in terms of capacity and value, and machine learning technology has also received unprecedented attention in recent years. Traditional machine learning algorithms cannot solve the problem of effective parallelization, so a parallelization support vector machine based on Spark big data platform is proposed. Firstly, the big data platform is designed with Lambda architecture, which is divided into three layers: Batch Layer, Serving Layer, and Speed Layer. Secondly, in order to improve the training efficiency of support vector machines on large-scale data, when merging two support vector machines, the “special points” other than support vectors are considered, that is, the points where the nonsupport vectors in one subset violate the training results of the other subset, and a cross-validation merging algorithm is proposed. Then, a parallelized support vector machine based on cross-validation is proposed, and the parallelization process of the support vector machine is realized on the Spark platform. Finally, experiments on different datasets verify the effectiveness and stability of the proposed method. Experimental results show that the proposed parallelized support vector machine has outstanding performance in speed-up ratio, training time, and prediction accuracy.


Sign in / Sign up

Export Citation Format

Share Document