Membandingkan Tingkat Kemiripan Rekaman Voice Changer Menggunakan Analisis Pitch, Formant dan Spectogram (Comparing the Similarity of Voice Changer Recordings Using Pitch, Formant, and Spectrogram Analysis)

2018 ◽  
Vol 5 (1) ◽  
pp. 17
Author(s):  
Ahmad Subki ◽  
Bambang Sugiantoro ◽  
Yudi Prayudi

Audio forensics is a discipline that combines science and scientific methods in the analysis of sound recordings to help uncover crimes, as required in court proceedings. ITE Law No. 19 of 2016 states that voice recordings are a valid form of digital evidence and can be used to support an indictment. As digital evidence, voice recordings are easy to manipulate, whether intentionally or unintentionally. This study analyzes the level of similarity between voice changer recordings and original voice recordings using pitch, formant, and spectrogram analysis. Two types of recordings are analyzed: male voices and female voices. The voice changer recordings and the original recordings are extracted using Praat, and the resulting information is then analyzed with statistical analysis of pitch, formant, and spectrogram using Gnumeric. The study finds that voice changer recordings can be compared with original recordings using statistical analysis of pitch, formant, and spectrogram. Recordings from voice changer A have the highest similarity to the original recordings at the low pitch setting, while the other voice changers are harder to identify.
Keywords: Audio Forensics, Voice Changer, Voice Recording, Pitch, Formant, Spectrogram
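To make the idea of pitch comparison concrete, the following is a minimal sketch of estimating fundamental frequency by autocorrelation and comparing an original signal with a pitch-shifted one. It is not the authors' procedure: the study extracts pitch with Praat, whereas this example uses a plain autocorrelation estimator, and the function names, the 75 to 500 Hz search range, and the synthetic sine tones (standing in for voiced speech) are all assumptions made for illustration.

```python
import math


def sine_tone(freq_hz, sample_rate, duration_s):
    """Generate a synthetic sine tone (a stand-in for a voiced speech frame)."""
    n = int(sample_rate * duration_s)
    return [math.sin(2.0 * math.pi * freq_hz * i / sample_rate) for i in range(n)]


def estimate_pitch_hz(samples, sample_rate, fmin=75.0, fmax=500.0):
    """Estimate fundamental frequency from the strongest autocorrelation peak.

    The search range fmin..fmax is an assumed range for speech, not a value
    taken from the study. Returns None if no positive peak is found.
    """
    min_lag = int(sample_rate / fmax)
    max_lag = min(int(sample_rate / fmin), len(samples) - 1)
    mean = sum(samples) / len(samples)
    centered = [s - mean for s in samples]

    best_lag, best_score = None, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation at this lag.
        score = sum(centered[i] * centered[i + lag]
                    for i in range(len(centered) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else None


if __name__ == "__main__":
    rate = 8000
    original = sine_tone(120.0, rate, 0.5)   # hypothetical "original" voice pitch
    shifted = sine_tone(180.0, rate, 0.5)    # hypothetical "voice changer" output
    f0_original = estimate_pitch_hz(original, rate)
    f0_shifted = estimate_pitch_hz(shifted, rate)
    print(f"original: {f0_original:.1f} Hz, shifted: {f0_shifted:.1f} Hz")
```

Because the lag is an integer number of samples, the estimate is only accurate to within a few hertz at this sample rate. Real speech would also need framing and voicing detection, which this sketch leaves out.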

2021 ◽  
Vol 9 (1) ◽  
pp. 1-8
Author(s):  
Mifta Nur Farid ◽  
Dani Dwi Putra ◽  
Barokatun Hasanah

Audio forensics is a field of science that analyzes audio such as sound recordings. Voice recordings always carry information in the form of frequency characteristics, and these characteristics can be identified. This study analyzes changes in pitch and formant, using pitch analysis and analysis of variance on formants. With the correct procedure for handling recorded sound evidence, followed by procedural examination and analysis, it is hoped that the results of the voice recognition examination can scientifically show the ownership of the voice in the recording. Based on the overall analysis of the evidence and comparison recordings after the various stages of analysis, the study concluded that the voice recordings are "not identical", that is, not matched to the same person. The cause of the mismatch in voice identification is the difference in intonation or tone of the subject's speech when the voice was recorded.
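As an illustration of analysis of variance applied to formants, the sketch below computes a one-way ANOVA F statistic for formant measurements taken from two recordings. It is a generic textbook calculation, not the authors' exact procedure, and the function name and the formant values are hypothetical placeholders rather than data from the study.

```python
def one_way_anova_f(groups):
    """Return the one-way ANOVA F statistic for a list of sample groups.

    F = (between-group mean square) / (within-group mean square).
    Raises ZeroDivisionError if there is no variation within the groups.
    """
    all_values = [v for group in groups for v in group]
    grand_mean = sum(all_values) / len(all_values)
    group_means = [sum(group) / len(group) for group in groups]

    # Variation of the group means around the grand mean.
    ss_between = sum(len(group) * (mean - grand_mean) ** 2
                     for group, mean in zip(groups, group_means))
    # Variation of each value around its own group mean.
    ss_within = sum((v - mean) ** 2
                    for group, mean in zip(groups, group_means)
                    for v in group)

    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


if __name__ == "__main__":
    # Hypothetical F1 formant values in Hz, invented for this example only.
    evidence_f1 = [510.0, 525.0, 498.0, 530.0, 515.0]
    comparison_f1 = [560.0, 572.0, 549.0, 581.0, 566.0]
    f_stat = one_way_anova_f([evidence_f1, comparison_f1])
    print(f"F statistic: {f_stat:.2f}")
```

In practice the F statistic would be compared with the critical value for the chosen significance level and degrees of freedom. A value above the critical value suggests that the formant means of the two recordings differ.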


2015 ◽  
Vol 11 (1) ◽  
pp. 17-32 ◽  
Author(s):  
Eun Kyung Park ◽  
Kwan Min Lee ◽  
Dong Hee Shin

The study investigated whether apologetic synthetic gendered voices affect users' perception of an error-prone VUI. In a TV viewing task, participants interacted with the conversational TV and executed eight menus in a 2 (apologetic error message: yes vs. no) by 2 (voice gender) by 2 (subject gender) gender-balanced, between-participants experiment. When participants encountered errors, the TV provided verbal error messages, with or without an apology. The results revealed significant two-way interaction effects of apology (yes) and voice gender (male) on perception of the TV and the voice. Irrespective of gender, participants responded more to a male voice when it offered apologies for errors. This is interpreted to mean that a context in which the genuineness of the apology was regarded as important made participants perceive a male voice as more trustworthy than a female voice. The participants seem to have applied gender-stereotypical perceptions to the gendered VUI, as they do to other humans.


Author(s):  
Abiyan Bagus Baskoro ◽  
Niken Cahyani ◽  
Aji Gautama Putrada

Voice recordings can be altered in various ways, either intentionally or unintentionally, for example by using a voice changer. Reference and suspect voice recordings are harder to analyze if the suspect recording has been altered using a voice changer application with certain effects, such as a telephone effect. Using a voice changer can therefore serve as an anti-forensic technique, making it difficult for investigators to examine a recording altered with the telephone effect. This study uses two types of recordings: the reference voice recording (unknown sample) and the suspect voice recording (known sample) that has been altered using a voice changer application with the telephone effect. The investigation is based on extracting and analyzing pitch, formant, and spectrogram data using the analysis of variance (ANOVA) method and the likelihood ratio method. The results indicate that using a voice changer can serve as an anti-forensic technique that makes it difficult for investigators to examine sound recording evidence. This research may help the forensic community, especially investigators, in examining sound recordings.


2021 ◽  
Vol 57 (1) ◽  
pp. 40-55
Author(s):  
Gordana Varošanec-Škarić ◽  
Siniša Stevanović ◽  
Iva Bašić

In this study, we examined changes in the voice quality of a transgender client who had previously undergone male-to-female (MtF) transition. We conducted a longitudinal phonetic analysis after obtaining recordings from our client before and after undergoing laser-assisted voice adjustment (LAVA) surgery. The following acoustic parameters were compared: fundamental frequency (F0) measures, local jitter, shimmer, harmonic to noise ratio, phonation time, and long-term average spectrum. We assumed that the voice would not change significantly as a result of previous hormonal and vocal therapy, and that its timbre would be closer to female values after LAVA surgery. Since the client was on hormone therapy before the surgery, the average values of F0 corresponded to the values of a normal female voice (190.1 Hz), and, after surgery, the voice became significantly higher in phonation (235.6 Hz). Before surgery, the voice was high for a male voice during reading (mean F0 = 150.19 Hz for non-fricative text (NT) and mean F0 = 158.06 Hz for fricative text (FT)). After surgery, the voice exhibited higher F0 values (F0 = 184.72 Hz for NT and F0 = 191.87 Hz for FT). Before surgery, the voice was average high for a male voice during spontaneous speech (F0 = 119.90 Hz), while after surgery the F0 was 161.33 Hz during spontaneous speech, which is somewhat lower than the average pitch values of the female voice, but its timbral quality is more feminine. Since spontaneous speech is very important for comparing vocal timbre, we can conclude that the 42 Hz difference observed is notable. Although the minimal and maximal values of F0 based on phonation were significantly higher after surgery (p < 0.001), the range was limited. The total results of the F0 measures are higher than expected, while the shortened phonation time points to the need for voice therapy. Considering all our results, we can conclude that it is important to discuss a client’s profession before considering LAVA surgery.


Phonopoetics ◽  
2019 ◽  
pp. 169-184
Author(s):  
Jason Camlot

The Conclusion to Phonopoetics explores conceptions of voice preservation and models of the voice archive. It takes early ideas of the audible archival artifact (the sound recording) and the event-oriented scenario of its use as useful points of departure for a historically motivated theorization of the voice recording and voice archive at the present time. Specifically, it considers the impact of digital media technologies on the status of the record and its archive. The Conclusion meditates on how the analogue artifact of the sound archive has shaped our ideas and expectations about what a digital repository should be, and reflects on the status of the artifact of study as we move increasingly from the study of material media artifacts to virtual instantiations of the signals those media may once have held, in the form of digital media files.


2017 ◽  
pp. 1709-1726
Author(s):  
Eun Kyung Park ◽  
Kwan Min Lee ◽  
Dong Hee Shin

The study investigated whether apologetic synthetic gendered voices affect users' perception of an error-prone VUI. In a TV viewing task, participants interacted with the conversational TV and executed eight menus in a 2 (apologetic error message: yes vs. no) by 2 (voice gender) by 2 (subject gender) gender-balanced, between-participants experiment. When participants encountered errors, the TV provided verbal error messages, with or without an apology. The results revealed significant two-way interaction effects of apology (yes) and voice gender (male) on perception of the TV and the voice. Irrespective of gender, participants responded more to a male voice when it offered apologies for errors. This is interpreted to mean that a context in which the genuineness of the apology was regarded as important made participants perceive a male voice as more trustworthy than a female voice. The participants seem to have applied gender-stereotypical perceptions to the gendered VUI, as they do to other humans.


2019 ◽  
Vol 4 (4) ◽  
pp. 607-614
Author(s):  
Jean Abitbol

The purpose of this article is to update the management of the treatment of the female voice at perimenopause and menopause. Voice and hormones—these are 2 words that clash, meet, and harmonize. If we are to solve this inquiry, we shall inevitably have to understand the hormones, their impact, and the scars of time. The endocrine effects on laryngeal structures are numerous: The actions of estrogens and progesterone produce modification of glandular secretions. Low doses of androgens are secreted principally by the adrenal cortex, but they are also secreted by the ovaries. Their effect may increase the low pitch and decrease the high pitch of the voice at menopause due to important diminution of estrogens and the privation of progesterone. The menopausal voice syndrome presents clinical signs, which we will describe. I consider menopausal patients to fit into 2 broad types: the “Modigliani” types, rather thin and slender with little adipose tissue, and the “Rubens” types, with a rounded figure with more fat cells. Androgen derivatives are transformed to estrogens in fat cells. Hormonal replacement therapy should be carefully considered in the context of premenopausal symptom severity as alternative medicine. Hippocrates: “Your diet is your first medicine.”


2020 ◽  
Vol 52 (4) ◽  
pp. 726-732
Author(s):  
Claire Beaugrand

In a tweet posted on 29 March 2018, a bidūn activist—who was later jailed from July 2019 to January 2020 for peacefully protesting against the inhumane conditions under which the bidūn are living—shared a video. The brief video zooms in closely on an ID card, recognizable as one of those issued to the bidūn, or long-term residents of Kuwait who are in contention with the state regarding their legal status. More precisely, the mobile phone camera focuses on the back of the ID card, on one line with a special mention added by the Central System (al-jihāz al-markazī), the administration in charge of bidūn affairs. Other magnetic strip cards hide the personal data written above and below it. A male voice can be heard saying that he will read this additional remark, but before even doing so he bursts into laughter. The faceless voice goes on to read out the label in an unrestrained laugh: “ladayh qarīb … ladayh qarīna … dālla ʿalā al-jinsiyya al-ʿIrāqiyya” (he has a relative … who has presumptive evidence … suggesting an Iraqi nationality). The video shakes as the result of a contagious laugh that grows in intensity. In the Kuwaiti dialect, the voice continues commenting: “Uqsim bil-Allāh, gaʿadt sāʿa ufakkir shinū maʿanāt hal-ḥatchī” (I swear by God, it took me an hour to figure out the meaning of this nonsense), before reading the sentence again, stopping and guffawing, and asking if he should “repeat it a third time,” expressing amazement at its absurdity. The tweet, addressed to the head of the Central System (mentioned in the hashtag #faḍīḥat Sāliḥ al-Faḍāla, or #scandal Salih al-Fadala), reads: In lam tastaḥī fa-'ktub mā shaʾt (Don't bother, write what you want).


2019 ◽  
Vol 9 (15) ◽  
pp. 3097 ◽  
Author(s):  
Diego Renza ◽  
Jaime Andres Arango ◽  
Dora Maria Ballesteros

This paper addresses a problem in the field of audio forensics. With the aim of providing a solution that helps Chain of Custody (CoC) processes, we propose an integrity verification system that includes capture (mobile based), hash code calculation and cloud storage. When the audio is recorded, a hash code is generated in situ by the capture module (an application), and it is sent immediately to the cloud. Later, the integrity of the audio recording given as evidence can be verified according to the information stored in the cloud. To validate the properties of the proposed scheme, we conducted several tests to evaluate whether two different inputs could generate the same hash code (collision resistance), and to evaluate how much the hash code changes when small changes occur in the input (sensitivity analysis). According to the results, all selected audio signals provide different hash codes, and these values are very sensitive to small changes in the recorded audio. In terms of computational cost, less than 2 s per minute of recording is required to calculate the hash code. With the above results, our system is useful for verifying the integrity of audio recordings that may be relied on as digital evidence.
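The verification flow described above (hash at capture, store the hash, recompute and compare later) can be sketched as follows. This is a sketch under stated assumptions, not the paper's scheme: the abstract does not name the hash function, so SHA-256 from Python's standard library is swapped in, and the function names and the placeholder byte string standing in for recorded audio are hypothetical. The last helper mirrors the sensitivity analysis by measuring what fraction of hash bits change after a one-bit change in the input.

```python
import hashlib
import hmac


def audio_digest(audio_bytes: bytes) -> str:
    """Compute a hex digest of the raw audio bytes (SHA-256 is an assumed choice)."""
    return hashlib.sha256(audio_bytes).hexdigest()


def verify_integrity(audio_bytes: bytes, stored_digest: str) -> bool:
    """Recompute the digest and compare it with the one stored at capture time."""
    return hmac.compare_digest(audio_digest(audio_bytes), stored_digest)


def bit_difference_fraction(hex_a: str, hex_b: str) -> float:
    """Fraction of bits that differ between two equal-length hex digests."""
    n_bits = len(hex_a) * 4
    return bin(int(hex_a, 16) ^ int(hex_b, 16)).count("1") / n_bits


if __name__ == "__main__":
    # Placeholder bytes standing in for a captured recording.
    recording = bytes(range(256)) * 64
    stored = audio_digest(recording)  # would be sent to cloud storage at capture

    # Flip a single bit to simulate a small modification of the evidence.
    tampered = bytes([recording[0] ^ 0x01]) + recording[1:]

    print("original verifies:", verify_integrity(recording, stored))
    print("tampered verifies:", verify_integrity(tampered, stored))
    changed = bit_difference_fraction(stored, audio_digest(tampered))
    print(f"fraction of hash bits changed: {changed:.2f}")
```

For a cryptographic hash, a one-bit input change is expected to flip roughly half of the output bits. That is the property a sensitivity analysis of this kind checks.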


Author(s):  
Naoko Saito

This article broaches what can sometimes be seen as the suppression of the female voice, sometimes the repression of the feminine. To address these matters involves the reconsideration of the political discourse that pervades education and educational research. This article is an attempt to disclose inequity in apparently equitable space, through the acknowledgment of the voice of disequilibrium. It proposes to re-place the subject of philosophy, and the subject of woman, through an alternative idea of the feminine voice in philosophy. It tries to reconfigure the female voice without negating its fated biological origin and traits, and yet avoiding the confining of thought to the constraints of gender divides. In terms of education, it shall argue for the conversation of justice as a way of cultivating the feminine voice in philosophy: as the voice of disequilibrium. This is an occasion of mutual destabilization and transformation of man and woman, crossing gender divides, and preparing an alternative route to political criticism that not only reclaims the rights of women but releases the thinking of men and women, laying the way for a better, more pluralist, and more democratic politics. The feminine voice can find a way beyond the dominance of instrumental rationality and calculative thinking in the discourse on equity itself. And it can, one might reasonably hope, have an impact on the curriculum of university education.

