IMPLEMENTASI SPEECH RECOGNITION PADA APLIKASI BELAJAR BENTUK PERMAINAN MENEBAK KATA BAKU BAHASA INDONESIA

This paper is about the study of Indonesian Automatic Speech Recognition (ASR) designed by Informational- Technological Computer ( TIK).Specifically, this paper is aimed at describing how this tool operates in recognizing some in-puts in Indonesian language. TIK industry has something to do with Indonesian PWO where the use of the smart phones developes massively in Indonesia. This study processes around 10.774 data in Indonesian language in form of sentence, phrase, and word. From this number, less than 20% can be categorized perfect. The others have an error in format and recognition. This is due to some factors bringing about the failure of ASR in recognising the in put in Indonesian language.

Download Full-text

Rancang Bangun Sistem Kendali Berbasis Googlespeech Untuk Aktivasi Peralatan Listrik Rumah

JREC (Journal of Electrical and Electronics) ◽

10.33558/jrec.v5i2.459 ◽

2018 ◽

Vol 5 (2) ◽

pp. 83-98

Author(s):

Tesar Kurniawan ◽

Nursin Nursin ◽

Muhamad Amin Bakrie ◽

Seta Samsiana

Keyword(s):

Speech Recognition ◽

Free Space ◽

Recognition Accuracy ◽

Voice Recognition ◽

Smart Phones ◽

Maximum Distance ◽

Software Developer ◽

Door Lock ◽

App Inventor ◽

Bahasa Indonesia

App inventor adalah media pengembang perangkat lunak untuk sistem android, yang memudahkan para pengembangnya mengembangkan idenya, salah satunya aplikasi yang mampu mengendalikan peralatan listrik rumah menggunakan suara melalui telepon pintar yang dapat mengontrol aktivasi peralatan listrik rumah. Google Speech digunakan untuk pengenalan suara yang kemudian memberikan input ke Arduino untuk mengendalikan aktivasi peralatan listrik rumah, Peralatan listrik rumah seperti lampu, motor pompa akuarium, kipas, door lock dan motor servo yang memanfaatkan relay sebagai driver, kemudian dilakukanlah pengujian dan penelitian pada laporan ini berisi tentang pengujian akurasi pengenalan suara google Speech dan pengujian jarak koneksi Bluetooth. Tingkat keakurasian pada google Speech yang paling baik dari 3 bahasa yaitu Bahasa Indonesia disusulBahasa jawa dan terakhir Bahasa sunda, sedangkan untuk jarak koneksi pada Bluetooth dapat dioperasikan jarak maksimal pada ruang bebas adalah 20 m dan jarak maksimal pada ruang berhalangan adalah 13 m. App inventor is a software developer media for android systems, which makes it easy for developers to develop their ideas, i.e an application that is able to control home electrical appliances using voice over smart phones that can control the activation of home electrical appliances. Google Speech is used for voice recognition which then provides input to Arduino to control the activation of home electrical appliances, such as lamps, aquarium pump motors, fans, door locks. A servo motors is used as drivers, then test and research on this report Contains about Speech google speech recognition accuracy testing and Bluetooth connection distance testing. Level of accuracy on google Speech the best of 3 languages ie Indonesian followed by Java and last language Sundanese, while for the distance on the Bluetooth connection can be operated the maximum distance in free space is 20 m and the maximum distance in the absence room is 13 m.

Download Full-text

Konversi Suara Ucapan Bahasa Indonesia Ke Sistem Bahasa Isyarat Indonesia (Sibi)

Ainet : Jurnal Informatika ◽

10.26618/ainet.v2i2.4025 ◽

2020 ◽

Vol 2 (2) ◽

pp. 7-13

Author(s):

Andi Nasri

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Error Rate ◽

Word Error Rate ◽

Bahasa Indonesia

Dengan semakin berkembangnya teknologi speech recognition, berbagai software yang bertujuan untuk memudahkan orang tunarungu dalam berkomunikasi dengan yang lainnya telah dikembangkan. Sistem tersebut menterjemahkan suara ucapan menjadi bahasa isyarat atau sebaliknya bahasa isyarat diterjemahkan ke suara ucapan. Sistem tersebut sudah dikembangkan dalam berbagai bahasa seperti bahasa Inggris, Arab, Spanyol, Meksiko, Indonesia dan lain-lain. Khusus untuk bahasa Indonesia mulai juga sudah yang mencoba melakukan penelitian untuk membuat system seperti tersebut. Namun system yang dibuat masih terbatas pada Automatic Speech Recognition (ASR) yang digunakan dimana mempunyai kosa-kata yang terbatas. Dalam penelitian ini bertujuan untuk mengembangkan sistem penterjemah suara ucapan bahasa Indonesia ke Sistem Bahasa Isyarat Indonesia (SIBI) dengan data korpus yang lebih besar dan meggunkanan continue speech recognition untuk meningkatkan akurasi system.Dari hasil pengujian system menunjukan diperoleh hasil akurasi sebesar rata-rata 90,50 % dan Word Error Rate (WER) 9,50%. Hasil akurasi lebih tinggi dibandingkan penelitian kedua 48,75% dan penelitan pertama 66,67%. Disamping itu system juga dapat mengenali kata yang diucapkan secara kontinyu atau pengucapan kalimat. Kemudian hasil pengujian kinerja system mencapai 0,83 detik untuk Speech to Text dan 8,25 detik untuk speech to sign.

Download Full-text

Dataset Suara dan Teks Berbahasa Indonesia Pada Rekaman Podcast dan Talk show

JURNAL FASILKOM ◽

10.37859/jf.v11i2.2628 ◽

2021 ◽

Vol 11 (2) ◽

pp. 61-66

Author(s):

Martin Novela ◽

T. Basaruddin

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Speech Recognition ◽

Text To Speech ◽

Talk Show ◽

Input Dataset ◽

Bahasa Indonesia

Salah satu faktor keberhasilan suatu model pembelajaran dalam machine learning atau deep learning adalah dataset yang digunakan. Pada tulisan ini menyajikan dataset suara dari rekaman podcast dan talk show beserta transkripsi berbahasa Indonesia. Dataset ini disajikan karena belum adanya ketersediaan dataset berbahasa Indonesia yang dapat diakses secara publik untuk digunakan pada pembelajaran model Text-to-Speech ataupun Audio Speech Recognition. Dataset terdiri dari 3270 rekaman yang diproses untuk mendapatkan transkripsi berupa teks atau kalimat berbahasa Indonesia. Dalam pembuatan dataset ini dilakukan beberapa tahapan seperti pra-pemrosesan, tahapan translasi, tahapan validasi pertama dan tahapan validasi kedua. Dataset dibuat dengan format yang mengikuti format dari dataset LJSpeech untuk memudahkan pemrosesan dataset ketika digunakan dalam suatu model sebagai input. Dataset ini diharapkan dapat membantu meningkatkan kualitas pembelajaran untuk pemrosesan Text-to-Speech seperti pada model Tacotron2 ataupun pada pemrosesan Audio Speech Recognition untuk bahasa Indonesia.

Download Full-text

Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia

Jurnal Teknik Informatika dan Sistem Informasi ◽

10.28932/jutisi.v6i2.2684 ◽

2020 ◽

Vol 6 (2) ◽

Author(s):

Vincent Elbert Budiman ◽

Andreas Widjaja

Keyword(s):

Speech Recognition ◽

Error Rate ◽

Language Model ◽

Beam Width ◽

Acoustic Model ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Word Error Rate ◽

Testing Data ◽

Bahasa Indonesia

Here a development of an Acoustic and Language Model is presented. Low Word Error Rate is an early good sign of a good Language and Acoustic Model. Although there are still parameters other than Words Error Rate, our work focused on building Bahasa Indonesia with approximately 2000 common words and achieved the minimum threshold of 25% Word Error Rate. There were several experiments consist of different cases, training data, and testing data with Word Error Rate and Testing Ratio as the main comparison. The language and acoustic model were built using Sphinx4 from Carnegie Mellon University using Hidden Markov Model for the acoustic model and ARPA Model for the language model. The models configurations, which are Beam Width and Force Alignment, directly correlates with Word Error Rate. The configurations were set to 1e-80 for Beam Width and 1e-60 for Force Alignment to prevent underfitting or overfitting of the acoustic model. The goals of this research are to build continuous speech recognition in Bahasa Indonesia which has low Word Error Rate and to determine the optimum numbers of training and testing data which minimize the Word Error Rate.

Download Full-text

APLIKASI SPEECH TO TEXT BAHASA INDONESIA KE BAHASA BENGKULU MENGGUNAKAN POCKETSPHINX BERBASIS ANDROID

JSAI (Journal Scientific and Applied Informatics) ◽

10.36085/jsai.v1i2.14 ◽

2018 ◽

Vol 1 (2) ◽

pp. 54-57

Author(s):

Dwita Deslianti

Keyword(s):

Speech Recognition ◽

Smart Phone ◽

Bahasa Indonesia

Saat ini smart phone berbasis Android khususnya sudah menjadi kebutuhan masyarakat karena menjanjikan banyak kemudahan sehari-hari salah satunya ialah speech recognition yang dikembangkan oleh perusahaan Google. Dengan fitur seperti ini, pengguna dapat dimudahkan mencari lokasi, artikel dan apapun yang kita butuhkan saat kita sibuk dalam berkendara dengan hanya menggunakan suara kita saja. Saat bepergian keluar kota pun kita sangat memerlukan sebuah smart phone yang mampu mendampingi kita dalam perjalanan semisal untuk melakukan komunikasi dengan bahasa daerah yang kita kunjungi. Dari permasalahan tersebut maka penulis akan membahas bagaimana membangun suatu aplikasi Speech to Text Bahasa Indonesia ke Bahasa Bengkulu Menggunakan Pocketsphinx Berbasis Android. Tujuan penelitian ini adalah membangun sebuah program aplikasi Speech to Text Bahasa Indonesia ke Bahasa Bengkulu Menggunakan Pocketsphinx Berbasis Android Sehingga manfaat dari penelitian ini adalah Dapat membantu ketika sedang bepergian ke kota Bengkulu untuk melakukan komunikasi dengan bahasa Bengkulu.

Download Full-text

Langkah Praktis Membangun Sistem Pengenalan Suara dengan HTK

JSAI (Journal Scientific and Applied Informatics) ◽

10.36085/jsai.v2i2.314 ◽

2019 ◽

Vol 2 (2) ◽

pp. 149-153

Author(s):

Zulkarnaen Hatala

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Error Rate ◽

Principle Component Analysis ◽

Hidden Markov ◽

Recognition System ◽

Speech Recognition System ◽

Automatic Speech Recognition System ◽

Word Error Rate ◽

Bahasa Indonesia

Dipaparkan prosedur untuk mengembangkan Sistem Pengenalan Suara otomatis, Automatic Speech Recognition System (ASR) untuk kasus online recognition. Prosedur ini secara cepat dan efisien membangun ASR menggunakan Hidden Markov Toolkit (HTK). Langkah-langkah praktis ini dipaparkan secara jelas untuk mengimplementasikan ASR dengan daftar kata sedikit (Small Vocabulary) dalam contoh kasus pengenalan digit Bahasa Indonesia. Dijelaskan beberapa teknik meningkatkan performansi seperti cara mengatasi noise, pengejaan ganda dan penerapan Principle Component Analysis. Hasil akhir berupa Word Error Rate

Download Full-text

Speech Recognition Untuk Aplikasi Kamus Bahasa Indonesia-Sumbawa Berbasis Android

Jurnal Bumigora Information Technology (BITe) ◽

10.30812/bite.v1i2.606 ◽

2019 ◽

Vol 1 (2) ◽

pp. 126-137

Author(s):

Muhammad , ◽

Syahroni Hidayat ◽

Ahmad Zuli Amrullah

Keyword(s):

Speech Recognition ◽

Black Box ◽

Native People ◽

Great Demand ◽

Testing Method ◽

Test Application ◽

Analysis Design ◽

The Creation ◽

Tourism Potential ◽

Bahasa Indonesia

ABSTRAK Sumbawa sebagai salah satu daerah yang dianugrahi potensi wisata yang beragam menjadikan daya tarik masyarakat luar Sumbawa (wisatawan) untuk berkunjung, bekerja, maupun untuk belajar. Namun terkadang bahasa menjadi salah satu kendala yang dihadapi mayarakat luar Sumbawa jika ingin berinteraksi dengan masyarakat asli Sumbawa. Sehingga dibutuhkan sebuah instrument yang bisa digunakan sehingga perbedaan bahasa tidak menjadi kendala dalam berinteraksi yaitu kamus. Oleh karena itu, kamus yang disajikan haruslah sesuai dengan teknologi yang banyak diminati oleh masyarakat Indonesia pada umumnya yaitu smartphone Android dikarenakan fitur-fitur yang tersedia dalam smartphone tersebut. Salah satu fiturnya adalah speech recognition.Perancangan sistem ini dilakukan dengan metodologi waterfall yang terdiri dari proses analisis, desain, pengkodean, pengujian, dan terakhir pemeliharaan. Tools yang digunakan adalah Android Studio dan DB Browser for SQLite (DB4S). Metode pengujian menggunakan Black Box untuk uji fungsionalitas aplikasi dan Word Correct Rate (WCR) untuk menguji akurasi sistem dengan menggunakan 30 kata yang berbeda dan setiap kata diulang sebanyak 10 kali.Hasil yang sudah dicapai dalam penelitian ini adalah terciptanya aplikasi Kamus Bahasa Indonesia- Sumbawa Berbasis Android dengan memanfaatkan teknologi speech recognition.Kesimpulan dari penelitian ini adalah Uji fungsionalitas menunjukkan fitur-fitur aplikasi dapat bekerja dengan baik ketika offline maupun online. Sedangkan untuk uji coba akurasi sistem didapatkan hasil WCR secara berturut-turut sebesar 92.67% ketika offline dan 95.33% ketika online. ABSTRACT Sumbawa as one of the areas that is blessed with diverse tourism potential makes the appeal of people outside Sumbawa (tourists) to visit, work, or to study. But sometimes language becomes one of the obstacles faced by people outside Sumbawa if they want to interact with the native people of Sumbawa. So we need an instrument that can be used so that differences in language do not become obstacles in interacting with the dictionary. Therefore, the dictionary presented must be in accordance with the technology that is in great demand by the Indonesian people in general, namely Android smartphones because of the features available in these smartphones. One of the features is speech recognition. The design of this system is done by the waterfall methodology which consists of the process of analysis, design, coding, testing, and finally maintenance. The tools used are Android Studio and DB Browser for SQLite (DB4S). The testing method uses Black Box to test application functionality and Word Correct Rate (WCR) to test the accuracy of the system using 30 different words and each word is repeated 10 times. The results achieved in this study are the creation of an Indonesian-Sumbawa-based Dictionary application Android by utilizing speech recognition technology. The conclusion of this research is the functionality test shows that the application features can work well when offline or online. Whereas for testing the accuracy of the system the WCR results obtained were 92.67% when offline and 95.33% when online.

Download Full-text

Measuring Mandarin Speech Recognition Thresholds Using the Method of Adaptive Tracking

Journal of Speech Language and Hearing Research ◽

10.1044/2019_jslhr-h-18-0162 ◽

2019 ◽

Vol 62 (6) ◽

pp. 2009-2017

Author(s):

Yuxia Wang ◽

Zhaoyu Lu ◽

Xiaohu Yang ◽

Chang Liu

Keyword(s):

Speech Recognition ◽

Adaptive Tracking ◽

Mandarin Speech Recognition

Download Full-text

Selecting the Optimal FM System for Children With Cochlear Implants

Perspectives on Hearing and Hearing Disorders in Childhood ◽

10.1044/hhdc18.1.19 ◽

2008 ◽

Vol 18 (1) ◽

pp. 19-24

Author(s):

Erin C. Schafer

Keyword(s):

Speech Recognition ◽

Cochlear Implants ◽

Empirical Research ◽

Background Noise ◽

Signal To Noise Ratio ◽

Evidence Based ◽

Signal To Noise ◽

Speech Processor ◽

System Input ◽

Optimal Type

Children who use cochlear implants experience significant difficulty hearing speech in the presence of background noise, such as in the classroom. To address these difficulties, audiologists often recommend frequency-modulated (FM) systems for children with cochlear implants. The purpose of this article is to examine current empirical research in the area of FM systems and cochlear implants. Discussion topics will include selecting the optimal type of FM receiver, benefits of binaural FM-system input, importance of DAI receiver-gain settings, and effects of speech-processor programming on speech recognition. FM systems significantly improve the signal-to-noise ratio at the child's ear through the use of three types of FM receivers: mounted speakers, desktop speakers, or direct-audio input (DAI). This discussion will aid audiologists in making evidence-based recommendations for children using cochlear implants and FM systems.

Download Full-text