A Hybrid Approach for Video Indexing Using Computer Vision and Speech Recognition

Adapting to different environments remains a challenge for Mobile Augmented Reality (MAR). If transitions between environments are not seamless, they may cause discontinuities in navigation, disorienting users and undermining acceptance of the technology. These transitions are hard because no localization technique currently works well everywhere: sensor-based applications can be harmed by obstacles that hamper sensor communication (e.g., GPS) and by infrastructure limitations (e.g., Wi-Fi), and image-based applications can be affected by lighting conditions that impair computer vision techniques. Hence, this paper presents an adaptive model for performing transitions between different types of environments in MAR applications. The model takes a hybrid approach, choosing the best combination of long-range sensors, short-range sensors, and computer vision techniques to perform fluid transitions between environments and to mitigate problems in location, orientation, and registration. To assess the model, we developed a MAR application and conducted a navigation test with volunteers to validate transitions between outdoor and indoor environments, followed by a short interview. The results show that the transitions were successful: the application adapted itself to the studied environments, seamlessly changing sensors when needed.

The Concept of Integrating Artificial Intelligence Technologies Into Human Resources in a Digital Paradigm

Management of the personnel and intellectual resources in Russia ◽ 10.12737/2305-7807-2020-5-9 ◽ 2020 ◽ Vol 9 (2) ◽ pp. 5-9

Author(s): Oksana Chulanova

Keyword(s): Artificial Intelligence ◽ Computer Vision ◽ Natural Language Processing ◽ Decision Support ◽ Speech Recognition ◽ Human Resources ◽ Natural Language ◽ Language Processing

The article discusses the capabilities of artificial intelligence technologies, including natural language processing, intelligent decision support, computer vision, speech recognition and synthesis, and promising new methods of artificial intelligence. It presents the results of the author's study and an analysis of these technologies and their potential for optimizing work with staff. On the basis of this study, the author develops a concept for integrating artificial intelligence technologies into personnel work in the digital paradigm.

Noise robust acoustic signal processing using a Hybrid approach for speech recognition

2016 6th International Conference - Cloud System and Big Data Engineering (Confluence) ◽ 10.1109/confluence.2016.7508169 ◽ 2016

Author(s): Divya Gupta ◽ Poonam Bansal ◽ Kavita Choudhary

Keyword(s): Signal Processing ◽ Speech Recognition ◽ Acoustic Signal ◽ Hybrid Approach ◽ Noise Robust

Fi-Fo Detector: Figure and Formula Detection Using Deformable Networks

Applied Sciences ◽ 10.3390/app10186460 ◽ 2020 ◽ Vol 10 (18) ◽ pp. 6460

Author(s): Junaid Younas ◽ Shoaib Ahmed Siddiqui ◽ Mohsin Munir ◽ Muhammad Imran Malik ◽ Faisal Shafait ◽ ...

Keyword(s): Computer Vision ◽ Image Representation ◽ Hybrid Approach ◽ Representation Learning ◽ Superior Performance ◽ Document Images ◽ Connected Component ◽ Study Results ◽ Image Representations ◽ Ablation Study

We propose a novel hybrid approach that fuses traditional computer vision techniques with deep learning models to detect figures and formulas in document images. The proposed approach first fuses several computer-vision-based image representations, i.e., color transform, connected component analysis, and distance transform, into what we term the Fi-Fo image representation. The Fi-Fo image representation is then fed to deep models for further, more refined representation learning to detect figures and formulas in document images. The proposed approach is evaluated on the publicly available ICDAR-2017 Page Object Detection (POD) dataset and its corrected version. It achieves state-of-the-art results for formula and figure detection in document images, with F1-scores of 0.954 and 0.922, respectively. Ablation study results reveal that the Fi-Fo image representation helps achieve superior performance compared with the raw image representation. The results also establish that the hybrid approach helps deep models learn more discriminative and refined features.

Multi-pass feature enhancement based on generative-discriminative hybrid approach for noise robust speech recognition

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽ 10.1109/icassp.2016.7472779 ◽ 2016

Author(s): Masakiyo Fujimoto ◽ Tomohiro Nakatani

Keyword(s): Speech Recognition ◽ Hybrid Approach ◽ Robust Speech Recognition ◽ Feature Enhancement ◽ Noise Robust Speech Recognition ◽ Noise Robust

The Implementation of Multilevel Colour Thresholding on a Prototype Coffee Machine

Journal of Science and Application Technology ◽ 10.35472/jsat.v4i2.344 ◽ 2020 ◽ Vol 4 (2) ◽ pp. 121

Author(s): Nova Resfita ◽ Rahmadi Kurnia ◽ Fitrilina Fitrilina

Keyword(s): Image Processing ◽ Computer Vision ◽ Speech Recognition ◽ Daily Life ◽ Recognition System ◽ Processing Technique ◽ Image Processing Technique ◽ Speech Recognition System ◽ Vast Number ◽ Coffee Machine

Computer vision has developed rapidly, and it now has a vast number of applications in many aspects of daily life. One such application is integrating an image processing technique into a prototype coffee machine that is driven by a speech recognition system. This study aims to detect the coffee colour that the user requests by voice: black, middle, or light. The sensor used in this research is a digital PC camera, and the applied method is Multilevel Colour Thresholding. In all experiments conducted, the image processing technique worked as intended: the camera was able to identify the requested colour of the coffee solution. The system could be developed further by improving the multilevel colour thresholding technique and advancing the hardware design, in order to build a more robust coffee machine based on the requested colour.
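As a rough illustration of the idea of multilevel thresholding mentioned in this abstract, the sketch below assigns each pixel to one of three intensity bands and labels the sample by majority vote. It is not the authors' implementation: the band limits in `LEVELS`, the grey-level conversion, the majority vote, and all function names are assumptions made for this example only.

```python
from collections import Counter

# Hypothetical grey-level bands (0-255). The thresholds actually used in the
# study are not stated in the abstract; these values are placeholders.
LEVELS = [
    ("black", 0, 85),
    ("middle", 85, 170),
    ("light", 170, 256),
]


def to_grey(pixel):
    """Convert an (R, G, B) tuple to a grey level with ITU-R BT.601 weights."""
    r, g, b = pixel
    return 0.299 * r + 0.587 * g + 0.114 * b


def threshold_pixel(pixel):
    """Assign one pixel to the band whose [low, high) range contains its grey level."""
    level = to_grey(pixel)
    for label, low, high in LEVELS:
        if low <= level < high:
            return label
    raise ValueError(f"grey level {level} is outside the 0-255 range")


def classify_colour(pixels):
    """Label a sample by the band that most of its pixels fall into."""
    counts = Counter(threshold_pixel(p) for p in pixels)
    if not counts:
        raise ValueError("no pixels given")
    return counts.most_common(1)[0][0]


def matches_request(pixels, requested):
    """Check whether the observed colour equals the colour the user asked for."""
    return classify_colour(pixels) == requested
```

In a setup like the one described, `requested` would come from the speech recognition step and `pixels` from a region of the camera frame that shows the coffee solution; both of those stages are outside this sketch.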

HYBRID APPROACH OF GARBAGE CLASSIFICATION USING COMPUTER VISION AND DEEP LEARNING

International Journal of Engineering Applied Sciences and Technology ◽ 10.33564/ijeast.2021.v05i10.032 ◽ 2021 ◽ Vol 5 (10)

Author(s): Anish Tatke ◽ Madhura Patil ◽ Anuj Khot ◽ Parul Jadhav ◽ Dr Vishwanath Karad

Keyword(s): Neural Networks ◽ Computer Vision ◽ Comparative Analysis ◽ Deep Neural Networks ◽ Hybrid Approach ◽ Computer Architectures ◽ Excellent Performance ◽ Modern Computer ◽ Use Of Technology ◽ Waste Segregation

As waste segregation becomes an important issue in our lives, technologies such as deep neural networks and computer vision can make the process efficient and robust through image segmentation and classification. Such systems, which are on the rise, need accurate and efficient segmentation and recognition mechanisms, and this demand coincides with the growing computational capabilities of modern computer architectures and with more effective algorithms for image recognition. This paper briefly compares several approaches and methods, such as a simple CNN, ResNet50, and VGG16. The comparative analysis explains the performance of each approach, and the paper concludes that ResNet50 gives excellent performance. The VGG16 network also performs well enough to meet the needs of daily use.

A hybrid approach to adapting acoustic and pronunciation models for non-native speech recognition

A Hybrid Approach to Improving Automatic Speech Recognition Via NLP

A Novel Idea for Designing a Speech Recognition System Using Computer Vision Object Detection Techniques

A Model to Support Fluid Transitions between Environments for Mobile Augmented Reality Applications
