Fast pitch tracking process for LTP‐based speech coders

Abstract Electron tomography includes four main steps: tomography data acquisition, image processing, 3D reconstruction, and visualization. After acquisition, tilt-series alignments are performed. Two methods are used to align the tilt-series: cross-correlation and feature tracking. Normally, about 10-20 nm of fiducial markers, such as gold beads, are deposited onto one side of 100 mesh carbon-coated grids during the feature-tracking process. This paper presents a novel method for preparing electron tomography samples with gold beads inside to improve the feature tracking process and quality of 3D reconstruction. Results show that the novel electron tomography sample preparation method improves image alignment, which is essential for successful tomography in many contemporary semiconductor device structures.

Max/MSP/Jitter for Music

10.1093/oso/9780199777679.001.0001 ◽

2011 ◽

Author(s):

V. J Manzo

Keyword(s):

Video Game ◽

Music Technology ◽

Camera Tracking ◽

Instructional Tools ◽

Music Therapists ◽

Interactive Music ◽

Pitch Tracking ◽

Special Value ◽

And Performance ◽

Private Studio

In Max/MSP/Jitter for Music, expert author and music technologist V. J. Manzo provides a user-friendly introduction to a powerful programming language that can be used to write custom software for musical interaction. Through clear, step-by-step instructions illustrated with numerous examples of working systems, the book equips you with everything you need to know in order to design and complete meaningful music projects. The book also discusses ways to interact with software beyond the mouse and keyboard through use of camera tracking, pitch tracking, video game controllers, sensors, mobile devices, and more. This book will be of special value for everyone who teaches music at any level, from classroom instructors to ensemble directors to private studio instructors. Whether you want to create simple exercises for beginning performers or more complex programs for aspiring composers, this book will show you how to write customized software that can complement and even inspire your instructional objectives. No specialist foreknowledge is required to use this book to enliven your experience with music technology. Even musicians with no prior programming skills can learn to supplement their lessons with interactive instructional tools, to develop adaptive instruments to aid in composition and performance activities, and to create measurement tools with which to conduct research. 
This book allows you to:
- Learn how to design meaningful projects for composition, performance, music therapy, instruction, and research
- Understand powerful software through this accessible introduction, written for beginners
- Follow along through step-by-step tutorials
- Grasp the principles by downloading the extensive software examples from the companion website

This book is ideal for:
- Music educators at all levels looking to integrate software in instruction
- Musicians interested in how software can improve their practice and performance
- Music composers with an interest in designing interactive music
- Music therapists looking to tailor programs to the needs of specific groups or individuals
- All who are interested in music technology

Visit the companion website at www.oup.com/us/maxmspjitter

Multiple objects tracking in the UAV system based on hierarchical deep high-resolution network

Multimedia Tools and Applications ◽

10.1007/s11042-020-10427-1 ◽

2021 ◽

Author(s):

Wei Huang ◽

Xiaoshu Zhou ◽

Mingchao Dong ◽

Huaiyu Xu

Keyword(s):

High Resolution ◽

Object Tracking ◽

High Performance ◽

State Of The Art ◽

Class Imbalance ◽

Unified Framework ◽

Multiple Objects ◽

Tracking Process ◽

Objects Tracking ◽

Different Types

Abstract Robust and high-performance visual multi-object tracking is a big challenge in computer vision, especially in a drone scenario. In this paper, an online Multi-Object Tracking (MOT) approach in the UAV system is proposed to handle small target detections and class imbalance challenges, which integrates the merits of deep high-resolution representation network and data association method in a unified framework. Specifically, while applying tracking-by-detection architecture to our tracking framework, a Hierarchical Deep High-resolution network (HDHNet) is proposed, which encourages the model to handle different types and scales of targets, and extract more effective and comprehensive features during online learning. After that, the extracted features are fed into different prediction networks for interesting targets recognition. Besides, an adjustable fusion loss function is proposed by combining focal loss and GIoU loss to solve the problems of class imbalance and hard samples. During the tracking process, these detection results are applied to an improved DeepSORT MOT algorithm in each frame, which is available to make full use of the target appearance features to match one by one on a practical basis. The experimental results on the VisDrone2019 MOT benchmark show that the proposed UAV MOT system achieves the highest accuracy and the best robustness compared with state-of-the-art methods.
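
The abstract above mentions an adjustable fusion loss that combines focal loss and GIoU loss but does not give its exact form. The following is only a minimal sketch of one plausible combination, not the authors' formulation: the names `focal_loss`, `giou`, and `fusion_loss`, the single `weight` parameter, and the linear blend are all assumptions made for illustration. Boxes are assumed to be `(x1, y1, x2, y2)` tuples with positive area.

```python
import math


def focal_loss(p, y, alpha=0.25, gamma=2.0):
    """Binary focal loss for one predicted probability p in (0, 1) and label y in {0, 1}."""
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha
    # The (1 - p_t) ** gamma factor down-weights easy, well-classified samples.
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


def giou(a, b):
    """Generalized IoU of two axis-aligned boxes (x1, y1, x2, y2) with positive area."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    # Smallest axis-aligned box enclosing both inputs.
    hull = (max(a[2], b[2]) - min(a[0], b[0])) * (max(a[3], b[3]) - min(a[1], b[1]))
    return inter / union - (hull - union) / hull


def fusion_loss(p, y, pred_box, true_box, weight=0.5):
    """Hypothetical blend: weight * focal + (1 - weight) * (1 - GIoU)."""
    return weight * focal_loss(p, y) + (1.0 - weight) * (1.0 - giou(pred_box, true_box))
```

In this sketch, raising `weight` emphasizes the classification term (class imbalance and hard samples), while lowering it emphasizes box regression; how the paper actually tunes the balance is not stated in the abstract.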

An IoT-Based Participatory Antitheft System for Public Safety Enhancement in Smart Cities

Smart Cities ◽

10.3390/smartcities4020047 ◽

2021 ◽

Vol 4 (2) ◽

pp. 919-937

Author(s):

Nikos Papadakis ◽

Nikos Koukoulas ◽

Ioannis Christakis ◽

Ilias Stavrakas ◽

Dionisis Kandris

Keyword(s):

Smart Cities ◽

Low Cost ◽

Negative Influence ◽

Asset Tracking ◽

Tracking Tasks ◽

Tracking Process ◽

The Impact ◽

Operation Center ◽

Safety Enhancement ◽

Selection Of

The risk of theft of goods is certainly an important source of negative influence in human psychology. This article focuses on the development of a scheme that, despite its low cost, acts as a smart antitheft system that achieves small property detection. Specifically, an Internet of Things (IoT)-based participatory platform was developed in order to allow asset-tracking tasks to be crowd-sourced to a community. Stolen objects are traced by using a prototype Bluetooth Low Energy (BLE)-based system, which sends signals, thus becoming a beacon. Once such an item (e.g., a bicycle) is stolen, the owner informs the authorities, which, in turn, broadcast an alert signal to activate the BLE sensor. To trace the asset with the antitheft tag, participants use their GPS-enabled smart phones to scan BLE tags through a specific smartphone client application and report the location of the asset to an operation center so that owners can locate their assets. A stolen item tracking simulator was created to support and optimize the aforementioned tracking process and to produce the best possible outcome, evaluating the impact of different parameters and strategies regarding the selection of how many and which users to activate when searching for a stolen item within a given area.

Multi-Scale Anti-Occlusion Correlation Filters Object Tracking Method Based on Complementary Features

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001421550028 ◽

2020 ◽

pp. 2155002

Author(s):

Xiuhua Hu ◽

Yuan Chen ◽

Yan Hui ◽

Yingyu Liang ◽

Guiping Li ◽

...

Keyword(s):

Object Tracking ◽

Scale Model ◽

Correlation Filters ◽

Motion Feature ◽

Tracking Method ◽

Object State ◽

Multi Scale ◽

Tracking Process ◽

Object Appearance ◽

Response Peak

Aiming to tackle the problem of tracking drift easily caused by complex factors during the tracking process, this paper proposes an improved object tracking method under the framework of kernel correlation filter. To achieve discriminative information that is not sensitive to object appearance change, it combines dimensionality-reduced Histogram of Oriented Gradients features and Lab color features, which can be used to exploit the complementary characteristics robustly. Based on the idea of multi-resolution pyramid theory, a multi-scale model of the object is constructed, and the optimal scale for tracking the object is found according to the confidence maps’ response peaks of different sizes. For the case that tracking failure can easily occur when there exists inappropriate updating in the model, it detects occlusion based on whether the occlusion rate of the response peak corresponding to the best object state is less than a set threshold. At the same time, Kalman filter is used to record the motion feature information of the object before occlusion, and predict the state of the object disturbed by occlusion, which can achieve robust tracking of the object affected by occlusion influence. Experimental results show the effectiveness of the proposed method in handling various internal and external interferences under challenging environments.
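
The abstract above describes using a Kalman filter to record the object's motion before occlusion and to predict its state while it is occluded. The sketch below illustrates that general idea only, under simplifying assumptions that are not taken from the paper: a one-dimensional constant-velocity model, a hypothetical class `ConstantVelocityKalman`, illustrative noise values `q` and `r`, and a measurement of `None` standing in for a frame that the occlusion test has flagged.

```python
class ConstantVelocityKalman:
    """1-D constant-velocity Kalman filter with state (position, velocity)."""

    def __init__(self, pos, vel=0.0, p=1.0, q=1e-3, r=0.1):
        self.x = [pos, vel]               # state estimate
        self.P = [[p, 0.0], [0.0, p]]     # state covariance
        self.q = q                        # process noise (assumed value)
        self.r = r                        # measurement noise (assumed value)

    def predict(self, dt=1.0):
        """Propagate the state one step: x = F x, P = F P F^T + Q."""
        pos, vel = self.x
        self.x = [pos + dt * vel, vel]
        (a, b), (c, d) = self.P
        self.P = [[a + dt * (b + c) + dt * dt * d + self.q, b + dt * d],
                  [c + dt * d, d + self.q]]
        return self.x[0]

    def update(self, z):
        """Correct the state with a position measurement z (H = [1, 0])."""
        (a, b), (c, d) = self.P
        s = a + self.r                    # innovation covariance
        k0, k1 = a / s, c / s             # Kalman gain
        resid = z - self.x[0]
        self.x = [self.x[0] + k0 * resid, self.x[1] + k1 * resid]
        self.P = [[(1.0 - k0) * a, (1.0 - k0) * b],
                  [c - k1 * a, d - k1 * b]]


def track(measurements, kf):
    """Predict every frame; update only when the object is visible (z is not None)."""
    estimates = []
    for z in measurements:
        kf.predict()
        if z is not None:
            kf.update(z)
        estimates.append(kf.x[0])
    return estimates
```

For example, `track([1.0, 2.0, 3.0, None, None], ConstantVelocityKalman(pos=0.0))` keeps extrapolating the position along the learned velocity during the two occluded frames. A real tracker would use a two-dimensional state and would skip the correlation-filter model update on those frames as well.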

A Multi-Feature Fusion Slam System Attaching Semantic Invariant to Points and Lines

Sensors ◽

10.3390/s21041196 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1196

Author(s):

Gang Li ◽

Yawen Zeng ◽

Huilan Huang ◽

Shaojian Song ◽

Bin Liu ◽

...

Keyword(s):

Feature Matching ◽

Feature Fusion ◽

Error Function ◽

Line Segments ◽

Cumulative Error ◽

Localization And Mapping ◽

Tracking Process ◽

Line Features ◽

Point Line ◽

Segment Data

The traditional simultaneous localization and mapping (SLAM) system uses static points of the environment as features for real-time localization and mapping. When there are few available point features, the system is difficult to implement. A feasible solution is to introduce line features. In complex scenarios containing rich line segments, the description of line segments is not strongly differentiated, which can lead to incorrect association of line segment data, thus introducing errors into the system and aggravating the cumulative error of the system. To address this problem, a point-line stereo visual SLAM system incorporating semantic invariants is proposed in this paper. This system improves the accuracy of line feature matching by fusing line features with image semantic invariant information. When defining the error function, the semantic invariant is fused with the reprojection error function, and the semantic constraint is applied to reduce the cumulative error of the poses in the long-term tracking process. Experiments on the Office sequence of the TartanAir dataset and the KITTI dataset show that this system improves the matching accuracy of line features and suppresses the cumulative error of the SLAM system to some extent, and the mean relative pose error (RPE) is 1.38 and 0.0593 m, respectively.

Monaural speech segregation based on pitch tracking and amplitude modulation

IEEE International Conference on Acoustics Speech and Signal Processing ◽

10.1109/icassp.2002.5743777 ◽

2002 ◽

Cited By ~ 16

Author(s):

Guoning Hu ◽

DeLiang Wang

Keyword(s):

Amplitude Modulation ◽

Pitch Tracking ◽

Speech Segregation

On optimization of the tracking process

Journal of Applied Mathematics and Mechanics ◽

10.1016/0021-8928(69)90116-6 ◽

1969 ◽

Vol 33 (1) ◽

pp. 93-104 ◽

Cited By ~ 2

Author(s):

F.L. Chernous'ko

Keyword(s):

Tracking Process

An algorithm for multi-pitch tracking in co-channel speech

10.21437/interspeech.2008-33 ◽

2008 ◽

Cited By ~ 1

Author(s):

Srikanth Vishnubhotla ◽

Carol Y. Espy-Wilson

Keyword(s):

Pitch Tracking

Prosody of Ojibwe Discourse Markers

Inquiry@Queen's Undergraduate Research Conference Proceedings ◽

10.24908/iqurcp.14025 ◽

2020 ◽

Author(s):

Sonja Frazier

Keyword(s):

Great Lakes ◽

Narrative Structure ◽

Discourse Markers ◽

Pitch Tracking ◽

Discourse Particles ◽

Global Coherence ◽

Complex Picture ◽

And Linguistics

Discourse markers (DMs) are optional, sequentially dependent sentence-initial items (Schiffrin, 1987) that are used to bracket units of talk (e.g. oh, well, because, y’know, now). This research aims to better understand Ojibwe DMs, which typically occur as the first or second element of a sentence (Fairbanks, 2016). The proposed analysis seeks to understand the prosody of Ojibwe DMs broadly and specifically their use in narrative structure. The data is drawn from Gakina Dibaajimowin Gwayakwaawan (All Teachings are Correct) by Nancy Jones, 2013. The analysis was done by using the programs Audacity and PRAAT to identify individual sentences and their pitch prominences. Through careful listening and pitch tracking, prosodic properties of DMs were found to indicate the following: DMs attract the most prominent pitch in the sentence. DMs are used by the speaker to attract the hearer’s attention; in this sense they are interactional (Franks-Job, 2006). DMs are used by the speaker to structure the narration; as such they interact with topic changes and emphasis (Lenk, 1998). This study creates a more complex picture of Ojibwe DMs and adds to our understanding of the language.

References:
Fairbanks, B. 2016. Ojibwe Discourse Markers. University of Nebraska Press.
Franks-Job, B. 2006. A dynamic-interactional approach to discourse markers. In Approaches to discourse particles, K. Fischer (ed.) pp. 395–413. Amsterdam: Elsevier.
Lenk, U. 1998. Discourse markers and global coherence in conversation. Journal of Pragmatics 30(2):246-257.
Ogimaawigwaebiik [Nancy Jones] 2013. Gakina Dibaajimowin Gwayakwaawan. In Dibaajimowinaan; Anishinaabe Stories of Culture and respect; Nigaanigiizhig [Jim Saint-Arnold] (ed.), Great Lakes Indian Fish & Wildlife Commission, 9-10.
Raso, Tommaso. 1996. Prosodic constraints for discourse markers. Spoken Corpora and Linguistic Studies. In Spoken Corpora and Linguistics Studies, T. Raso & H. Mello (eds.) 411-467. Benjamins: Amsterdam.
Schiffrin, D. 1987. Discourse Markers. doi: 10.1017/cbo9780511611841.
