Extracting Multimodal Dynamics of Objects Using RNNPB

2005 ◽  
Vol 17 (6) ◽  
pp. 681-688 ◽  
Author(s):  
Tetsuya Ogata ◽  
◽  
Hayato Ohba ◽  
Jun Tani ◽  
Kazunori Komatani ◽  
...  

Dynamic features play an important role in recognizing objects that have similar static features in color or shape. This paper focuses on active sensing that exploits the dynamic feature of an object. An extended version of the robot, Robovie-IIs, uses its arms to move an object and determine its dynamic features. At issue is how to extract symbols from different temporal states of the object. We use a <I>recurrent neural network with parametric bias</I> (RNNPB) that generates self-organized nodes in parametric bias space. We trained an RNNPB with 42 neurons using data on sounds, trajectories, and tactile sensors generated while the robot was moving or hitting an object with its arm. Clusters of 20 types of objects were self-organized. Experiments with unknown (untrained) objects showed that our proposal configured them appropriately in PB space, demonstrating its <I>generalization</I>.

2021 ◽  
Author(s):  
Minseop Park ◽  
Hyeok Choi ◽  
Hee-Sung Ahn ◽  
Hee-Ju Kang ◽  
Saehoon Kim ◽  
...  

BACKGROUND A pressure ulcer (PU) is a localized cutaneous injury caused by pressure or shear, which usually occurs in the region of a bony prominence. PUs are common in hospitalized patients and cause complications including infection. OBJECTIVE This study aimed to build a recurrent neural network-based algorithm to predict PUs 24 hours before their occurrence. METHODS This study analyzed a freely accessible intensive care unit (ICU) dataset, MIMIC- III. Deep learning and machine learning algorithms including long short-term memory (LSTM), multilayer perceptron (MLP), and XGBoost were applied to 37 dynamic features (including the Braden scale, vital signs and laboratory results, and interventions to reduce the risk of PUs) and 35 static features (including the length of time spent in the ICU, demographics, and comorbidities). Their outcomes were compared in terms of the area under the receiver operating characteristic (AUROC) and the area under the precision-recall curve (AUPRC). RESULTS A total of 1,048 cases of PUs (10.0%) and 9,402 controls (90.0%) without PUs satisfied the inclusion criteria for analysis. The LSTM + MLP model (AUROC: 0.7929 ± 0.0095, AUPRC: 0.4819 ± 0.0109) outperformed the other models, namely: MLP model (AUROC: 0.7777 ± 0.0083, AUPRC: 0.4527 ± 0.0195) and XGBoost (AUROC: 0.7465 ± 0.0087, AUPRC: 0.4052 ± 0.0087). Various features, including the length of time spent in the ICU, Glasgow coma scale, and the Braden scale, contributed to the prediction model. CONCLUSIONS This study suggests that recurrent neural network-based algorithms such as LSTM can be applied to evaluate the risk of PUs in ICU patients.


2020 ◽  
Vol 10 (18) ◽  
pp. 6381 ◽  
Author(s):  
Pongsathon Janyoi ◽  
Pusadee Seresangtakul

The modeling of fundamental frequency (F0) in speech synthesis is a critical factor affecting the intelligibility and naturalness of synthesized speech. In this paper, we focus on improving the modeling of F0 for Isarn speech synthesis. We propose the F0 model for this based on a recurrent neural network (RNN). Sampled values of F0 are used at the syllable level of continuous Isarn speech combined with their dynamic features to represent supra-segmental properties of the F0 contour. Different architectures of the deep RNNs and different combinations of linguistic features are analyzed to obtain conditions for the best performance. To assess the proposed method, we compared it with several RNN-based baselines. The results of objective and subjective tests indicate that the proposed model significantly outperformed the baseline RNN model that predicts values of F0 at the frame level, and the baseline RNN model that represents the F0 contours of syllables by using discrete cosine transform.


Author(s):  
К.П. Соловьева ◽  
K.P. Solovyeva

In this article, we describe a simple binary neuron system, which implements a self-organized map. The system consists of R input neurons (R receptors), and N output neurons of a recurrent neural network. The neural network has a quasi-continuous set of attractor states (one-dimensional “bump attractor”). Due to the dynamics of the network, each external signal (i.e. activity state of receptors) imposes transition of the recurrent network into one of its stable states (points of its attractor). That makes our system different from the “winner takes all” construction of T.Kohonen. In case, when there is a one-dimensional cyclical manifold of external signals in R-dimensional input space, and the recurrent neural network presents a complete ring of neurons with local excitatory connections, there exists a process of learning of connections between the receptors and the neurons of the recurrent network, which enables a topologically correct mapping of input signals into the stable states of the neural network. The convergence rate of learning and the role of noises and other factors affecting the described phenomenon has been evaluated in computational simulations.


2019 ◽  
Vol 7 ◽  
pp. 421-436 ◽  
Author(s):  
Ion Madrazo Azpiazu ◽  
Maria Soledad Pera

We present a multiattentive recurrent neural network architecture for automatic multilingual readability assessment. This architecture considers raw words as its main input, but internally captures text structure and informs its word attention process using other syntax- and morphology-related datapoints, known to be of great importance to readability. This is achieved by a multiattentive strategy that allows the neural network to focus on specific parts of a text for predicting its reading level. We conducted an exhaustive evaluation using data sets targeting multiple languages and prediction task types, to compare the proposed model with traditional, state-of-the-art, and other neural network strategies.


HFSP Journal ◽  
2009 ◽  
Vol 3 (5) ◽  
pp. 340-349 ◽  
Author(s):  
Joschka Boedecker ◽  
Oliver Obst ◽  
N. Michael Mayer ◽  
Minoru Asada

2020 ◽  
Author(s):  
Matthew G. Perich ◽  
Charlotte Arlt ◽  
Sofia Soares ◽  
Megan E. Young ◽  
Clayton P. Mosher ◽  
...  

ABSTRACTBehavior arises from the coordinated activity of numerous anatomically and functionally distinct brain regions. Modern experimental tools allow unprecedented access to large neural populations spanning many interacting regions brain-wide. Yet, understanding such large-scale datasets necessitates both scalable computational models to extract meaningful features of interregion communication and principled theories to interpret those features. Here, we introduce Current-Based Decomposition (CURBD), an approach for inferring brain-wide interactions using data-constrained recurrent neural network models that directly reproduce experimentally-obtained neural data. CURBD leverages the functional interactions inferred by such models to reveal directional currents between multiple brain regions. We first show that CURBD accurately isolates inter-region currents in simulated networks with known dynamics. We then apply CURBD to multi-region neural recordings obtained from mice during running, macaques during Pavlovian conditioning, and humans during memory retrieval to demonstrate the widespread applicability of CURBD to untangle brain-wide interactions underlying behavior from a variety of neural datasets.


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Yingxue Zhang ◽  
Zhe Li

Computer music creation boasts broad application prospects. It generally relies on artificial intelligence (AI) and machine learning (ML) to generate the music score that matches the original mono-symbol score model or memorize/recognize the rhythms and beats of the music. However, there are very few music melody synthesis models based on artificial neural networks (ANNs). Some ANN-based models cannot adapt to the transposition invariance of original rhythm training set. To overcome the defect, this paper tries to develop an automatic synthesis technology of music teaching melodies based on recurrent neural network (RNN). Firstly, a strategy was proposed to extract the acoustic features from music melody. Next, the sequence-sequence model was adopted to synthetize general music melodies. After that, an RNN was established to synthetize music melody with singing melody, such as to find the suitable singing segments for the music melody in teaching scenario. The RNN can synthetize music melody with a short delay solely based on static acoustic features, eliminating the need for dynamic features. The proposed model was proved valid through experiments.


2020 ◽  
Vol 10 (14) ◽  
pp. 4721
Author(s):  
Gunwoo Lee

Accidents involving marine crew members and passengers are still an issue that must be studied and obviated. Preventing such accidents at sea can improve the quality of life on board by ensuring a safe ship environment. This paper proposes a hybrid indoor positioning method, an approach which is becoming common on land, to enhance maritime safety. Specifically, a recurrent neural network (RNN)-based hybrid localization system (RHLS) that provides accurate and efficient user-tracking results is proposed. RHLS performs hybrid positioning by receiving wireless signals, such as Wi-Fi and Bluetooth, as well as inertial measurement unit data from smartphones. It utilizes the RNN to solve the problem of tracking accuracy reduction that may occur when using data collected from various sensors at various times. The results of experiments conducted in an offshore environment confirm that RHLS provides accurate and efficient tracking results. The scalability of RHLS provides managers with more intuitive monitoring of assets and crews, and, by providing information such as the location of safety equipment to the crew, it promotes welfare and safety.


Sign in / Sign up

Export Citation Format

Share Document