scholarly journals Microgenetic Learning Analytics Methods: Hands-on Sequence Analysis Workshop Report

2016 ◽  
Vol 3 (3) ◽  
pp. 96-114 ◽  
Author(s):  
Ani Aghababyan ◽  
Taylor Martin ◽  
Phillip Janisiewicz ◽  
Kevin Close

Learning analytics is an emerging discipline and, as such, it benefits from new tools and methodological approaches.  This work reviews and summarizes our workshop on microgenetic data analysis techniques using R, held at the 2nd annual Learning Analytics Summer Institute in Cambridge, Massachusetts on June 30th, 2014. Specifically, this paper introduces educational researchers to our experience using data analysis techniques with the RStudio development environment to analyze temporal records of 52 elementary students’ affective and behavioral responses to a digital learning environment. In the RStudio development environment, we used methods such as hierarchical clustering and sequential pattern mining. We also used RStudio to create effective data visualizations of our complex data. The scope of the workshop, and this paper, assumes little prior knowledge of the R programming language, and thus covers everything from data import and cleanup to advanced microgenetic analysis techniques. Additionally, readers will be introduced to software setup, R data types, and visualizations. This paper not only adds to the toolbox for learning analytics researchers (particularly when analyzing time series data), but also shares our experience interpreting a unique and complex dataset.

Temporal data clustering examines the time series data to determine the basic structure and other characteristics of the data. Many methodologies simply process the temporal dimension of data but it still faces the many challenges for extracting useful patterns due to complex data types. In order to analyze the complex temporal data, Hybridized Gradient Descent Spectral Graph and Local-Global Louvain Clustering (HGDSG-LGLC) technique are designed. The number of temporal data is gathered from input dataset. Then the HGDSG-LGLC technique performs graph-based clustering to partitions the vertices i.e. data into different clusters depending on similarity matrix spectrum. The distance similarity is measured between the data and cluster mean. The Gradient Descent function find minimum distance between data and cluster mean. Followed by, the Local-Global Louvain method performs the merging and filtering of temporal data to connect the local and global edges of the graph with similar data. Then for each data, the change in modularity is calculated for filtering the unwanted data from its own cluster and merging it into the neighboring cluster. As a result, optimal ‘k’ numbers of clusters are obtained with higher accuracy with minimum error rate. Experimental analysis is performed with various parameters like clustering accuracy ( ), error rate ( ), computation time ( ) and space complexity ( ) with respect to number of temporal data. The proposed HGDSG-LGLC technique achieves higher and lesser , minimum as well as than conventional methods.


Author(s):  
James Morrison ◽  
David Christie ◽  
Charles Greenwood ◽  
Ruairi Maciver ◽  
Arne Vogler

This paper presents a set of software tools for interrogating and processing time series data. The functionality of this toolset will be demonstrated using data from a specific deployment involving multiple sensors deployed for a specific time period. The approach was developed initially for Datawell Waverider MKII/MKII buoys [1] and expanded to include data from acoustic devices in this case Nortek AWACs. Tools of this nature are important to address a specific lack of features in the sensor manufacturers own tools. It also helps to develop standard approaches for dealing with anomalous data from sensors. These software tools build upon an effective modern interpreted programming language in this case Python which has access to high performance low level libraries. This paper demonstrates the use of these tools applied to a sensor network based on the North West coast of Scotland as described in [2,3]. Examples can be seen of computationally complex data being easily calculated for monthly averages. Analysis down to a wave by wave basis will also be demonstrated form the same source dataset. The tools make use of a flexible data structure called a DataFrame which supports mixed data types, hierarchical and time indexing and is also integrated with modern plotting libraries. This allows sub second querying and the ability for dynamic plotting of large datasets. By using modern compression techniques and file formats it is possible to process datasets which are larger than memory datasets without the need for a traditional relational database. The software library shall be of use to a wide variety of industry involved in offshore engineering along with any scientists interested in the coastal environment.


Agronomy ◽  
2019 ◽  
Vol 9 (3) ◽  
pp. 142 ◽  
Author(s):  
Chi-Hua Chen ◽  
Hsu-Yang Kung ◽  
Feng-Jang Hwang

This editorial introduces the Special Issue, entitled “Deep Learning (DL) Techniques for Agronomy Applications”, of Agronomy. Topics covered in this issue include three main parts: (I) DL-based image recognition techniques for agronomy applications, (II) DL-based time series data analysis techniques for agronomy applications, and (III) behavior and strategy analysis for agronomy applications. Three papers on DL-based image recognition techniques for agronomy applications are as follows: (1) “Automatic segmentation and counting of aphid nymphs on leaves using convolutional neural networks,” by Chen et al.; (2) “Estimating body condition score in dairy cows from depth images using convolutional neural networks, transfer learning, and model ensembling techniques,” by Alvarez et al.; and (3) “Development of a mushroom growth measurement system applying deep learning for image recognition,” by Lu et al. One paper on DL-based time series data analysis techniques for agronomy applications is as follows: “LSTM neural network based forecasting model for wheat production in Pakistan,” by Haider et al. One paper on behavior and strategy analysis for agronomy applications is as follows: “Research into the E-learning model of agriculture technology companies: analysis by deep learning,” by Lin et al.


2013 ◽  
Author(s):  
Stephen J. Tueller ◽  
Richard A. Van Dorn ◽  
Georgiy Bobashev ◽  
Barry Eggleston

2021 ◽  
Vol 83 (3) ◽  
Author(s):  
Maria-Veronica Ciocanel ◽  
Riley Juenemann ◽  
Adriana T. Dawes ◽  
Scott A. McKinley

AbstractIn developmental biology as well as in other biological systems, emerging structure and organization can be captured using time-series data of protein locations. In analyzing this time-dependent data, it is a common challenge not only to determine whether topological features emerge, but also to identify the timing of their formation. For instance, in most cells, actin filaments interact with myosin motor proteins and organize into polymer networks and higher-order structures. Ring channels are examples of such structures that maintain constant diameters over time and play key roles in processes such as cell division, development, and wound healing. Given the limitations in studying interactions of actin with myosin in vivo, we generate time-series data of protein polymer interactions in cells using complex agent-based models. Since the data has a filamentous structure, we propose sampling along the actin filaments and analyzing the topological structure of the resulting point cloud at each time. Building on existing tools from persistent homology, we develop a topological data analysis (TDA) method that assesses effective ring generation in this dynamic data. This method connects topological features through time in a path that corresponds to emergence of organization in the data. In this work, we also propose methods for assessing whether the topological features of interest are significant and thus whether they contribute to the formation of an emerging hole (ring channel) in the simulated protein interactions. In particular, we use the MEDYAN simulation platform to show that this technique can distinguish between the actin cytoskeleton organization resulting from distinct motor protein binding parameters.


2020 ◽  
Vol 44 (1) ◽  
pp. 35-50
Author(s):  
Anna Barth ◽  
Leif Karlstrom ◽  
Benjamin K. Holtzman ◽  
Arthur Paté ◽  
Avinash Nayak

Abstract Sonification of time series data in natural science has gained increasing attention as an observational and educational tool. Sound is a direct representation for oscillatory data, but for most phenomena, less direct representational methods are necessary. Coupled with animated visual representations of the same data, the visual and auditory systems can work together to identify complex patterns quickly. We developed a multivariate data sonification and visualization approach to explore and convey patterns in a complex dynamic system, Lone Star Geyser in Yellowstone National Park. This geyser has erupted regularly for at least 100 years, with remarkable consistency in the interval between eruptions (three hours) but with significant variations in smaller scale patterns between each eruptive cycle. From a scientific standpoint, the ability to hear structures evolving over time in multiparameter data permits the rapid identification of relationships that might otherwise be overlooked or require significant processing to find. The human auditory system is adept at physical interpretation of call-and-response or causality in polyphonic sounds. Methods developed here for oscillatory and nonstationary data have great potential as scientific observational and educational tools, for data-driven composition with scientific and artistic intent, and towards the development of machine learning tools for pattern identification in complex data.


Sign in / Sign up

Export Citation Format

Share Document