optimal representation Latest Research Papers

Season- and Trend-aware Symbolic Approximation for Accurate and Efficient Time Series Matching

Datenbank-Spektrum ◽

10.1007/s13222-021-00389-5 ◽

2021 ◽

Author(s):

Lars Kegel ◽

Claudio Hartmann ◽

Maik Thiele ◽

Wolfgang Lehner

Keyword(s):

Time Series ◽

State Of The Art ◽

Dimensional Space ◽

Symbolic Aggregate Approximation ◽

Current State ◽

Optimal Representation ◽

Symbolic Approximation ◽

Low Dimensional ◽

Deterministic Behavior ◽

Support Time

AbstractProcessing and analyzing time series datasets have become a central issue in many domains requiring data management systems to support time series as a native data type. A core access primitive of time series is matching, which requires efficient algorithms on-top of appropriate representations like the symbolic aggregate approximation (SAX) representing the current state of the art. This technique reduces a time series to a low-dimensional space by segmenting it and discretizing each segment into a small symbolic alphabet. Unfortunately, SAX ignores the deterministic behavior of time series such as cyclical repeating patterns or a trend component affecting all segments, which may lead to a sub-optimal representation accuracy. We therefore introduce a novel season- and a trend-aware symbolic approximation and demonstrate an improved representation accuracy without increasing the memory footprint. Most importantly, our techniques also enable a more efficient time series matching by providing a match up to three orders of magnitude faster than SAX.

Ganglion cells in larval zebrafish retina integrate inputs from multiple cone types

Journal of Neurophysiology ◽

10.1152/jn.00082.2021 ◽

2021 ◽

Author(s):

Victoria P Connaughton ◽

Ralph Francis Nelson

Keyword(s):

Danio Rerio ◽

Time Course ◽

Ganglion Cells ◽

Light Response ◽

Larval Zebrafish ◽

Cone Opsin ◽

Optimal Representation ◽

Discharge Rates ◽

Cone Opsins ◽

Spectral Responses

We recently showed the presence of 7 physiological cone opsins - R1 (575nm), R2 (556nm), G1 (460nm), G3 (480nm), B1 (415nm), B2 (440nm), UV (358nm) - in ERG recordings of larval zebrafish (Danio rerio) retina. Larval ganglion cells (GCs) are generally thought to integrate only 4 cone opsin signals (red, green blue and UV). We address the question as to whether they may integrate 7 cone spectral signals. Here, we examined the 127 possible combinations of 7 cone signals to find the optimal representation, as based on impulse discharge datasets from GC axons in the larval optic nerve. We recorded four varieties of light-response waveform: sustained-ON, transient-ON, ON-OFF, and OFF, based on the time course of mean discharge rates to all stimulus wavelengths combined. Modeling of GC responses revealed each received 1-6 cone opsin signals, with a mean of 3.8 ± 1.3 cone signals/GC. Most onset or offset responses were opponent (ON, 80%; OFF, 100%). The most common cone signals were UV (93%), R2 (50%), G3 (55%), and G1 (60%). 73% of cone opsin signals were excitatory, 27% were inhibitory. UV signals favored excitation, while G3 and B2 signals favored inhibition. R1/R2, G1/G3 and B1/B2 opsin signals were selectively associated along a non-synergistic/opponent axis. Overall, these results suggest that larval zebrafish GC spectral responses are complex and use inputs from the 7 expressed opsins.

Optimal Representation of the Nuclear Ensemble: Application to Electronic Spectroscopy

Journal of Chemical Theory and Computation ◽

10.1021/acs.jctc.1c00749 ◽

2021 ◽

Author(s):

Štěpán Sršeň ◽

Petr Slavíček

Keyword(s):

Electronic Spectroscopy ◽

Optimal Representation

Binaural Background Noise Enhances Neuromagnetic Responses from Auditory Cortex

Symmetry ◽

10.3390/sym13091748 ◽

2021 ◽

Vol 13 (9) ◽

pp. 1748

Author(s):

Dawei Shen ◽

Claude Alain ◽

Bernhard Ross

Keyword(s):

Auditory Cortex ◽

Background Noise ◽

Low Noise ◽

Harmonic Complex ◽

Optimal Representation ◽

Complex Tones ◽

Transient Events ◽

Auditory Evoked Fields ◽

Evoked Fields ◽

Auditory Cortices

The presence of binaural low-level background noise has been shown to enhance the transient evoked N1 response at about 100 ms after sound onset. This increase in N1 amplitude is thought to reflect noise-mediated efferent feedback facilitation from the auditory cortex to lower auditory centers. To test this hypothesis, we recorded auditory-evoked fields using magnetoencephalography while participants were presented with binaural harmonic complex tones embedded in binaural or monaural background noise at signal-to-noise ratios of 25 dB (low noise) or 5 dB (higher noise). Half of the stimuli contained a gap in the middle of the sound. The source activities were measured in bilateral auditory cortices. The onset and gap N1 response increased with low binaural noise, but high binaural and low monaural noise did not affect the N1 amplitudes. P1 and P2 onset and gap responses were consistently attenuated by background noise, and noise level and binaural/monaural presentation showed distinct effects. Moreover, the evoked gamma synchronization was also reduced by background noise, and it showed a lateralized reduction for monaural noise. The effects of noise on the N1 amplitude follow a bell-shaped characteristic that could reflect an optimal representation of acoustic information for transient events embedded in noise.

Optimal Representation of the Nuclear Ensemble: Application to Electronic Spectroscopy

10.33774/chemrxiv-2021-zrpzx ◽

2021 ◽

Author(s):

Štěpán Sršeň ◽

Petr Slavíček

Keyword(s):

Electronic Spectroscopy ◽

Optimal Representation

Tensor-tensor algebra for optimal representation and compression of multiway data

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.2015851118 ◽

2021 ◽

Vol 118 (28) ◽

pp. e2015851118

Author(s):

Misha E. Kilmer ◽

Lior Horesh ◽

Haim Avron ◽

Elizabeth Newman

Keyword(s):

Large Datasets ◽

High Dimensional ◽

Tensor Algebra ◽

Tensor Representation ◽

Optimal Representation ◽

Data Dimensionality Reduction ◽

The Matrix ◽

Optimality Properties ◽

Value Decomposition ◽

High Dimensional Datasets

With the advent of machine learning and its overarching pervasiveness it is imperative to devise ways to represent large datasets efficiently while distilling intrinsic features necessary for subsequent analysis. The primary workhorse used in data dimensionality reduction and feature extraction has been the matrix singular value decomposition (SVD), which presupposes that data have been arranged in matrix format. A primary goal in this study is to show that high-dimensional datasets are more compressible when treated as tensors (i.e., multiway arrays) and compressed via tensor-SVDs under the tensor-tensor product constructs and its generalizations. We begin by proving Eckart–Young optimality results for families of tensor-SVDs under two different truncation strategies. Since such optimality properties can be proven in both matrix and tensor-based algebras, a fundamental question arises: Does the tensor construct subsume the matrix construct in terms of representation efficiency? The answer is positive, as proven by showing that a tensor-tensor representation of an equal dimensional spanning space can be superior to its matrix counterpart. We then use these optimality results to investigate how the compressed representation provided by the truncated tensor SVD is related both theoretically and empirically to its two closest tensor-based analogs, the truncated high-order SVD and the truncated tensor-train SVD.

Microbiome Preprocessing Machine Learning Pipeline

Frontiers in Immunology ◽

10.3389/fimmu.2021.677870 ◽

2021 ◽

Vol 12 ◽

Author(s):

Yoel Jasner ◽

Anna Belogolovski ◽

Meirav Ben-Itzhak ◽

Omry Koren ◽

Yoram Louzoun

Keyword(s):

Machine Learning ◽

Standard Test ◽

Optimal Combination ◽

Reduction Step ◽

Limited Effect ◽

16S Sequencing ◽

16S Gene ◽

Optimal Representation ◽

Test Sets ◽

Classification Tasks

Background16S sequencing results are often used for Machine Learning (ML) tasks. 16S gene sequences are represented as feature counts, which are associated with taxonomic representation. Raw feature counts may not be the optimal representation for ML.MethodsWe checked multiple preprocessing steps and tested the optimal combination for 16S sequencing-based classification tasks. We computed the contribution of each step to the accuracy as measured by the Area Under Curve (AUC) of the classification.ResultsWe show that the log of the feature counts is much more informative than the relative counts. We further show that merging features associated with the same taxonomy at a given level, through a dimension reduction step for each group of bacteria improves the AUC. Finally, we show that z-scoring has a very limited effect on the results.ConclusionsThe prepossessing of microbiome 16S data is crucial for optimal microbiome based Machine Learning. These preprocessing steps are integrated into the MIPMLP - Microbiome Preprocessing Machine Learning Pipeline, which is available as a stand-alone version at: https://github.com/louzounlab/microbiome/tree/master/Preprocess or as a service at http://mip-mlp.math.biu.ac.il/Home Both contain the code, and standard test sets.

Comparative analysis of molecular representations in prediction of cancer drug combination synergy and sensitivity

10.1101/2021.04.16.439299 ◽

2021 ◽

Author(s):

Bulat Zagidullin ◽

Ziyan Wang ◽

Yuanfang Guan ◽

Esa Pitkänen ◽

Jing Tang

Keyword(s):

Drug Combination ◽

High Throughput Screening ◽

Data Driven ◽

Cancer Drug ◽

Molecular Fingerprints ◽

Rule Based ◽

Drug Synergy ◽

The Past ◽

Optimal Representation ◽

Preclinical Drug Development

Application of machine and deep learning (ML/DL) methods in drug discovery and cancer research has gained a considerable amount of attention in the past years. As the field grows, it becomes crucial to systematically evaluate the performance of novel DL solutions in relation to established techniques. To this end we compare rule-based and data-driven molecular representations in prediction of drug combination sensitivity and drug synergy scores using standardized results of 14 high throughput screening studies, comprising 64,200 unique combinations of 4,153 molecules tested in 112 cancer cell lines. We evaluate the clustering performance of molecular fingerprints and quantify their similarity by adapting Centred Kernel Alignment metric. Our work demonstrates that in order to identify an optimal representation type it is necessary to supplement quantitative benchmark results with qualitative considerations, such as model interpretability and robustness, which may vary between and throughout preclinical drug development projects.

Karakteristik Permukiman Kota yang Memiliki Potensi Usaha Berbasis Rumah Tangga

SADE : Jurnal Arsitektur, Planologi dan Teknik Sipil ◽

10.29303/sade.v1i1.9 ◽

2021 ◽

Vol 1 (1) ◽

pp. 42-50

Author(s):

Ima Rahmawati Sushanti ◽

Intan Savia Fitri ◽

Febrita Susanti

Keyword(s):

Development Strategy ◽

Secondary Data ◽

Industrial Clusters ◽

Physical Characteristics ◽

Urban Settlement ◽

Optimal Representation ◽

Data Collection And Analysis ◽

Urban Settlements ◽

The City ◽

Area Development

Urban settlement is a built environment in an urban area that plays a role in determining the structure and identity of the city. The urban settlement area is currently not only used as a residence equipped with facilities and infrastructure to meet the living needs of the residents who live in it, but also to meet their economic needs. Urban settlements have certain characteristics based on the community and activities in them so that they can become the identity of the area. The existence of the Mutiara, Gold and Silver industrial clusters in Sekabela sub-district, Mataram city has implications for the surrounding settlements, both in economic, environmental and social aspects. The emergence of slum settlements in the residences around the Pearl, Gold and Silver industry causes less optimal representation of the area as a shopping tourism area. This study aims to determine the characteristics of settlements with household-based business potential and development strategies. The method used in this research is descriptive qualitative with primary and secondary data collection and analysis of Strength, Weakness, Opportunity and Threat. The results showed that the characteristics of the settlement were based on physical characteristics, namely: building layout, housing, facilities and infrastructure as well as the environment and non-physical characteristics, namely: the community and the activities that took place in it. The area development strategy based on settlement characteristics is in quadrant IV, namely the Competitive Strategy. Efforts are being made to improve the visual quality or image of the area, diversify the business and develop markets.

Information theory and dimensionality of space

Scientific Reports ◽

10.1038/s41598-020-77855-9 ◽

2020 ◽

Vol 10 (1) ◽

Author(s):

Subhash Kak

Keyword(s):

Information Theory ◽

Hausdorff Dimension ◽

Hubble Constant ◽

Theoretical Explanation ◽

Physical Principle ◽

Theoretic Approach ◽

Information Theoretic ◽

Optimal Representation ◽

Information Theoretic Approach ◽

Dimensionality Of Space

AbstractWe present an information-theoretic approach to the optimal representation of the intrinsic dimensionality of data and show it is a noninteger. Since optimality is accepted as a physical principle, this provides a theoretical explanation for why noninteger dimensions are useful in many branches of physics, where they have been introduced based on experimental considerations. Noninteger dimensions correlate with lesser density as in the Hausdorff dimension and this can have measurable effects. We use the lower density of noninteger dimension to resolve the problem of two different values of the Hubble constant obtained using different methods.

optimal representation
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Season- and Trend-aware Symbolic Approximation for Accurate and Efficient Time Series Matching

Ganglion cells in larval zebrafish retina integrate inputs from multiple cone types

Optimal Representation of the Nuclear Ensemble: Application to Electronic Spectroscopy

Binaural Background Noise Enhances Neuromagnetic Responses from Auditory Cortex

Optimal Representation of the Nuclear Ensemble: Application to Electronic Spectroscopy

Tensor-tensor algebra for optimal representation and compression of multiway data

Microbiome Preprocessing Machine Learning Pipeline

Comparative analysis of molecular representations in prediction of cancer drug combination synergy and sensitivity

Karakteristik Permukiman Kota yang Memiliki Potensi Usaha Berbasis Rumah Tangga

Information theory and dimensionality of space

Export Citation Format

optimal representationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Season- and Trend-aware Symbolic Approximation for Accurate and Efficient Time Series Matching

Ganglion cells in larval zebrafish retina integrate inputs from multiple cone types

Optimal Representation of the Nuclear Ensemble: Application to Electronic Spectroscopy

Binaural Background Noise Enhances Neuromagnetic Responses from Auditory Cortex

Optimal Representation of the Nuclear Ensemble: Application to Electronic Spectroscopy

Tensor-tensor algebra for optimal representation and compression of multiway data

Microbiome Preprocessing Machine Learning Pipeline

Comparative analysis of molecular representations in prediction of cancer drug combination synergy and sensitivity

Karakteristik Permukiman Kota yang Memiliki Potensi Usaha Berbasis Rumah Tangga

Information theory and dimensionality of space

optimal representation
Recently Published Documents