Indefinite Proximity Learning: A Review

Efficient learning of a data analysis task strongly depends on the data representation. Most methods rely on (symmetric) similarity or dissimilarity representations by means of metric inner products or distances, providing easy access to powerful mathematical formalisms like kernel or branch-and-bound approaches. Similarities and dissimilarities are, however, often naturally obtained by nonmetric proximity measures that cannot easily be handled by classical learning algorithms. Major efforts have been undertaken to provide approaches that can either directly be used for such data or to make standard methods available for these types of data. We provide a comprehensive survey for the field of learning with nonmetric proximities. First, we introduce the formalism used in nonmetric spaces and motivate specific treatments for nonmetric proximity data. Second, we provide a systematization of the various approaches. For each category of approaches, we provide a comparative discussion of the individual algorithms and address complexity issues and generalization properties. In a summarizing section, we provide a larger experimental study for the majority of the algorithms on standard data sets. We also address the problem of large-scale proximity learning, which is often overlooked in this context and of major importance to make the method relevant in practice. The algorithms we discuss are in general applicable for proximity-based clustering, one-class classification, classification, regression, and embedding approaches. In the experimental part, we focus on classification tasks.
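As a small illustration of what "nonmetric" can mean for a dissimilarity measure, the sketch below checks a dissimilarity matrix for violations of the triangle inequality, one of the metric axioms. The function name and the tolerance parameter are hypothetical choices made for this example and are not taken from the survey; other ways of being nonmetric (for instance asymmetry) are not covered.

```python
from itertools import permutations
from typing import List, Sequence, Tuple


def triangle_violations(
    d: Sequence[Sequence[float]], tol: float = 1e-12
) -> List[Tuple[int, int, int]]:
    """Return all index triples (i, j, k) with d[i][k] > d[i][j] + d[j][k].

    `d` is assumed to be a square matrix of pairwise dissimilarities.
    An empty result means the triangle inequality holds for every triple.
    """
    n = len(d)
    return [
        (i, j, k)
        for i, j, k in permutations(range(n), 3)
        if d[i][k] > d[i][j] + d[j][k] + tol
    ]


# A dissimilarity matrix where going 0 -> 1 -> 2 is "shorter" than 0 -> 2.
nonmetric = [[0, 1, 5], [1, 0, 1], [5, 1, 0]]
# A matrix that satisfies the triangle inequality for all triples.
metric = [[0, 1, 2], [1, 0, 1], [2, 1, 0]]
```

Calling `triangle_violations(nonmetric)` reports the offending triples, while the same call on `metric` returns an empty list. The brute-force loop is cubic in the number of objects, so it is only meant for small examples.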


Models, Practices and Methods of Reading: Evolution in Time and Space

Bibliotekovedenie [Library and Information Science (Russia)] ◽

10.25281/0869-608x-2009-0-1-59-64 ◽

2009 ◽

pp. 59-64

Author(s):

Yulia P. Melentyeva

Keyword(s):

Large Scale ◽

Social Groups ◽

National Program ◽

Time And Space ◽

Lack Of Information ◽

The Individual

In recent years, both the general public and specialists have shown great interest in matters of reading. Following the discussion and launch of the “Support and Development of Reading National Program”, many Russian libraries are organizing large-scale events such as marathons, lecture cycles, and bibliographic trainings, which are meant to draw the attention of different social groups to reading. Individual forms of attracting people to reading are used much more rarely. In the author’s view, the main reason for this is the lack of information about forms and methods of attracting people to reading.


Transformation of microbiology data into a standardised data representation using OpenEHR

Scientific Reports ◽

10.1038/s41598-021-89796-y ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Antje Wulff ◽


Claas Baier ◽

Sarah Ballout ◽

Erik Tute ◽

...

Keyword(s):

Real Life ◽

Primary Source ◽

Multidrug Resistant ◽

Data Representation ◽

Data Models ◽

Outbreak Detection ◽

Standard Data ◽

Quality Checks ◽

Computer Based ◽

Control Application

The spread of multidrug resistant organisms (MDRO) is a global healthcare challenge. Nosocomial outbreaks caused by MDRO are an important contributor to this threat. Computer-based applications facilitating outbreak detection can be essential to address this issue. To allow application reusability across institutions, the various heterogeneous microbiology data representations need to be transformed into standardised, unambiguous data models. In this work, we present a multi-centric standardisation approach using openEHR as the modelling standard. Data models have been agreed upon in a multicentre and international approach. Participating sites integrated microbiology reports from primary source systems into an openEHR-based data platform. For evaluation, we implemented a prototypical application, compared the transformed data with original reports and conducted automated data quality checks. We were able to develop standardised and interoperable microbiology data models. The publicly available data models can be used across institutions to transform real-life microbiology reports into standardised representations. The implementation of a proof-of-principle and quality control application demonstrated that the new formats as well as the integration processes are feasible. Holistic transformation of microbiological data into standardised openEHR-based formats is feasible in a real-life multicentre setting and lays the foundation for developing cross-institutional, automated outbreak detection systems.


Opportunistic Large Array Propagation Models: A Comprehensive Survey

Sensors ◽

10.3390/s21124206 ◽

2021 ◽

Vol 21 (12) ◽

pp. 4206

Author(s):

Farhan Nawaz ◽

Hemant Kumar ◽

Syed Ali Hassan ◽

Haejoon Jung

Keyword(s):

Energy Efficient ◽

Large Scale ◽

Performance Metrics ◽

Experimental Studies ◽

Cooperative Transmission ◽

Analytical Models ◽

Future Research ◽

Large Array ◽

Propagation Models ◽

Comprehensive Survey

Enabled by the fifth-generation (5G) and beyond 5G communications, large-scale deployments of Internet-of-Things (IoT) networks are expected in various application fields to handle massive machine-type communication (mMTC) services. Device-to-device (D2D) communications can be an effective solution in massive IoT networks to overcome the inherent hardware limitations of small devices. In such D2D scenarios, given that a receiver can benefit from the signal-to-noise-ratio (SNR) advantage through diversity and array gains, cooperative transmission (CT) can be employed, so that multiple IoT nodes can create a virtual antenna array. In particular, Opportunistic Large Array (OLA), which is one type of CT technique, is known to provide fast, energy-efficient, and reliable broadcasting and unicasting without prior coordination, which can be exploited in future mMTC applications. However, OLA-based protocol design and operation are subject to network models to characterize the propagation behavior and evaluate the performance. Further, it has been shown through some experimental studies that the most widely used model in prior studies on OLA is not accurate for networks with low node density. Therefore, stochastic models using quasi-stationary Markov chains are introduced, which are more complex but more accurate in estimating the key performance metrics of the OLA transmissions in practice. Considering the fact that such propagation models should be selected carefully depending on system parameters such as network topology and channel environments, we provide a comprehensive survey on the analytical models and framework of the OLA propagation in the literature, which is not available in the existing survey papers on OLA protocols. In addition, we introduce energy-efficient OLA techniques, which are of paramount importance in energy-limited IoT networks. Furthermore, we discuss future research directions to combine OLA with emerging technologies.


Network community structure of substorms using SuperMAG magnetometers

Nature Communications ◽

10.1038/s41467-021-22112-4 ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

L. Orr ◽

S. C. Chapman ◽

J. W. Gjerloev ◽

W. Guo

Keyword(s):

Large Scale ◽

Three Dimensional ◽

Easy Access ◽

Directed Network ◽

Coherent System ◽

Dimensional System ◽

Ionospheric Currents ◽

Substorm Current Wedge ◽

Spatially Extended ◽

Current Wedge

Geomagnetic substorms are a global magnetospheric reconfiguration, during which energy is abruptly transported to the ionosphere. Central to this are the auroral electrojets, large-scale ionospheric currents that are part of a larger three-dimensional system, the substorm current wedge. Many, often conflicting, magnetospheric reconfiguration scenarios have been proposed to describe the substorm current wedge evolution and structure. SuperMAG is a worldwide collaboration providing easy access to ground-based magnetometer data. Here we show the application of techniques from network science to analyze data from 137 SuperMAG ground-based magnetometers. We calculate a time-varying directed network and perform community detection on the network, identifying locally dense groups of connections. Analysis of 41 substorms exhibits robust structural change from many small, uncorrelated current systems before substorm onset, to a large spatially extended coherent system, approximately 10 minutes after onset. We interpret this as a strong indication that the auroral electrojet system during substorm expansions is inherently a large-scale phenomenon and is not solely due to many meso-scale wedgelets.


Flexibility Effects in a Single Link Robotic Manipulator

14th Biennial Conference on Mechanical Vibration and Noise: Vibrations and Dynamics of Robotic and Multibody Structures ◽

10.1115/detc1993-0150 ◽

1993 ◽

Author(s):

C. Nataraj

Keyword(s):

Large Scale ◽

Robotic Manipulator ◽

Flexible Beam ◽

Rotating Speed ◽

Weighted Residuals ◽

Individual Interaction ◽

Single Link ◽

Rigid Mass ◽

The Individual ◽

Future Work

A single link robotic manipulator is modeled as a rotating flexible beam with a rigid mass at the tip and accurate energy expressions are derived. The resulting partial differential equations are solved using an approximate method of weighted residuals. From the solutions, coupling between axial and flexural deformations and the interactions with rigid body motions are rigorously analyzed. The emphasis in the current paper is not on an exhaustive analysis of existing systems but it is rather intended to compare and highlight the various flexibility effects in a relatively simple system. Hence, a nondimensional parametric analysis is performed to determine the effect of several parameters (including the rotating speed) on the errors and the individual interaction effects are discussed. Comparison with previous work in the field shows important phenomena often ignored or buried in large-scale numerical analyses. Future work including application to multi-link robots is outlined.


HUMAN ENERGY UTILIZATION AND EFFECTIVENESS IN SMALL AND MARGINAL FARM HOLDINGS

International Journal of Research -GRANTHAALAYAH ◽

10.29121/granthaalayah.v6.i11.2018.1114 ◽

2018 ◽

Vol 6 (11) ◽

pp. 163-171

Author(s):

Tandra Mondal ◽

Pranab Kumar Nag

Keyword(s):

Large Scale ◽

Energy Utilization ◽

Hand Tools ◽

Present Contribution ◽

Land Preparation ◽

Farm Work ◽

Farm Mechanization ◽

Power Tiller ◽

Paddy Cultivation ◽

The Individual

In India, small and marginal farmers have emerged as a distinct and dominant category. While farm mechanization represents a rapid transformation from traditional to modern methods of farming, it is not uniform across crops and regions. The level of mechanization remains uneven, owing to constraints imposed by the economic layout of farm holdings, land size, and large-scale deprivation of access to technology suitable for small holdings. The present contribution elucidates the extent of use of tools and machinery among the rice farmers of the state of West Bengal, India. Analysis revealed that the total number of man-days involved in paddy cultivation was 120-140 per ha, i.e., 900-1000 man-hours, depending upon the availability of labour, tools, and machinery used for the individual operation. Analysis of farm work in small and marginal holdings showed that over 90% of the farmers use either a tractor or a power tiller for land preparation. Use of the animal-drawn country plough is gradually being phased out in the study regions. Sowing and transplanting operations are primarily carried out manually using hand tools. The study provided an insight into the issues of work methods and practices of the farmworkers in small and marginal farm holdings.


Making Soup: Preparing and Validating Molecular Simulations of the Bacterial Cytoplasm

10.26434/chemrxiv.10102289 ◽

2019 ◽

Author(s):

Leandro Oliveira Bortot ◽

Zahedeh Bashardanesh ◽

David van der Spoel

Keyword(s):

Large Scale ◽

Biological Properties ◽

Computational Power ◽

E Coli ◽

Solvent Models ◽

Explicit Solvent ◽

Computational Modeling And Simulation ◽

Dynamics Simulations ◽

The Individual ◽

Bacterial Cytoplasm

Biomolecular crowding affects the biophysical and biochemical behavior of macromolecules when compared to the dilute environment present in experiments made with isolated proteins. Computational modeling and simulation are useful tools to study how crowding affects the structural dynamics and biological properties of macromolecules. As computational power increased, modeling and simulating large-scale all-atom explicit solvent models of the prokaryote cytoplasm became possible. In this work, we build an atomistic model of the cytoplasm of Escherichia coli composed of 1.5 million atoms and submit it to a total of 3 μs of molecular dynamics simulations. The properties of biomolecules under crowding conditions are compared to those from simulations of the individual compounds under dilute conditions. The simulation model is found to be consistent with experimental data about the diffusion coefficient and stability of macromolecules under crowded conditions. In order to stimulate further work we provide a Python script and a set of files that enable other researchers to build their own E. coli cytoplasm models to address questions related to crowding.


Nonlinear visuoauditory integration in the mouse superior colliculus

10.1101/2021.01.26.428325 ◽

2021 ◽

Author(s):

Shinya Ito ◽

Yufei Si ◽

Alan M. Litke ◽

David A. Feldheim

Keyword(s):

Superior Colliculus ◽

Sensory Integration ◽

Large Scale ◽

Temporal Dynamics ◽

Critical Role ◽

Sensory Information ◽

Population Level ◽

Object Identification ◽

Response Properties ◽

The Individual

Sensory information from different modalities is processed in parallel, and then integrated in associative brain areas to improve object identification and the interpretation of sensory experiences. The Superior Colliculus (SC) is a midbrain structure that plays a critical role in integrating visual, auditory, and somatosensory input to assess saliency and promote action. Although the response properties of the individual SC neurons to visuoauditory stimuli have been characterized, little is known about the spatial and temporal dynamics of the integration at the population level. Here we recorded the response properties of SC neurons to spatially restricted visual and auditory stimuli using large-scale electrophysiology. We then created a general, population-level model that explains the spatial, temporal, and intensity requirements of stimuli needed for sensory integration. We found that the mouse SC contains topographically organized visual and auditory neurons that exhibit nonlinear multisensory integration. We show that nonlinear integration depends on properties of auditory but not visual stimuli. We also find that a heuristically derived nonlinear modulation function reveals conditions required for sensory integration that are consistent with previously proposed models of sensory integration such as spatial matching and the principle of inverse effectiveness.


Parallel Distributed Networks Resolved at High Resolution Reveal Close Juxtaposition of Distinct Regions

10.1101/475806 ◽

2018 ◽

Author(s):

Rodrigo M. Braga ◽

Koene R. A. Van Dijk ◽

Jonathan R. Polimeni ◽

Mark C. Eldaief ◽

Randy L. Buckner

Keyword(s):

High Resolution ◽

Large Scale ◽

Spatial Separation ◽

Distributed Networks ◽

Association Cortex ◽

Default Network ◽

Network Organization ◽

Parahippocampal Cortex ◽

The Individual ◽

Insight Into

Examination of large-scale distributed networks within the individual reveals details of cortical network organization that are absent in group-averaged studies. One recent discovery is that a distributed transmodal network, often referred to as the ‘default network’, is comprised of two separate but closely interdigitated networks, only one of which is coupled to posterior parahippocampal cortex. Not all studies of individuals have identified the same networks and questions remain about the degree to which the two networks are separate, particularly within regions hypothesized to be interconnected hubs. Here we replicate the observation of network separation across analytical (seed-based connectivity and parcellation) and data projection (volume and surface) methods in 2 individuals each scanned 31 times. Additionally, 3 individuals were examined with high-resolution fMRI to gain further insight into the anatomical details. The two networks were identified with separate regions localized to adjacent portions of the cortical ribbon, sometimes inside the same sulcus. Midline regions previously implicated as hubs revealed near complete spatial separation of the two networks, displaying a complex spatial topography in the posterior cingulate and precuneus. The network coupled to parahippocampal cortex also revealed a separate region directly within the hippocampus at or near the subiculum. These collective results support that the default network is composed of at least two spatially juxtaposed networks. Fine spatial details and juxtapositions of the two networks can be identified within individuals at high resolution, providing insight into the network organization of association cortex and placing further constraints on interpretation of group-averaged neuroimaging data.


VariantKey: A Reversible Numerical Representation of Human Genetic Variants

10.1101/473744 ◽

2018 ◽

Cited By ~ 1

Author(s):

Nicola Asuni ◽

Steven Wilder

Keyword(s):

Open Source Software ◽

Genetic Variants ◽

Data Representation ◽

Lookup Table ◽

Memory Allocation ◽

Computer Memory ◽

Numerical Representation ◽

Chromosome Position ◽

The Individual ◽

Genomic Regions

Human genetic variants are usually represented by four values with variable length: chromosome, position, reference and alternate alleles. There is no guarantee that these components are represented in a consistent way across different data sources, and processing variant-based data can be inefficient because four different comparison operations are needed for each variant, three of which are string comparisons. Existing variant identifiers do not typically represent every possible variant we may be interested in, nor are they directly reversible. Similarly, genomic regions are typically represented inconsistently by three or four values. Working with strings, in contrast to numbers, poses extra challenges for computer memory allocation and data representation. To overcome these limitations, a novel reversible numerical encoding schema for human genetic variants (VariantKey) and genomic regions (RegionKey) is presented here alongside a multi-language open-source software implementation (https://github.com/Genomicsplc/variantkey). VariantKey and RegionKey represent variants and regions as single 64-bit numeric entities, while preserving the ability to be searched and sorted by chromosome and position. The individual components of short variants can be directly read back from the VariantKey, while long variants are supported with a fast lookup table.
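The idea of a single sortable 64-bit key can be sketched as plain bit packing. The example below is only an illustration under stated assumptions: the bit widths (5 for the chromosome, 28 for the position, 31 for a combined reference/alternate code), the function names, and the treatment of the allele field as an opaque integer are choices made for this sketch and should not be read as the API of the published software.

```python
from typing import Tuple

# Assumed field widths for this sketch: 5 + 28 + 31 = 64 bits.
CHROM_BITS, POS_BITS, REFALT_BITS = 5, 28, 31


def pack_variant(chrom: int, pos: int, refalt: int) -> int:
    """Pack three non-negative integers into one 64-bit key (hypothetical helper).

    The chromosome occupies the most significant bits and the position the
    next ones, so integer order equals (chromosome, position) order.
    """
    for value, bits, name in (
        (chrom, CHROM_BITS, "chrom"),
        (pos, POS_BITS, "pos"),
        (refalt, REFALT_BITS, "refalt"),
    ):
        if not 0 <= value < (1 << bits):
            raise ValueError(f"{name} does not fit in {bits} bits")
    return (chrom << (POS_BITS + REFALT_BITS)) | (pos << REFALT_BITS) | refalt


def unpack_variant(key: int) -> Tuple[int, int, int]:
    """Reverse pack_variant: recover (chrom, pos, refalt) from the key."""
    refalt = key & ((1 << REFALT_BITS) - 1)
    pos = (key >> REFALT_BITS) & ((1 << POS_BITS) - 1)
    chrom = key >> (POS_BITS + REFALT_BITS)
    return chrom, pos, refalt
```

With this layout a single integer comparison replaces the separate chromosome and position comparisons, and `unpack_variant(pack_variant(c, p, r))` returns the original triple. How alleles are turned into the integer code, and how long variants fall back to a lookup table, is outside the scope of the sketch.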
