A Reconfigurable Posit Tensor Unit with Variable-Precision Arithmetic and Automatic Data Streaming

Author(s):  
Nuno Neves ◽  
Pedro Tomás ◽  
Nuno Roma
2014 ◽  
Vol 7 (7) ◽  
pp. 7137-7174 ◽  
Author(s):  
I. Žliobaitė ◽  
J. Hollmén ◽  
H. Junninen

Abstract. Statistical models for environmental monitoring strongly rely on automatic data acquisition systems using various physical sensors. Often, sensor readings are missing for extended periods of time while model outputs need to be continuously available in real time. With a case study in solar radiation nowcasting, we investigate how to deal with massively missing data (around 50% of the time some data are unavailable) in such situations. Our goal is to analyze the characteristics of missing data and recommend a strategy for deploying regression models that are robust to missing data in situations where data are massively missing. We are after one model that performs well at all times, with and without data gaps. Due to the need to provide instantaneous outputs with minimum energy consumption for computing in the data streaming setting, we dismiss computationally demanding data imputation methods and resort to a simple mean replacement. We use an established strategy for comparing different regression models, with the possibility of determining how many missing sensor readings can be tolerated before model outputs become obsolete. We experimentally analyze the accuracies and robustness to missing data of seven linear regression models and recommend using regularized PCA regression, together with our established guideline for training regression models that are themselves robust to missing data.
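
A minimal sketch (in Python, with illustrative names; not code from the paper) of the simple mean-replacement strategy the abstract describes: each missing sensor reading is filled with that sensor's running mean, which keeps the per-sample cost trivially low in a streaming setting.

```python
import numpy as np

class MeanImputer:
    """Running per-sensor means used to fill missing readings on the fly.

    Illustrative sketch only; the streaming interface and sensor layout
    are assumptions, not taken from the paper.
    """

    def __init__(self, n_sensors):
        self.sums = np.zeros(n_sensors)
        self.counts = np.zeros(n_sensors)

    def update(self, reading):
        """Fill NaNs with the running mean, then update the running statistics."""
        reading = np.asarray(reading, dtype=float)
        means = np.divide(self.sums, self.counts,
                          out=np.zeros_like(self.sums),
                          where=self.counts > 0)
        filled = np.where(np.isnan(reading), means, reading)
        observed = ~np.isnan(reading)
        self.sums[observed] += reading[observed]
        self.counts[observed] += 1
        return filled


# Example: a reading with two missing sensors is completed with running means.
imputer = MeanImputer(n_sensors=4)
imputer.update([1.0, 2.0, 3.0, 4.0])
print(imputer.update([np.nan, 2.5, np.nan, 4.5]))  # -> [1.  2.5 3.  4.5]
```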


2014 ◽  
Vol 7 (12) ◽  
pp. 4387-4399 ◽  
Author(s):  
I. Žliobaitė ◽  
J. Hollmén ◽  
H. Junninen

Abstract. Statistical models for environmental monitoring strongly rely on automatic data acquisition systems that use various physical sensors. Often, sensor readings are missing for extended periods of time, while model outputs need to be continuously available in real time. With a case study in solar-radiation nowcasting, we investigate how to deal with massively missing data (around 50% of the time some data are unavailable) in such situations. Our goal is to analyze the characteristics of missing data and recommend a strategy for deploying regression models that are robust to missing data in situations where data are massively missing. We are after one model that performs well at all times, with and without data gaps. Due to the need to provide instantaneous outputs with minimum energy consumption for computing in the data streaming setting, we dismiss computationally demanding data imputation methods and resort to a mean replacement accompanied by a robust regression model. We use an established strategy for assessing different regression models and for determining how many missing sensor readings can be tolerated before model outputs become obsolete. We experimentally analyze the accuracies and robustness to missing data of seven linear regression models. We recommend using regularized PCA regression, following our established guideline for training regression models that are themselves robust to missing data.
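
To make the recommended pipeline concrete, the sketch below approximates "regularized PCA regression" as mean imputation, standardization, PCA, and ridge regression in scikit-learn; the synthetic data, component count and regularization strength are illustrative assumptions, not values or code from the study.

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.linear_model import Ridge

# Synthetic stand-in for sensor data with ~50% of readings missing at random;
# the actual study uses solar-radiation measurements.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 12))
y = X[:, :3].sum(axis=1) + 0.1 * rng.normal(size=500)
X[rng.random(X.shape) < 0.5] = np.nan

model = make_pipeline(
    SimpleImputer(strategy="mean"),   # mean replacement of missing readings
    StandardScaler(),                 # put all sensors on a common scale
    PCA(n_components=5),              # project onto principal components
    Ridge(alpha=1.0),                 # regularized regression on the components
)
model.fit(X, y)
print(model.score(X, y))              # R^2 on the (imputed) training data
```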


Author(s):  
B. Ralph ◽  
A.R. Jones

In all fields of microscopy there is an increasing interest in the quantification of microstructure. This interest may stem from a desire to establish quality control parameters or may have a more fundamental requirement involving the derivation of parameters which partially or completely define the three-dimensional nature of the microstructure. This latter category of study may arise from an interest in the evolution of microstructure or from a desire to generate detailed property/microstructure relationships. In the more fundamental studies, some convolution of two-dimensional data into the third dimension (stereological analysis) will be necessary. In some cases the two-dimensional data may be acquired relatively easily without recourse to automatic data collection and, further, it may prove possible to perform the data reduction and analysis relatively easily. In such cases the only recourse to machines may well be in establishing the statistical confidence of the resultant data. Such relatively straightforward studies tend to result from acquiring data on the whole assemblage of features making up the microstructure. In this field-data mode, when parameters such as phase volume fraction, mean size, etc. are sought, the main case for resorting to automation is the need to perform repetitive analyses, since each individual analysis is relatively easily performed.
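
As an illustration of such a field-data measurement, the sketch below uses the classical stereological relation A_A = V_V (the area fraction of a phase on a random section estimates its volume fraction) on a synthetic segmented image; the threshold and image are stand-ins, not data from the text.

```python
import numpy as np

# Synthetic stand-in for a thresholded micrograph: True where the phase lies.
rng = np.random.default_rng(1)
section = rng.random((512, 512)) < 0.3

# Area fraction on the 2D section estimates the 3D volume fraction (A_A = V_V);
# repeating over many fields tightens the statistical confidence of the estimate.
area_fraction = section.mean()
print(f"Estimated volume fraction: {area_fraction:.3f}")
```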


1978 ◽  
Vol 17 (01) ◽  
pp. 36-40 ◽  
Author(s):  
J.-P. Durbec ◽  
Jaqueline Cornée ◽  
P. Berthezene

The practice of systematic examinations in hospitals and the increasing development of automatic data processing permit the storing of a great deal of information about a large number of patients belonging to different diagnosis groups. To predict or to characterize these diagnosis groups, some descriptors are particularly useful, while others carry no information. Data screening based on the properties of mutual information and on the log cross-product ratios in contingency tables is developed. The most useful descriptors are selected, and for each one the characterized groups are specified. This approach has been applied to a set of binary (presence/absence) radiological variables. Four diagnosis groups are concerned: cancer of the pancreas, chronic calcifying pancreatitis, non-calcifying pancreatitis and probable pancreatitis. Only twenty of the three hundred and forty initial radiological variables are selected. The presence of each corresponding sign is associated with one or more diagnosis groups.
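
The following sketch illustrates, but does not reproduce, the kind of screening statistics named in the abstract: mutual information between a binary radiological sign and the diagnosis-group label, and the log cross-product (odds) ratio of the corresponding 2x2 contingency table. The toy data and the 0.5 continuity correction are illustrative assumptions.

```python
import numpy as np

def mutual_information(sign, group):
    """Mutual information (in nats) between a binary sign and a group label."""
    sign, group = np.asarray(sign), np.asarray(group)
    mi = 0.0
    for s in np.unique(sign):
        for g in np.unique(group):
            p_sg = np.mean((sign == s) & (group == g))
            p_s, p_g = np.mean(sign == s), np.mean(group == g)
            if p_sg > 0:
                mi += p_sg * np.log(p_sg / (p_s * p_g))
    return mi

def log_cross_product_ratio(sign, in_group):
    """Log cross-product (odds) ratio of the 2x2 sign-by-group table,
    with a 0.5 continuity correction to avoid division by zero."""
    a = np.sum(sign & in_group) + 0.5     # sign present, in group
    b = np.sum(sign & ~in_group) + 0.5    # sign present, other groups
    c = np.sum(~sign & in_group) + 0.5    # sign absent, in group
    d = np.sum(~sign & ~in_group) + 0.5   # sign absent, other groups
    return np.log((a * d) / (b * c))

# Toy example: one radiological sign over 8 patients in two diagnosis groups.
sign = np.array([1, 1, 1, 0, 0, 1, 0, 0], dtype=bool)
group = np.array(["cancer", "cancer", "cancer", "cancer",
                  "pancreatitis", "pancreatitis", "pancreatitis", "pancreatitis"])
print(mutual_information(sign, group))
print(log_cross_product_ratio(sign, group == "cancer"))
```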


Author(s):  
Manbir Sandhu ◽  
Purnima, Anuradha Saini

Big data is a fast-growing technology with the scope to mine huge amounts of data for use in various analytic applications. With large amounts of data streaming in from a myriad of sources (social media, online transactions and the ubiquity of smart devices), Big Data is garnering attention across all stakeholders, from academia and banking to government, health care, manufacturing and retail. Big Data refers to an enormous amount of data generated from disparate sources, along with the data analytic techniques used to examine this voluminous data for predictive trends and patterns, to exploit new growth opportunities, to gain insight, to make informed decisions and to optimize processes. Data-driven decision making is the essence of business establishments. The explosive growth of data is steering business units to tap the potential of Big Data to fuel growth and gain a cutting edge over their competitors. The overwhelming generation of data also brings with it its share of concerns. This paper discusses the concept of Big Data, its characteristics, the tools and techniques deployed by organizations to harness the power of Big Data, and the daunting issues that hinder the adoption of Business Intelligence in Big Data strategies in organizations.


2020 ◽  
Vol 2020 (17) ◽  
pp. 34-1-34-7
Author(s):  
Matthew G. Finley ◽  
Tyler Bell

This paper presents a novel method for accurately encoding 3D range geometry within the color channels of a 2D RGB image that allows the encoding frequency, and therefore the encoding precision, to be uniquely determined for each coordinate. The proposed method can thus be used to balance encoding precision against file size by encoding geometry along a normal distribution: encoding more precisely where the density of data is high and less precisely where the density is low. Alternative distributions may be followed to produce encodings optimized for specific applications. In general, the nature of the proposed encoding method is such that the precision of each point can be freely controlled or derived from an arbitrary distribution, enabling its use within a wide range of applications.
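
The sketch below is not the authors' RGB encoding; it only illustrates the underlying idea of density-adaptive precision: fit a normal distribution to the depth values, shrink the quantization step where the fitted density is high, and observe that round-trip error is smallest where the data are densest. The scale factors and synthetic depth map are arbitrary assumptions.

```python
import numpy as np

# Synthetic range data (meters), concentrated around a single depth.
rng = np.random.default_rng(2)
depth = rng.normal(loc=2.0, scale=0.25, size=10_000)

# Fit a normal distribution to the depth values and evaluate its (unnormalized) density.
mu, sigma = depth.mean(), depth.std()
density = np.exp(-0.5 * ((depth - mu) / sigma) ** 2)

# Per-sample quantization step: finer where the fitted density is high.
base_step = 1e-2                               # coarsest quantization step (m)
step = base_step / (1.0 + 15.0 * density)
encoded = np.round(depth / step)               # integer codes, one per sample
decoded = encoded * step                       # decoder recovers step from (mu, sigma)

err = np.abs(decoded - depth)
print(f"mean error near the mode: {err[density > 0.9].mean():.2e} m")
print(f"mean error in the tails:  {err[density < 0.1].mean():.2e} m")
```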

