Discovering Quality Knowledge from Relational Databases

Author(s):  
M. Mehdi Owrang O.

Current database technology involves processing a large volume of data in order to discover new knowledge. However, knowledge discovery on just the most detailed and recent data does not reveal the long-term trends. Relational databases create new types of problems for knowledge discovery since they are normalized to avoid redundancies and update anomalies, which make them unsuitable for knowledge discovery. A key issue in any discovery system is to ensure the consistency, accuracy, and completeness of the discovered knowledge. We describe the aforementioned problems associated with the quality of the discovered knowledge and provide some solutions to avoid them.

2009 ◽  
pp. 238-256 ◽  
Author(s):  
M. Mehdi Owrang O.

Current database technology involves processing a large volume of data in order to discover new knowledge. However, knowledge discovery on just the most detailed and recent data does not reveal the long-term trends. Relational databases create new types of problems for knowledge discovery since they are normalized to avoid redundancies and update anomalies, which make them unsuitable for knowledge discovery. A key issue in any discovery system is to ensure the consistency, accuracy, and completeness of the discovered knowledge. We describe the aforementioned problems associated with the quality of the discovered knowledge and provide some solutions to avoid them.


Author(s):  
M. Mehdi Owrang O.

Current database technology involves processing a large volume of data in order to discover new knowledge. However, knowledge discovery on just the most detailed and recent data does not reveal the long-term trends. Relational databases create new types of problems for knowledge discovery since they are normalized to avoid redundancies and update anomalies, which make them unsuitable for knowledge discovery. A key issue in any discovery system is to ensure the consistency, accuracy, and completeness of the discovered knowledge. We describe the aforementioned problems associated with the quality of the discovered knowledge and provide some solutions to avoid them.


2007 ◽  
Vol 12 (1) ◽  
pp. 37-62 ◽  
Author(s):  
Paul Lambert ◽  
Kenneth Prandy ◽  
Wendy Bottero

This paper discusses long term trends in patterns of intergenerational social mobility in Britain. We argue that there is convincing empirical evidence of a small but steady linear trend towards increasing social mobility throughout the period 1800-2004. Our conclusions are based upon the construction and analysis of an extended micro-social dataset, which combines records from an historical genealogical study, with responses from 31 sample surveys conducted over the period 1963-2004. There has been much previous study of trends in social mobility, and little consensus on their nature. We argue that this dissension partly results from the very slow pace of change in mobility rates, which makes the time-frame of any comparison crucial, and raises important methodological questions about how long-term change in mobility is best measured. We highlight three methodological difficulties which arise when trying to draw conclusions over mobility trends - concerning the extent of controls for life course effects; the quality of data resources; and the measurement of stratification positions. After constructing a longitudinal dataset which attempts to confront these difficulties, our analyses provide robust evidence which challenges hitherto more popular, politicised claims of declining or unchanging mobility. By contrast, our findings suggest that Britain has moved, and continues to move, steadily towards increasing equality in the relationship between occupational attainment and parental background.


2011 ◽  
Vol 20 (01) ◽  
pp. 146-155
Author(s):  
A. V. Alekseyenko ◽  
Y. Aphinyanaphongs ◽  
S. Brown ◽  
D. Fenyo ◽  
L. Fu ◽  
...  

SummaryTo survey major developments and trends in the field of Bioinformatics in 2010 and their relationships to those of previous years, with emphasis on long-term trends, on best practices, on quality of the science of informatics, and on quality of science as a function of informatics.A critical review of articles in the literature of Bioinformatics over the past year.Our main results suggest that Bioinformatics continues to be a major catalyst for progress in Biology and Translational Medicine, as a consequence of new assaying technologies, most predominantly Next Generation Sequencing, which are changing the landscape of modern biological and medical research. These assays critically depend on bioinformatics and have led to quick growth of corresponding informatics methods development. Clinical-grade molecular signatures are proliferating at a rapid rate. However, a highly publicized incident at a prominent university showed that deficiencies in informatics methods can lead to catastrophic consequences for important scientific projects. Developing evidence-driven protocols and best practices is greatly needed given how serious are the implications for the quality of translational and basic science.Several exciting new methods have appeared over the past 18 months, that open new roads for progress in bioinformatics methods and their impact in biomedicine. At the same time, the range of open problems of great significance is extensive, ensuring the vitality of the field for many years to come.


2008 ◽  
Vol 22 (1) ◽  
pp. 129-152 ◽  
Author(s):  
Richard H Steckel

When economists investigate long-term trends and socioeconomic differences in the standard of living or quality of life, they have traditionally focused on monetary measures such as gross domestic product—which has occupied center stage for over 50 years. In recent decades, however, scholars have increasingly recognized the limitations of monetary measures while seeking useful alternatives. This essay examines the unique and valuable contributions of four biological measures—life expectancy, morbidity, stature, and certain features of skeletal remains—to understand levels and changes in human well-being. People desire far more than material goods and in fact they are quite willing to trade or give up material things in return for better physical or psychological health. For most people, health is so important to their quality of life that it is useful to refer to the “biological standard of living.” Biological measures may be especially valuable for historical studies and for other research circumstances where monetary measures are thin or lacking. A concluding section ruminates on the future evolution of biological approaches in measuring happiness.


Author(s):  
Ryo Onishi ◽  
Yosuke Hatakeyama ◽  
Kunichika Matsumoto ◽  
Kanako Seto ◽  
Koki Hirata ◽  
...  

Previous studies indicated that optimal care for pneumonia during hospitalization might reduce the risk of in-hospital mortality and subsequent readmission. This study was a retrospective observational study using Japanese administrative claims data from April 2010 to March 2019. We analyzed data from 167,120 inpatients with pneumonia ≥15 years old in the benchmarking project managed by All Japan Hospital Association. Hospital-level risk-adjusted ratios of 30-day readmission for pneumonia were calculated using multivariable logistic regression analyses. The Spearman’s correlation coefficient was used to assess the correlation in each consecutive period. In the analysis using complete 9-year data including 54,756 inpatients, the hospital standardized readmission ratios (HSRRs) showed wide variation among hospitals and improvement trend (r = −0.18, p = 0.03). In the analyses of trends in each consecutive period, the HSRRS were positively correlated between ‘2010–2012’ and ‘2013–2015’ (r = 0.255, p = 0.010), and ‘2013–2015’ and ‘2016–2018’ (r = 0.603, p < 0.001). This study denoted the HSRRs for pneumonia could be calculated using Japanese administrative claims data. The HSRRs significantly varied among hospitals with comparable case-mix, and could relatively evaluate the quality of preventing readmission including long-term trends. The HSRRs can be used as yet another measure to help improve quality of care over time if other indicators are examined in parallel.


1987 ◽  
Vol 44 (3) ◽  
pp. 515-524 ◽  
Author(s):  
Thomas F. Nalepa

A benthic survey of 40 stations in southern Lake Michigan in 1980–81 was compared with results of studies conducted in the mid-1960's to evaluate long-term trends in populations. The three major benthic groups, Pontoporeia, oligochaetes, and sphaeriids, were significantly more abundant in 1980–81. Compared with the most comprehensive survey (conducted in 1964–67), Pontoporeia increased two- to fivefold at depths less than 50 m, but did not increase at depths greater than 50 m. Oligochaetes increased two- to threefold regardless of sampling depth, while sphaeriids increased twofold at depths less than 50 m but decreased somewhat at depths greater than 50 m. Although oligochaete abundances increased, overall shifts in species composition were not apparent. Changes in abundances of Pontoporeia may have been related to shifts in predation pressure from forage fish populations, but increased oligochaete abundances can likely be related to increased enrichment between the two sampling periods. Although apparent improvements in the water quality of Lake Michigan since the mid-1970's were not yet reflected by benthic populations in 1980–81, continued monitoring of the benthos should provide a useful indicator of future trends.


Paleobiology ◽  
2015 ◽  
Vol 41 (1) ◽  
pp. 16-32 ◽  
Author(s):  
Jered A. Karr ◽  
Matthew E. Clapham

AbstractInsect taphonomy is a topic that has drawn interest because of its potential biases on diversity patterns and the ecological information recorded by ancient insect faunas. Other than the onset of common amber fossilization in the Cretaceous, very little is known about long-term trends in the nature and quality of insect preservation and, as a result, the effects of taphonomic biases are poorly constrained. We assembled a database of nearly 7000 Carboniferous–Pliocene insect adpression (compression and impression) species from the primary literature to assess changes in insect taphonomy over time and test biotic and environmental controls on preservation. We grouped the fossils into 10-Myr bins and scored preservation of holotype specimens as either articulated bodies or isolated wings; articulated specimens with a body implied a generally higher quality of preservation. Paleozoic and Triassic insect holotypes are known overwhelmingly from isolated wings (only 12% articulated bodies), but our database shows a significant increase in the percentage preserved as articulated bodies, to more than 70%, beginning about 160 Myr ago in the Late Jurassic. This transition could reflect variations in the robustness of different insect orders and shifts in the taxonomic composition of insect faunas, but all the major orders in the database exhibit significant increases in articulation. Instead, a shift to increased preservation in lacustrine paleoenvironments, which contain a greater proportion of articulated body fossils, explains most of the trend. The pronounced Late Jurassic increase in articulation has implications for evolutionary and ecological reconstructions, for example, suggesting that preserved insect diversity may be biased downward in the earlier part of their history when articulation was poor.


2021 ◽  
Vol 14 (12) ◽  
pp. 593
Author(s):  
Ibrahim Filiz ◽  
Jan René Judek ◽  
Marco Lorenz ◽  
Markus Spiwoks

Technological progress in recent years has made new methods available for making forecasts in a variety of areas. We examine the success of ex-ante stock market forecasts of three major stock market indices, i.e., the German Stock Market Index (DAX), the Dow Jones Industrial Index (DJI), and the Euro Stoxx 50 (SX5E). We test whether the forecasts prove true when they reach their effective dates and are therefore suitable for active investment strategies. We revive the thoughts of the American sociologist William Fielding Ogburn, who argues that forecasters consistently underestimate the variability of the future. In addition, we draw on some contemporary measures of forecast quality (prediction-realization diagram, test of unbiasedness, and Diebold–Mariano test). We reveal that (a) unusual events are underrepresented in the forecasts, (b) the dispersion of the forecasts lags behind that of the actual events, (c) the slope of the regression lines in the prediction-realization diagram is <1, (d) the forecasts are highly biased, and (e) the quality of the forecasts is not significantly better than that of naïve forecasts. The overall behavior of the forecasters can be described as “sticky” because their forecasts adhere too strongly to long-term trends in the indices and are thus characterized by conservatism.


Sign in / Sign up

Export Citation Format

Share Document