Concept-Based Mining Model

Author(s):  
Shady Shehata ◽  
Fakhri Karray ◽  
Mohamed Kamel

Most of text mining techniques are based on word and/or phrase analysis of the text. Statistical analysis of a term frequency captures the importance of the term within a document only. However, two terms can have the same frequency in their documents, but one term contributes more to the meaning of its sentences than the other term. Thus, the underlying model should indicate terms that capture the semantics of text. In this case, the model can capture terms that present the concepts of the sentence, which leads to discover the topic of the document. A new concept-based mining model that relies on the analysis of both the sentence and the document, rather than, the traditional analysis of the document dataset only is introduced. The concept-based model can effectively discriminate between non-important terms with respect to sentence semantics and terms which hold the concepts that represent the sentence meaning. The proposed model consists of concept-based statistical analyzer, conceptual ontological graph representation, and concept extractor. The term which contributes to the sentence semantics is assigned two different weights by the concept-based statistical analyzer and the conceptual ontological graph representation. These two weights are combined into a new weight. The concepts that have maximum combined weights are selected by the concept extractor. The concept-based model is used to enhance the quality of the text clustering, categorization and retrieval significantly.

Author(s):  
PRADNYA S. RANDIVE ◽  
NITIN N. PISE

In text mining most techniques depends on statistical analysis of terms. Statistical analysis trances important terms within document only. However this concept based mining model analyses terms in sentence, document and corpus level. This mining model consist of sentence based concept analysis, document based and corpus based concept analysis and concept based similarity measure. Experimental result enhances text clustering quality by using sentence, document, corpus and combined approach of concept analysis.


Fermentation ◽  
2019 ◽  
Vol 5 (4) ◽  
pp. 89 ◽  
Author(s):  
Vinko Krstanović ◽  
Kristina Mastanjević ◽  
Viktor Nedović ◽  
Krešimir Mastanjević

This paper aimed to investigate the influence of certain wheat and wheat malt quality indicators on limit of attenuation of wort (LAT). The experiment was conducted using wheats that have been proven to display the best malting properties with heightened total and soluble N and very good viscosity. Standard micromalting and brewing processes and analysis were applied. The obtained results showed that the quality of analyzed malts was satisfying. Statistical analysis determined no significant correlation between the limit of attenuation of wort and any of the other analyzed quality indicators. The lack of close correlations between indicators is probably due to the extremely complex intertwine of factors influencing the LAT, pointing to the fact that this particular indicator should be observed as separate and mainly variety-dependent.


1936 ◽  
Vol 26 (2) ◽  
pp. 189-211 ◽  
Author(s):  
S. J. Watson ◽  
W. S. Ferguson

An experiment was carried out with two groups of ten cows each, made up of two Guernseys, two Ayrshires, two Friesians and four Shorthorns.The experiment was of the change-over type, the experimental period of 20 weeks being subdivided into four periods of 5 weeks, each cow alternating between the two treatments.In two of the periods a normal winter ration of roots, hay and concentrates was fed. In the other two periods artificially dried grass replaced a proportion of the concentrates, an average of 8 lb. being fed per head daily. The two types of ration provided equal amounts of starch equivalent and protein equivalent, but the carotene intake was greater in the “dried grass ration”.A statistical analysis of the difference in milk yields due to the contrast “Dried grass” v. “Control” revealed no signs of any effect, and if any actual effect does exist, it is quite negligible for the 5-week periods of this experiment.


Author(s):  
P. Bodor ◽  
M. Gaál ◽  
M. Tóth

Fruit quality of cross pollinated apples (Malus x domestica) influenced by the metaxenic pollen effect of the pollinizer was observed in Hungary. Flowers of three resistant cultivars (`Baujade', `Rewena') were hand pollinated with other resistant apple cultivars. Fruits were harvested on 25 September, 2005. Fruit quality was investigated in the laboratory of the Department of Pomology; Corvinus University of Budapest. Not only size and morphological parameters (diameter, height, stem length), but also refraction and acidic content of the fruits were measured. According to the statistical analysis significant differences were determined on fruits among the groups as an effect of the pollen provider. In consideration of size parameters (diameter, height, weight) of `Rewena' fruits pollination partner 'Freedom' and 'Prima' caused outstanding results but `Florina' caused flatter fruits. Pollen of `Florina' and `Freedor-,' caused a higher percent refraction in the fruits of `Rewena'. In the case of `Baujade' fruits `Reglindis' — among cultivars we used as pollinizer — caused the biggest fruits medium flesh firmness and harmonic inner content values. `Rajka' caused on one hand smaller fruits and on the other hand higher flesh firmness and inner content values in the case of `Relinda' fruits. According to our data measured pollinizers varied the stem length as well.


2021 ◽  
Vol 258 ◽  
pp. 07044
Author(s):  
Dmitri Pletnev ◽  
Victor Barkhatov

The quality of life plays a crucial role in ensuring sustainable development and improving human interaction with the environment, solving environmental problems. On the other hand, there is a tendency for the outflow of both people and their capital from peripheral regions to the centers, worsening the quality of life throughout the country. The article assesses the quality of life in the Urals and Volga regions using the center-periphery framework. The data of the regional statistics of Rosstat and the data of the RA RIA rating were used. The article uses the methods of statistical analysis, generalization, and abstraction. The stable types of regions (Center, Periphery 1, Periphery 2) were identified, the type of each region was identified. The assessment of trends in the level of monetary incomes, meat consumption, the number of tourists traveling abroad, and other life quality indicators by groups of regions. It is concluded that the division of regions according to the quality of life is stable, and the differences only increase.


2011 ◽  
Vol 3 (2) ◽  
pp. 219-223 ◽  
Author(s):  
Uday Bhan Prajapati ◽  
Anil K. Dwivedi

Industries discharge their effluents which are rich in solids, may it be in the form of TSS or TDS. These solids affect the other physicochemical parameters of the water body. Present study deals with the investigation of seasonal variation and statistical analyses of the selected parameters, in river Ami, in light of the industrial effluents. The study records that summer season, appears to be the most polluted, that is during the period when the river carries little amount of water. Statistical analysis showed that all the physicochemical parameters were positively correlated except TDS and temperature.


2010 ◽  
Vol 22 (10) ◽  
pp. 1360-1371 ◽  
Author(s):  
Shady Shehata ◽  
Fakhri Karray ◽  
Mohamed Kamel

2017 ◽  
Vol 64 (3) ◽  
pp. 305-322
Author(s):  
Jan Purczyński ◽  
Kamila Bednarz-Okrzyńska

A new model for a dependent variable taking the value 0 or 1 (binary, dichotomous) was proposed. The name of the proposed model – the raybit model – stems from the fact that the probability corresponds to the Rayleigh cumulative distribution function. The assessment of the quality of selected models was conducted with the use of four definitions of error: MSE, MAE, WMSE, WMAE. Two computational examples were considered, which proved that the raybit model yields smaller values of error than the logit and probit models. Computer simulations were conducted using a random number generator with a binomial distribution. They proved that for the values of the theoretical probabilityfor the interval Pi ∈ [0; 0.8] the raybit model outperforms the other two models yielding a smaller value of error.


2021 ◽  
Vol 31 ◽  
pp. 1-21
Author(s):  
Martha Ríos Manríquez

This article identifies the factors that influence business performance (BP) in the construction, trade, and services sectors, as well as sub-sectors and branches of the manufacturing sector of small and mid-size enterprises (SME) in the state of Guanajuato, Mexico. A quantitative, descriptive, and correlational statistical analysis was performed on a sample of 460 enterprises, estimating a linear regression model using the ordinary least squares (OLS) method. Empirical evidence reveals that the construction, trade, and services sectors agree that profitability, efficient internal processes, and low labor absenteeism are those factors that mostly influence BP. On the other hand, in sub-sectors of low-technology manufacturing (minerals, metals, plastic and rubber; textile; and leather and substitute materials), the quality of product is the factor viewed as the most relevant to explain BP in Mexican SME.


Sign in / Sign up

Export Citation Format

Share Document