Descriptive and Predictive Analytical Methods for Big Data

Author(s):  
Sema A. Kalaian ◽  
Rafa M. Kasim ◽  
Nabeel R. Kasim

Data analytics and modeling are powerful analytical tools for knowledge discovery through examining and capturing the complex and hidden relationships and patterns among the quantitative variables in the existing massive structured Big Data in efforts to predict future enterprise performance. The main purpose of this chapter is to present a conceptual and practical overview of some of the basic and advanced analytical tools for analyzing structured Big Data. The chapter covers descriptive and predictive analytical methods. Descriptive analytical tools such as mean, median, mode, variance, standard deviation, and data visualization methods (e.g., histograms, line charts) are covered. Predictive analytical tools for analyzing Big Data such as correlation, simple- and multiple- linear regression are also covered in the chapter.

Web Services ◽  
2019 ◽  
pp. 314-331 ◽  
Author(s):  
Sema A. Kalaian ◽  
Rafa M. Kasim ◽  
Nabeel R. Kasim

Data analytics and modeling are powerful analytical tools for knowledge discovery through examining and capturing the complex and hidden relationships and patterns among the quantitative variables in the existing massive structured Big Data in efforts to predict future enterprise performance. The main purpose of this chapter is to present a conceptual and practical overview of some of the basic and advanced analytical tools for analyzing structured Big Data. The chapter covers descriptive and predictive analytical methods. Descriptive analytical tools such as mean, median, mode, variance, standard deviation, and data visualization methods (e.g., histograms, line charts) are covered. Predictive analytical tools for analyzing Big Data such as correlation, simple- and multiple- linear regression are also covered in the chapter.


1993 ◽  
Vol 57 (1) ◽  
pp. 99-104 ◽  
Author(s):  
J. C. Williams

AbstractThe following goat lactation model was fitted (using non-linear regression) to 407 lactations from five commercial goat dairies and one Research Institute goat herd: y = A exp (B(l + n'/2)n' + Cn' 2 - 1·01/n) where y = daily yield in kg; n = day of lactation (post parturition); and n' = (n -150)1100.Influence of farm, parity and season on the parameter estimates for 376 individual lactations was studied, using multiple linear regression. The models adopted were of the form: A = 1·366 + 1·122 × parity - 0·137 × parity2; ln(-B) = - 1·711 + 0·107 × parity + 0·512 season one; C = 0·037, with a standard deviation for A of 0·658, for ln(-B) of 0·636 and for C of 0·127.Influence of litter size on parameters was investigated for the Research Institute herd. There was no evidence of an effect on any of the model parameters.


Author(s):  
Nirmit Singhal ◽  
Amita Goel, ◽  
Nidhi Sengar ◽  
Vasudha Bahl

The world generated 52 times the amount of data in 2010 and 76 times the number of information sources in 2022. The ability to use this data creates enormous opportunities, and in order to make these opportunities a reality, people must use data to solve problems. Unfortunately, in the midst of a global pandemic, when people all over the world seek reliable, trustworthy information about COVID-19 (Coronavirus). Tableau plays a key role in this scenario because it is an extremely powerful tool for quickly visualizing large amounts of data. It has a simple drag-and-drop interface. Beautiful infographics are simple to create and take little time. Tableau works with a wide variety of data sources. COVID-19 (Coronavirus)analytics with Tableau will allow you to create dashboards that will assist you. Tableau is a tool that deals with big data analytics and generates output in a visualization technique, making it more understandable and presentable. Data blending, real-time reporting, and data collaboration are one of its features. Ultimately, this paper provides a clear picture of the growing COVID19 (Coronavirus) data and the tools that can assist more effectively, accurately, and efficiently. Keywords: Data Visualization, Tableau, Data Analysis, Covid-19 analysis, Covid-19 data


2020 ◽  
Author(s):  
Akram Kahforoushan ◽  
Shirin Hasanpour ◽  
Mojgan Mirghafourvand

Abstract BackgroundLate preterm infants suffer from many short-term and long-term problems after birth. The key factor in fighting these problems is effective breastfeeding. The present study aimedto determine the breastfeeding self-efficacy and its relationship with the perceived stress and breastfeeding performance in mothers with late preterm infants. MethodsIn this prospective study, 171 nursing mothers with late preterm infants born in Alzahra Medical Center of Tabriz, Iran, who met the conditions of this study were selected through convenience sampling. The Breastfeeding Self-Efficacy Scale-Short Form (BSES- SF) was employed to measure breastfeeding self-efficacy and 14-item Perceived Stress Scale (PSS14) was used to measure the perceived stress during 24 hours after giving birth and when the child was 4 months old the breastfeeding performance was measured by the standard breastfeeding performance questionnaire. The data were analyzed by Pearson and Spearman’s correlation tests, independent t-test, one-way ANOVA, and Multiple Linear Regression.ResultsThe mean (standard deviation) of breastfeeding self-efficacy equaled 50.0 (7.8) from the scores ranging between13-65 and the mean (standard deviation) of the perceived stress equaled to 26.5 (8.8) from the scores ranging between 0-56. The median (25-75 percentiles) of breastfeeding performance score in the mothers equaled 2.0 (1.0 to 3.0) from the scores ranging between 0-6. On the basis of multiple linear regression and through adjusting the personal-social characteristic, by increasing the score of the breastfeeding self-efficacy, the perceived stress was decreased to a statistically significant amount (B=-0.1, 95%CI=-0.3 to 0.0), however, there was no statistically significant relationship between breastfeeding self-efficacy and breastfeeding performance (p=0.418). ConclusionDue to the modifiable variability of breastfeeding self-efficacy and its role in perceived maternal stress, the development of appropriate strategies to further increase breastfeeding self-efficacy and provide more support to these mothers and infants is of particular importance.


Author(s):  
Rajganesh Nagarajan ◽  
Ramkumar Thirunavukarasu

In this chapter, the authors consider different categories of data, which are processed by the big data analytics tools. The challenges with respect to the big data processing are identified and a solution with the help of cloud computing is highlighted. Since the emergence of cloud computing is highly advocated because of its pay-per-use concept, the data processing tools can be effectively deployed within cloud computing and certainly reduce the investment cost. In addition, this chapter talks about the big data platforms, tools, and applications with data visualization concept. Finally, the applications of data analytics are discussed for future research.


Author(s):  
Mohd Imran ◽  
Mohd Vasim Ahamad ◽  
Misbahul Haque ◽  
Mohd Shoaib

The term big data analytics refers to mining and analyzing of the voluminous amount of data in big data by using various tools and platforms. Some of the popular tools are Apache Hadoop, Apache Spark, HBase, Storm, Grid Gain, HPCC, Casandra, Pig, Hive, and No SQL, etc. These tools are used depending on the parameter taken for big data analysis. So, we need a comparative analysis of such analytical tools to choose best and simpler way of analysis to gain more optimal throughput and efficient mining. This chapter contributes to a comparative study of big data analytics tools based on different aspects such as their functionality, pros, and cons based on characteristics that can be used to determine the best and most efficient among them. Through the comparative study, people are capable of using such tools in a more efficient way.


2022 ◽  
pp. 622-631
Author(s):  
Mohd Imran ◽  
Mohd Vasim Ahamad ◽  
Misbahul Haque ◽  
Mohd Shoaib

The term big data analytics refers to mining and analyzing of the voluminous amount of data in big data by using various tools and platforms. Some of the popular tools are Apache Hadoop, Apache Spark, HBase, Storm, Grid Gain, HPCC, Casandra, Pig, Hive, and No SQL, etc. These tools are used depending on the parameter taken for big data analysis. So, we need a comparative analysis of such analytical tools to choose best and simpler way of analysis to gain more optimal throughput and efficient mining. This chapter contributes to a comparative study of big data analytics tools based on different aspects such as their functionality, pros, and cons based on characteristics that can be used to determine the best and most efficient among them. Through the comparative study, people are capable of using such tools in a more efficient way.


Author(s):  
Sam Goundar ◽  
Akashdeep Bhardwaj ◽  
Shavindar Singh ◽  
Mandeep Singh ◽  
Gururaj H. L.

Big data is emerging, and the latest developments in technology have spawned enormous amounts of data. The traditional databases lack the capabilities to handle this diverse data and thus has led to the employment of new technologies, methods, and tools. This research discusses big data, the available big data analytical tools, the need to use big data analytics with its benefits and challenges. Through a research drawing on survey questionnaires, observation of the business processes, interviews and secondary research methods, the organizations, and companies in a small island state are identified to survey which of them use analytical tools to handle big data and the benefits it proposes to these businesses. Organizations and companies that do not use these tools were also surveyed and reasons were outlined as to why these organizations hesitate to utilize such tools.


Sign in / Sign up

Export Citation Format

Share Document