Data science skills: Building partnership for efficient school curriculum delivery in Africa

2020 ◽  
Vol 36 ◽  
pp. 49-62
Author(s):  
Nureni Olawale Adeboye ◽  
Peter Osuolale Popoola ◽  
Oluwatobi Nurudeen Ogunnusi

Data science is a concept that unifies statistics, data analysis, machine learning, and their related methods in order to analyze real phenomena with data and understand them better. This article investigates the acquisition of data science skills in building partnership for efficient school curriculum delivery in Africa, especially in teaching statistics courses at the beginners' level in tertiary institutions. Illustrations were made using big data from 18 selected African countries, sourced from the United Nations Educational, Scientific and Cultural Organization (UNESCO), with special focus on some macro-economic variables that drive economic policy. Data description techniques were applied to the sourced open data with the aid of R analytics software for data science, as an improvement on the traditional methods of data description for learning, thus opening a new charter of education curriculum delivery in African schools. Although the collaboration is not without its challenges, its prospects for creating a self-driven learning culture among students of tertiary institutions have greatly enhanced the quality of teaching, advanced students' skills in machine learning, and improved their understanding of the role of data in a global perspective and their ability to critique claims based on data.
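As a rough illustration of the kind of data description the abstract refers to, the sketch below computes a few summary statistics for a single indicator. The authors worked in R; this version uses Python's standard `statistics` module instead, and the `describe` helper and the sample values are hypothetical, not taken from the UNESCO data.

```python
import statistics


def describe(values):
    """Return basic descriptive statistics for a list of numbers."""
    ordered = sorted(values)
    # quantiles(n=4) returns the three quartile cut points.
    q1, _, q3 = statistics.quantiles(ordered, n=4)
    return {
        "n": len(ordered),
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        "stdev": statistics.stdev(ordered),  # sample standard deviation
        "min": ordered[0],
        "max": ordered[-1],
        "iqr": q3 - q1,
    }


# Made-up values standing in for one indicator across several countries.
sample = [2.0, 4.0, 4.0, 5.0, 7.0, 8.0]
summary = describe(sample)
for name, value in summary.items():
    print(f"{name}: {value}")
```

The same summary could be produced per country or per variable by calling `describe` on each column of the open data.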

2020 ◽  
Vol 5 (19) ◽  
pp. 104-122
Author(s):  
Azzan Amin ◽  
Haslina Arshad ◽  
Ummul Hanan Mohamad

Data visualization is viewed as a significant element in data analysis and communication. As data becomes more and more complex, visual presentation helps users understand it. So far, two-dimensional (2D) visuals have most often been used for data visualization, but the lack of a depth dimension leads to inefficient and limited understanding of the data. Therefore, the effectiveness of augmented reality (AR) in data visualization was studied through the development of an AR data visualization application using e-commerce data. Machine learning models are also used in the AR application to supply data through predictive analysis functions. To provide quality e-commerce data and an optimal machine learning model, the data science process is carried out in the Python programming language. The e-commerce data selected for this study is open data obtained from the Kaggle website. The dataset has 9,994 records and 21 attributes. The AR data visualization application will make it easier for users to understand the e-commerce data in depth through AR technology and to visualize sales profit forecasts based on the Auto-Regressive Integrated Moving Average (ARIMA) model.
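The abstract does not state which ARIMA order or library the authors used. As a minimal sketch of the idea only, the code below implements the simplest special case, ARIMA(1,1,0) without an intercept: difference the series once, fit a single autoregressive coefficient by least squares, and integrate the forecast differences back. The function names and the profit figures are hypothetical, and the fit assumes the differenced series is not all zeros.

```python
def difference(series):
    """First-order differencing: the 'I' (d=1) step of ARIMA."""
    return [b - a for a, b in zip(series, series[1:])]


def fit_ar1(diffs):
    """Least-squares estimate of phi in d[t] = phi * d[t-1] (no intercept)."""
    num = sum(cur * prev for prev, cur in zip(diffs, diffs[1:]))
    den = sum(prev * prev for prev in diffs[:-1])
    return num / den


def forecast(series, steps):
    """Forecast `steps` future values with an ARIMA(1,1,0)-style model."""
    diffs = difference(series)
    phi = fit_ar1(diffs)
    level, last_diff = series[-1], diffs[-1]
    out = []
    for _ in range(steps):
        last_diff = phi * last_diff  # AR(1) step on the differenced series
        level += last_diff           # undo the differencing
        out.append(level)
    return out


# Made-up sales-profit history; not the Kaggle data.
profits = [0.0, 8.0, 12.0, 14.0, 15.0]
predicted = forecast(profits, steps=2)
print(predicted)
```

A production forecast would normally rely on a tested time-series library and would select the model order from the data rather than fixing it in advance.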


Author(s):  
Kai R. Larsen ◽  
Daniel S. Becker

In Automated Machine Learning for Business, we teach the machine learning process using a new development in data science: automated machine learning. AutoML, when implemented properly, makes machine learning accessible to most people because it removes the need for years of experience in the most arcane aspects of data science, such as the math, statistics, and computer science skills required to become a top contender in traditional machine learning. Anyone trained in the use of AutoML can use it to test their ideas and support the quality of those ideas during presentations to management and stakeholder groups. Because the requisite investment is one semester-long undergraduate course rather than a year in a graduate program, these tools will likely become a core component of undergraduate programs, and over time, even the high school curriculum.


Smart Cities ◽  
2020 ◽  
Vol 3 (3) ◽  
pp. 657-675
Author(s):  
Richard B. Watson ◽  
Peter J. Ryan

Australian governments at all three levels—local (council), state, and federal—are beginning to exploit the massive amounts of data they collect through sensors and recording systems. Their aim is to enable Australian communities to benefit from “smart city” initiatives by providing greater efficiencies in their operations and strategic planning. Increasing numbers of datasets are being made freely available to the public. These so-called big data are amenable to data science analysis techniques including machine learning. While there are many cases of data use at the federal and state level, local councils are not taking full advantage of their data for a variety of reasons. This paper reviews the status of open datasets of Australian local governments and reports progress being made in several student and other projects to develop open data web services using machine learning for smart cities.


2020 ◽  
Author(s):  
Saeed Nosratabadi ◽  
Amir Mosavi ◽  
Puhong Duan ◽  
Pedram Ghamisi ◽  
Ferdinand Filip ◽  
...  

This paper provides a state-of-the-art investigation of advances in data science in emerging economic applications. The analysis covers novel data science methods in four classes: deep learning models, hybrid deep learning models, hybrid machine learning, and ensemble models. Application domains span a wide and diverse range of economics research, from the stock market, marketing, and e-commerce to corporate banking and cryptocurrency. The PRISMA method, a systematic literature review methodology, was used to ensure the quality of the survey. The findings reveal that the trends follow the advancement of hybrid models, which, based on the accuracy metric, outperform other learning algorithms. It is further expected that the trends will converge toward sophisticated hybrid deep learning models.


2020 ◽  
Author(s):  
Saeed Nosratabadi ◽  
Amir Mosavi ◽  
Puhong Duan ◽  
Pedram Ghamisi ◽  
Filip Ferdinand ◽  
...  

Author(s):  
Ritu Khandelwal ◽  
Hemlata Goyal ◽  
Rajveer Singh Shekhawat

Introduction: Machine learning is an intelligent technology that works as a bridge between businesses and data science. With the involvement of data science, the business goal is to draw valuable insights from the available data. Bollywood, a multi-million dollar industry, makes up a large part of Indian cinema. This paper attempts to predict whether an upcoming Bollywood movie will be a Blockbuster, Superhit, Hit, Average, or Flop. For this, machine learning techniques (classification and prediction) are applied. The first step in building a classifier or prediction model is the learning stage, in which a training data set is used to train the model with some technique or algorithm; rules are then generated that help to build a model and predict future trends in different types of organizations. Methods: Classification and prediction techniques such as Support Vector Machine (SVM), Random Forest, Decision Tree, Naïve Bayes, Logistic Regression, AdaBoost, and KNN are applied to find efficient and effective results. All of these can be applied through GUI-based workflows organized into categories such as Data, Visualize, Model, and Evaluate. Result: The first step in building a classifier or prediction model is the learning stage, in which a training data set is used to train the model with some technique or algorithm; rules are then generated that help to build a model and predict future trends in different types of organizations. Conclusion: This paper focuses on a comparative analysis based on parameters such as accuracy and the confusion matrix to identify the best possible model for predicting movie success. Using advertisement propaganda, production houses can plan the best time to release a movie according to the predicted success rate to gain higher benefits.
Discussion: Data mining is the process of discovering patterns in large data sets; from these patterns, relationships are also discovered that help to solve business problems and predict forthcoming trends. This prediction can help production houses with advertisement propaganda and with planning their costs, and by securing these factors they can make a movie more profitable.
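The comparison by accuracy and confusion matrix described in the Conclusion can be sketched as follows. This is an illustrative example only: the two sets of predictions are invented, the names `model_a` and `model_b` are hypothetical, and no actual classifier from the paper is trained here.

```python
from collections import Counter

# The five success classes named in the abstract.
LABELS = ["Blockbuster", "Superhit", "Hit", "Average", "Flop"]


def confusion_matrix(actual, predicted, labels):
    """Rows are actual classes, columns are predicted classes."""
    counts = Counter(zip(actual, predicted))
    return [[counts[(a, p)] for p in labels] for a in labels]


def accuracy(actual, predicted):
    """Fraction of predictions that match the actual class."""
    correct = sum(a == p for a, p in zip(actual, predicted))
    return correct / len(actual)


# Invented ground truth and predictions for six movies.
actual = ["Hit", "Flop", "Flop", "Average", "Blockbuster", "Superhit"]
predictions = {
    "model_a": ["Hit", "Flop", "Average", "Average", "Blockbuster", "Hit"],
    "model_b": ["Hit", "Flop", "Flop", "Hit", "Superhit", "Hit"],
}

scores = {name: accuracy(actual, preds) for name, preds in predictions.items()}
best = max(scores, key=scores.get)
matrix = confusion_matrix(actual, predictions[best], LABELS)

print(scores, best)
for label, row in zip(LABELS, matrix):
    print(f"{label:12s} {row}")
```

Accuracy alone can hide which classes a model confuses, which is why the matrix is inspected alongside it.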


2021 ◽  
Vol 3 (2) ◽  
pp. 392-413
Author(s):  
Stefan Studer ◽  
Thanh Binh Bui ◽  
Christian Drescher ◽  
Alexander Hanuschkin ◽  
Ludwig Winkler ◽  
...  

Machine learning is an established and frequently used technique in industry and academia, but a standard process model to improve the success and efficiency of machine learning applications is still missing. Project organizations and machine learning practitioners face manifold challenges and risks when developing machine learning applications and need guidance to meet business expectations. This paper therefore proposes a process model for the development of machine learning applications, covering six phases from defining the scope to maintaining the deployed machine learning application. Business and data understanding are executed simultaneously in the first phase, as both have considerable impact on the feasibility of the project. The next phases comprise data preparation, modeling, evaluation, and deployment. Special focus is applied to the last phase, as a model running in changing real-time environments requires close monitoring and maintenance to reduce the risk of performance degradation over time. For each task of the process, this work proposes a quality assurance methodology suited to addressing challenges in machine learning development that are identified in the form of risks. The methodology is drawn from practical experience and scientific literature, and has proven to be general and stable. The process model expands on CRISP-DM, a data mining process model that enjoys strong industry support but fails to address machine learning specific tasks. The presented work proposes an industry- and application-neutral process model tailored for machine learning applications with a focus on technical tasks for quality assurance.


Author(s):  
Sumi Helal ◽  
Flavia C. Delicato ◽  
Cintia B. Margi ◽  
Satyajayant Misra ◽  
Markus Endler
