Industrial growth in sub-Saharan Africa: evidence from machine learning with insights from nightlight satellite images

Purpose This study uses machine learning techniques to assess industrial development in Africa. Design/methodology/approach This study uses nighttime light data and machine learning techniques to assess industrial development in Africa. Findings This study provides evidence on how machine learning techniques and nightlight data can be used to assess economic development in places where subnational data are missing or imprecise. Taken together, the research confirms four groups of important determinants of industrial growth: natural resources, agriculture growth, institutions and manufacturing imports. Our findings indicate that Africa should follow a more multisector approach to development, putting natural resources and agriculture productivity growth at the forefront. Originality/value Studies on the use of machine learning (with insights from nightlight satellite images) to assess industrial development in Africa are sparse.

Feasibility study of automatically performing the concrete delivery dispatching through machine learning techniques

Engineering Construction & Architectural Management ◽

10.1108/ecam-06-2014-0081 ◽

2015 ◽

Vol 22 (5) ◽

pp. 573-590 ◽

Cited By ~ 14

Author(s):

Mojtaba Maghrebi ◽

Claude Sammut ◽

S. Travis Waller

Keyword(s):

Machine Learning ◽

Human Resources ◽

Design Methodology ◽

Machine Learning Techniques ◽

Mixing Process ◽

Content Type ◽

Practical Solution ◽

Learning Techniques ◽

Ready Mixed Concrete ◽

Practical Implications

Purpose – The purpose of this paper is to study the implementation of machine learning (ML) techniques in order to automatically measure the feasibility of performing ready mixed concrete (RMC) dispatching jobs. Design/methodology/approach – Six ML techniques were selected and tested on data that were extracted from a developed simulation model and answered by a human expert. Findings – The results show that most of the selected algorithms performed similarly and achieved an accuracy of around 80 per cent for the examined cases. Practical implications – This approach can be applied in practice to match experts’ decisions. Originality/value – In this paper the feasibility of handling complex concrete delivery problems by ML techniques is studied. Currently, most of the concrete mixing process is done by machines. However, RMC dispatching still relies on human resources to complete many tasks. In this paper the authors aim to reconstruct experts’ decisions as the only practical solution.

Malicious web domain identification using online credibility and performance data by considering the class imbalance issue

Industrial Management & Data Systems ◽

10.1108/imds-02-2018-0072 ◽

2019 ◽

Vol 119 (3) ◽

pp. 676-696 ◽

Cited By ~ 5

Author(s):

Zhongyi Hu ◽

Raymond Chiong ◽

Ilung Pranata ◽

Yukun Bao ◽

Yuqing Lin

Keyword(s):

Machine Learning ◽

Class Imbalance ◽

Performance Data ◽

Machine Learning Techniques ◽

Data Sets ◽

Real World Data ◽

Content Type ◽

Domain Identification ◽

Learning Techniques ◽

And Performance

Purpose Malicious web domain identification is of significant importance to the security protection of internet users. With online credibility and performance data, the purpose of this paper is to investigate the use of machine learning techniques for malicious web domain identification by considering the class imbalance issue (i.e. there are more benign web domains than malicious ones). Design/methodology/approach The authors propose an integrated resampling approach to handle class imbalance by combining the synthetic minority oversampling technique (SMOTE) and particle swarm optimisation (PSO), a population-based meta-heuristic algorithm. The authors use SMOTE for oversampling and PSO for undersampling. Findings By applying eight well-known machine learning classifiers, the proposed integrated resampling approach is comprehensively examined using several imbalanced web domain data sets with different imbalance ratios. Compared to five other well-known resampling approaches, experimental results confirm that the proposed approach is highly effective. Practical implications This study not only inspires the practical use of online credibility and performance data for identifying malicious web domains but also provides an effective resampling approach for handling the class imbalance issue in the area of malicious web domain identification. Originality/value Online credibility and performance data are applied to build malicious web domain identification models using machine learning techniques. An integrated resampling approach is proposed to address the class imbalance issue. The performance of the proposed approach is confirmed based on real-world data sets with different imbalance ratios.
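The oversampling half of the approach described above can be illustrated with a minimal sketch of the SMOTE idea: each synthetic minority sample is an interpolation between an existing minority sample and one of its nearest minority neighbours. This is not the authors' implementation. The function name `smote`, the parameter `k`, and the toy feature vectors are assumptions made for illustration, and the PSO-based undersampling step is not shown.

```python
import random


def smote(minority, n_new, k=2, seed=0):
    """Generate n_new synthetic points from minority-class samples (illustrative only)."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority samples by squared Euclidean distance to base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        # Pick one of the k nearest neighbours and interpolate towards it.
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(base, neighbour)))
    return synthetic


# Hypothetical two-feature minority class (e.g. malicious domains).
minority = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.3)]
new_points = smote(minority, n_new=6)
```

Because every synthetic point lies on a segment between two real minority samples, the new points stay inside the region already occupied by the minority class rather than duplicating existing rows.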

Enhancing alpine glacial lakes detection and mapping using multi-source data and machine learning techniques

10.5194/egusphere-egu2020-21811 ◽

2020 ◽

Author(s):

Sonam Wangchuk ◽

Tobias Bolch

Keyword(s):

Machine Learning ◽

Random Forest ◽

Satellite Images ◽

Random Forest Classifier ◽

Machine Learning Techniques ◽

Glacial Lake ◽

Glacial Lakes ◽

Alpine Regions ◽

Learning Techniques ◽

Source Data

Accurate detection and mapping of glacial lakes in Alpine regions such as the Himalayas, the Alps and the Andes are challenged by many factors. These factors include 1) the small size of glacial lakes, 2) cloud cover in optical satellite images, 3) cast shadows from mountains and clouds, 4) seasonal snow in satellite images, 5) varying degrees of turbidity amongst glacial lakes, and 6) frozen glacial lake surfaces. In our study, we propose a fully automated approach that overcomes most of the above-mentioned challenges to detect and map glacial lakes accurately using multi-source data and machine learning techniques such as the random forest classifier algorithm. The multi-source data are from the Sentinel-1 Synthetic Aperture Radar data (radar backscatter), the Sentinel-2 multispectral instrument data (NDWI), and the SRTM digital elevation model (slope). We use these data as inputs for the rule-based segmentation of potential glacial lakes, where decision rules are implemented from the expert system. The potential glacial lake polygons are then classified either as glacial lakes or non-glacial lakes by the trained and tested random forest classifier algorithm. The performance of the method was assessed in eight test sites located across the Alpine regions (e.g. the Boshula mountain range and Koshi basin in the Himalayas, the Tajiks Pamirs, the Swiss Alps and the Peruvian Andes) of the world. We show that the proposed method performs efficiently irrespective of geographic, geologic, climatic, and glacial lake conditions.
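The NDWI input mentioned above can be sketched as follows. The abstract does not state which NDWI formulation or threshold was used, so this example assumes the common green/near-infrared form, NDWI = (Green - NIR) / (Green + NIR), and an illustrative threshold of 0.0. The function names and the toy reflectance values are hypothetical.

```python
def ndwi(green, nir):
    """Normalised Difference Water Index for one pixel (green/NIR form, assumed)."""
    total = green + nir
    if total == 0:
        return 0.0  # avoid division by zero on empty pixels
    return (green - nir) / total


def water_mask(green_band, nir_band, threshold=0.0):
    """Flag pixels whose NDWI exceeds the threshold as potential water."""
    return [
        [ndwi(g, n) > threshold for g, n in zip(green_row, nir_row)]
        for green_row, nir_row in zip(green_band, nir_band)
    ]


# Toy 2x2 reflectance grids: water reflects more green than NIR.
green_band = [[0.50, 0.10], [0.25, 0.20]]
nir_band = [[0.25, 0.40], [0.05, 0.20]]
mask = water_mask(green_band, nir_band)
```

In a workflow like the one described, a mask of this kind would only propose candidate lake pixels; the radar backscatter, slope rules, and the random forest classifier would then be needed to reject shadows, snow, and other false positives.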

Analysis of Crime Report by Data Analytics Using Python

Advances in Data Mining and Database Management - Challenges and Applications of Data Analytics in Social Perspectives ◽

10.4018/978-1-7998-2566-1.ch003 ◽

2021 ◽

pp. 54-79

Author(s):

G. Maria Jones ◽

S. Godfrey Winster

Keyword(s):

Machine Learning ◽

New Technologies ◽

Rapid Development ◽

Performance Measure ◽

Machine Learning Techniques ◽

Complex Data ◽

Time Data ◽

Related Data ◽

Learning Techniques ◽

And Performance

The rapid development of technology in today's world has produced a dramatic explosion of data, and the accumulated volume has increased the amount of data computation compared with the recent past. To manage such complex data, emerging technologies are being used specifically to identify crime patterns, as crime-related data is escalating. These digital technologies have the potential to manipulate and also alter the pattern. To combat this, machine learning techniques are introduced, which have the ability to analyse such voluminous data. In this work, the authors intend to understand and implement machine learning techniques in real-time data analysis by means of Python. A detailed explanation is given of preparing the dataset, understanding and visualizing the data using pandas, and evaluating the performance of the algorithms.

Toward understanding the topical structure of hospitality literature

International Journal of Contemporary Hospitality Management ◽

10.1108/ijchm-11-2017-0714 ◽

2018 ◽

Vol 30 (11) ◽

pp. 3386-3411 ◽

Cited By ~ 7

Author(s):

Eunhye (Olivia) Park ◽

Bongsug Chae ◽

Junehee Kwon

Keyword(s):

Machine Learning ◽

Topic Modeling ◽

Research Field ◽

Intellectual Structure ◽

Machine Learning Techniques ◽

Research Topics ◽

Evolutionary Patterns ◽

Content Type ◽

Learning Techniques ◽

Academic Publications

Purpose This paper aims to identify the intellectual structure of four leading hospitality journals over 40 years by applying a mixed-method approach, using both machine learning and traditional statistical analyses. Design/methodology/approach Abstracts from all 4,139 articles published in four top hospitality journals were analyzed using structured topic modeling and inferential statistics. Topic correlation and community detection were applied to identify strengths of correlations and sub-groups of topics. Trend visualization and regression analysis were used to quantify the effects of the metadata (i.e. year of publication and journal) on topic proportions. Findings The authors found 50 topics and eight subgroups in the hospitality journals. Different evolutionary patterns in topic popularity were demonstrated, thereby providing insights into popular research topics over time. Significant differences in topical proportions were found across the four leading hospitality journals, suggesting different foci in research topics in each journal. Research limitations/implications Combining machine learning techniques with traditional statistics demonstrated potential for discovering valuable insights from big text data in hospitality and tourism research contexts. The findings of this study may serve as a guide to understand the trends in the research field as well as the progress of specific areas or subfields. Originality/value It is the first attempt to apply topic modeling to academic publications and explore the effects of article metadata within the hospitality literature.

Water quality monitoring: from conventional to emerging technologies

Water Science & Technology Water Supply ◽

10.2166/ws.2019.144 ◽

2019 ◽

Vol 20 (1) ◽

pp. 28-45

Author(s):

Umair Ahmed ◽

Rafia Mumtaz ◽

Hirra Anwar ◽

Sadaf Mumtaz ◽

Ali Mustafa Qamar

Keyword(s):

Machine Learning ◽

Water Quality ◽

Industrial Development ◽

Low Cost ◽

Research Work ◽

Water Quality Monitoring ◽

Machine Learning Techniques ◽

Rapid Urbanization ◽

Analysis Process ◽

Learning Techniques

Rapid urbanization and industrial development have resulted in water contamination and water quality deterioration at an alarming rate, making quick, inexpensive and accurate detection imperative. Conventional methods to measure water quality, including the manual analysis process carried out in a laboratory, are lengthy, expensive and inefficient. The research work in this paper approaches the problem from various perspectives, including the traditional methods of determining water quality, to gain insight into the problem, and the analysis of state-of-the-art technologies, including Internet of Things (IoT) and machine learning techniques, to address water quality. After analyzing the currently available solutions, this paper proposes an IoT-based low-cost system employing machine learning techniques to monitor water quality in real time, analyze water quality trends and detect anomalous events such as intentional contamination of water.

Oil spill classification from multi-spectral satellite images: exploring different machine learning techniques

10.1117/12.864556 ◽

2010 ◽

Cited By ~ 3

Author(s):

Linda Corucci ◽

Fabio Nardelli ◽

Marco Cococcioni

Keyword(s):

Machine Learning ◽

Oil Spill ◽

Satellite Images ◽

Machine Learning Techniques ◽

Learning Techniques

Early Detection of Cardiovascular Disease using Machine learning Techniques an Experimental Study

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c46570.99320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 635-641

Keyword(s):

Machine Learning ◽

Proper Time ◽

Daily Basis ◽

Machine Learning Algorithms ◽

The Other ◽

Machine Learning Techniques ◽

Entire Body ◽

Time Data ◽

Learning Techniques ◽

Health Related

The human body prioritizes the heart as the second most important organ after the brain. Any disruption in the heart ultimately leads to disruption of the entire body. As members of the modern era, we experience enormous changes on a daily basis that impact our lives in one way or another. Heart disease is among the top five fatal diseases and has been claiming lives worldwide. Therefore, the prediction of this disease is of prime importance, as it enables one to take a proper and needful approach at the proper time. Data mining and machine learning extract and refine useful information from a massive amount of data. This is a basic and primary process in defining and discovering useful information and hidden patterns from databases. The flexibility and adaptability of optimization algorithms make them useful in dealing with complex nonlinear problems. Machine learning techniques are used in medical sciences to solve real health-related issues through early prediction and treatment of various diseases. In this paper, six machine learning algorithms are used and then compared based on an evaluation of their performance. Among all classifiers, the decision tree outperforms the other algorithms with a testing accuracy of 97.29%.

An intrusion detection system for health-care system using machine and deep learning

World Journal of Engineering ◽

10.1108/wje-04-2021-0204 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Sagar Pande ◽

Aditya Khamparia ◽

Deepak Gupta

Keyword(s):

Machine Learning ◽

Health Care ◽

Deep Learning ◽

Network Security ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Detection System ◽

Machine Learning Techniques ◽

Content Type ◽

Learning Techniques

Purpose One of the key components of a health care–based system is a reliable intrusion detection system. Traditional techniques are not adequate to handle complex data. Also, the diversified intrusion techniques cannot meet current network requirements. Not only is the volume of data growing, but attacks are also increasing very rapidly. Deep learning and machine learning techniques are trending research topics in the area of network security. A lot of work has been done in this area, but the combination of evolutionary algorithms with machine learning is still rarely explored. The purpose of this study is to provide a novel deep learning framework for the detection of attacks. Design/methodology/approach In this paper, a novel deep learning framework is proposed for the detection of attacks. Also, a comparison of machine learning and deep learning algorithms is provided. Findings The obtained results are more than 99% for both data sets. Research limitations/implications The diversified intrusion techniques cannot meet current network requirements. Practical implications The volume of data is growing and attacks are also increasing very rapidly. Social implications Deep learning and machine learning techniques are trending research topics in the area of network security. Originality/value A novel deep learning framework is proposed for the detection of attacks.

A segmented machine learning modeling approach of social media for predicting occupancy

International Journal of Contemporary Hospitality Management ◽

10.1108/ijchm-06-2020-0611 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Cited By ~ 1

Author(s):

Apostolos Ampountolas ◽

Mark P. Legg

Keyword(s):

Machine Learning ◽

Social Media ◽

Life Cycles ◽

Machine Learning Techniques ◽

Learning Approach ◽

Negative Effects ◽

Data Set ◽

Content Type ◽

Learning Techniques ◽

The Usa

Purpose This study aims to predict hotel demand through text analysis by investigating keyword series to increase demand predictions’ precision. To do so, this paper presents a framework for modeling hotel demand that incorporates machine learning techniques. Design/methodology/approach The empirical forecasting is conducted by introducing a segmented machine learning approach that leverages hierarchical clustering tied to machine learning and deep learning techniques. These features allow the model to yield more precise estimates. This study evaluates an extensive range of social media–derived words with the most significant probability of gradually establishing an understanding of an optimal outcome. Analyses were performed on a major hotel chain in an urban market setting within the USA. Findings The findings indicate that while traditional methods, namely the naïve approach and ARIMA models, struggled with forecasting accuracy, segmented boosting methods (XGBoost) leveraging social media predict hotel occupancy with greater precision for all examined time horizons. Additionally, the segmented learning approach improved the forecasts’ stability and robustness while mitigating common overfitting issues within a highly dimensional data set. Research limitations/implications Incorporating social media into a segmented learning framework can augment the current generation of forecasting methods’ accuracy. Moreover, the segmented learning approach mitigates the negative effects of market shifts (e.g. COVID-19) that can reduce in-production forecasts’ life-cycles. The ability to be more robust to market deviations will allow hospitality firms to minimize development time. Originality/value The results are expected to generate insights by providing revenue managers with an instrument for predicting demand.
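For readers unfamiliar with the naïve baseline mentioned in the findings, the sketch below shows one common reading of it: every future period is forecast as the last observed value, and accuracy is scored with the mean absolute error. The abstract does not specify the exact naïve variant or error metric used, so both are assumptions, and the occupancy figures are hypothetical values chosen for illustration.

```python
def naive_forecast(history, horizon):
    """Repeat the last observed value for each step of the forecast horizon."""
    return [history[-1]] * horizon


def mean_absolute_error(actual, predicted):
    """Average absolute gap between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical daily occupancy percentages (not taken from the study).
occupancy = [70, 72, 75, 80, 78]
train, test = occupancy[:3], occupancy[3:]

forecast = naive_forecast(train, len(test))
mae = mean_absolute_error(test, forecast)
```

A baseline of this kind gives a floor against which richer models, such as the segmented boosting approach described above, can be compared over the same horizon.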
