SPATIAL PLANNING TEXT INFORMATION PROCESSING WITH USE OF MACHINE LEARNING METHODS

Abstract. Spatial development plans provide important information on future land development capabilities. Unfortunately, access to planning information in Poland is currently limited. Despite many initiatives to standardize planning documents, no standard for recording plans has yet been developed. Each planning area has a symbol and a land use category, and these differ from plan to plan. For this reason, it is very difficult to carry out an analysis that aggregates all areas with the same development function. The authors conduct experiments aimed at using machine learning methods to process and classify the text part of plans. The main aim was to find the best method for grouping the texts of zones with the same land use. The experiment attempts to automatically classify the texts of findings for individual areas into 10 defined categories of land use. This makes it possible to predict the future land use function from a zone's text regulation and to aggregate all zones with a specific land use type. To classify the heterogeneous planning information, the authors used the k-means algorithm and artificial neural networks. The main challenge, however, was not the design of the classification tool but the preprocessing of the text. The paper presents an approach to text preprocessing as well as selected methods of text classification. The results indicate that CNNs are better suited to the problem presented. K-means clustering produces clusters in which texts are not grouped by land use function, which is not useful for zone aggregation.
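
To make the clustering step concrete, the following is a minimal sketch of k-means (Lloyd's algorithm) applied to toy term-count vectors. It is not the authors' implementation: the four-word vocabulary, the counts, and the choice to start the centroids at the first k points are all assumptions made for illustration.

```python
import math

def k_means(points, k, iterations=20):
    """Plain Lloyd's algorithm; centroids start at the first k points (an assumption)."""
    centroids = [list(p) for p in points[:k]]
    assignments = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return assignments, centroids

# Hypothetical counts of ("residential", "housing", "industrial", "warehouse")
# in four zone texts; real plan texts would need proper preprocessing first.
zone_vectors = [(3, 2, 0, 0), (0, 0, 2, 3), (2, 3, 0, 0), (0, 1, 3, 2)]
assignments, centroids = k_means(zone_vectors, k=2)
```

On this toy input the two clusters happen to line up with the two vocabularies. The abstract reports that on real plan texts the clusters did not follow land use function, which is the limitation of the unsupervised approach that the authors point out.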

Identifying Spatiotemporal Patterns in Land Use and Cover Samples from Satellite Image Time Series

Remote Sensing ◽

10.3390/rs13050974 ◽

2021 ◽

Vol 13 (5) ◽

pp. 974

Author(s):

Lorena Alves Santos ◽

Karine Ferreira ◽

Michelle Picoli ◽

Gilberto Camara ◽

Raul Zurita-Milla ◽

...

Keyword(s):

Machine Learning ◽

Land Use ◽

Time Series ◽

Satellite Image ◽

Spatiotemporal Patterns ◽

Supervised Machine Learning ◽

Learning Methods ◽

Self Organizing Maps ◽

Machine Learning Methods ◽

Land Use And Cover

The use of satellite image time series analysis and machine learning methods brings new opportunities and challenges for land use and cover change (LUCC) mapping over large areas. One of these challenges is the need for samples that properly represent the high variability of land use and cover classes over large areas, both to train supervised machine learning methods and to produce accurate LUCC maps. This paper addresses this challenge and presents a method to identify spatiotemporal patterns in land use and cover samples in order to infer subclasses through the phenological and spectral information provided by satellite image time series. The proposed method uses self-organizing maps (SOMs) to reduce the data dimensionality, creating primary clusters. From these primary clusters, it uses hierarchical clustering to create subclusters that recognize the intra-class variability intrinsic to different regions and periods, mainly in large areas and over multiple years. To show how the method works, we use MODIS image time series associated with samples of cropland and pasture classes over the Cerrado biome in Brazil. The results show that the proposed method is suitable for identifying spatiotemporal patterns in land use and cover samples that can be used to infer subclasses, mainly for crop types.

A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem

Business Systems Research Journal ◽

10.2478/bsrj-2014-0021 ◽

2014 ◽

Vol 5 (3) ◽

pp. 82-96 ◽

Cited By ~ 3

Author(s):

Marijana Zekić-Sušac ◽

Sanja Pfeifer ◽

Nataša Šarlija

Keyword(s):

Neural Network ◽

Machine Learning ◽

Classification Accuracy ◽

Classification Problem ◽

High Dimensional ◽

Nearest Neighbour ◽

Learning Methods ◽

Machine Learning Methods ◽

Dimensional Classification ◽

Artificial Neural

Abstract Background: Large-dimensional data modelling often relies on variable reduction methods in the pre-processing and in the post-processing stage. However, such a reduction usually provides less information and yields a lower accuracy of the model. Objectives: The aim of this paper is to assess the high-dimensional classification problem of recognizing entrepreneurial intentions of students by machine learning methods. Methods/Approach: Four methods were tested on the same dataset: artificial neural networks, CART classification trees, support vector machines, and k-nearest neighbour, in order to compare their efficiency in terms of classification accuracy. The performance of each method was compared on ten subsamples in a 10-fold cross-validation procedure in order to compute the sensitivity and specificity of each model. Results: The artificial neural network model based on a multilayer perceptron yielded a higher classification rate than the models produced by the other methods. The pairwise t-test showed a statistically significant difference between the artificial neural network and the k-nearest neighbour model, while the differences among the other methods were not statistically significant. Conclusions: The tested machine learning methods are able to learn fast and achieve high classification accuracy. However, further advancement can be achieved by testing a few additional methodological refinements in machine learning methods.
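
The evaluation procedure described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' code: the fold splitter uses contiguous, unshuffled folds, and the labels and predictions are made-up values chosen only to show how sensitivity and specificity are computed.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for k contiguous, unshuffled folds."""
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n_samples) if i < start or i >= start + size]
        yield train, test
        start += size

def sensitivity_specificity(y_true, y_pred):
    """Binary labels: 1 is the positive class, 0 the negative class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return sensitivity, specificity

# Ten folds over a hypothetical dataset of 25 samples.
folds = list(k_fold_indices(25, 10))

# Hypothetical true labels and model predictions for one test fold.
y_true = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]
sens, spec = sensitivity_specificity(y_true, y_pred)
```

In practice each model would be fitted on the training indices of a fold and scored on its test indices, with the per-fold scores then compared across models.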

Using an Ensemble Learning Approach on Traditional Machine Learning Methods to Solve a Multi-Label Classification Problem

Proceedings of the International Conference on Paradigms of Computing, Communication and Data Sciences - Algorithms for Intelligent Systems ◽

10.1007/978-981-15-7533-4_60 ◽

2021 ◽

pp. 761-772

Author(s):

Siddharth Basu ◽

Sanjay Kumar ◽

Sirjanpreet Singh Banga ◽

Harshit Garg

Keyword(s):

Machine Learning ◽

Ensemble Learning ◽

Classification Problem ◽

Learning Approach ◽

Learning Methods ◽

Machine Learning Methods

Cellular automata model based on machine learning methods for simulating land use change

Proceedings of the 2012 Winter Simulation Conference (WSC) ◽

10.1109/wsc.2012.6465098 ◽

2012 ◽

Cited By ~ 1

Author(s):

Omar Charif ◽

Reine-Maria Basse ◽

Hichem Omrani ◽

Philippe Trigano

Keyword(s):

Machine Learning ◽

Land Use ◽

Cellular Automata ◽

Land Use Change ◽

Learning Methods ◽

Cellular Automata Model ◽

Machine Learning Methods ◽

Model Based

Development and Application of Earth Observation Based Machine Learning Methods for Characterizing Forest and Land Cover Change in Dilijan National Park of Armenia between 1991 and 2019

Remote Sensing ◽

10.3390/rs13152942 ◽

2021 ◽

Vol 13 (15) ◽

pp. 2942

Author(s):

Nathalie Morin ◽

Antoine Masse ◽

Christophe Sannier ◽

Martin Siklar ◽

Norman Kiesslich ◽

...

Keyword(s):

Machine Learning ◽

Land Use ◽

Land Cover ◽

National Park ◽

Forest Degradation ◽

Forest Monitoring ◽

Anthropogenic Pressure ◽

Learning Methods ◽

Forest Density ◽

Machine Learning Methods

Dilijan National Park is one of the most important national parks of Armenia, established in 2002 to protect its rich biodiversity of flora and fauna and to prevent illegal logging. The aim of this study is to provide, first, a mapping of forest degradation and deforestation and, second, a mapping of land cover/land use changes every 5 years over a 28-year monitoring cycle from 1991 to 2019, using Sentinel-2 and Landsat time series and machine learning methods. Very High Spatial Resolution imagery was used for calibration and validation of forest density modelling and related changes. The correlation coefficient R² between the forest density map and reference values ranges from 0.70 for the earliest epoch to 0.90 for the latest one. Land cover/land use classification yields good results, with most classes showing high users’ and producers’ accuracies above 80%. Although the forest degradation and deforestation that began about 30 years ago have been restrained thanks to protection measures, anthropogenic pressure remains a threat with the increase in settlements, tourism, and agriculture. This case study can be used as a decision-support tool by the Armenian Government for sustainable forest management and policies and can serve as a model for a future nationwide forest monitoring system.
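
The users' and producers' accuracies mentioned above are computed per class from a confusion matrix. The sketch below shows that calculation under an assumed layout (rows are reference classes, columns are mapped classes). The two-class matrix is invented for illustration and is not taken from the study.

```python
def users_producers_accuracy(confusion):
    """confusion[i][j] = number of samples with reference class i mapped as class j."""
    n = len(confusion)
    users, producers = [], []
    for c in range(n):
        correct = confusion[c][c]
        reference_total = sum(confusion[c])                    # row sum
        mapped_total = sum(confusion[r][c] for r in range(n))  # column sum
        # Producer's accuracy: share of reference samples of class c mapped correctly.
        producers.append(correct / reference_total if reference_total else 0.0)
        # User's accuracy: share of samples mapped as class c that truly are class c.
        users.append(correct / mapped_total if mapped_total else 0.0)
    return users, producers

# Hypothetical two-class matrix (for example forest / non-forest).
confusion = [
    [45, 5],
    [10, 40],
]
users, producers = users_producers_accuracy(confusion)
```

Producer's accuracy corresponds to recall and user's accuracy to precision, so a class can score well on one and poorly on the other.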

Application of machine learning methods for automated classification and routing in ITIL

Journal of Physics Conference Series ◽

10.1088/1742-6596/2091/1/012041 ◽

2021 ◽

Vol 2091 (1) ◽

pp. 012041

Author(s):

VV Nikulin ◽

S D Shibaikin ◽

A N Vishnyakov

Keyword(s):

Machine Learning ◽

Human Factor ◽

Gradient Boosting ◽

Automated Classification ◽

It Services ◽

Learning Methods ◽

Machine Learning Methods ◽

Text Information ◽

Comparison Of The Results

Abstract The article analyzes the application of machine learning methods for automated classification and routing in the ITIL library. ITSM technology and ITIL are considered, and definitions of an incident and of IT services are given. Information written in natural language is then vectorized and keywords are extracted using lemmatization and the TF-IDF measure. A comparative analysis of machine learning methods is given, together with a comparison of the results of automatic text classification using gradient boosting and a convolutional neural network. Various parameters of these methods are considered and the most effective machine learning method is determined. Using machine learning methods for automated classification of incidents allows high-precision routing of requests for restoring the operability of IT services, reducing response time and errors associated with the human factor.
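
As an illustration of the TF-IDF measure named in this abstract, here is a minimal sketch using the basic weighting tf × log(N / df). The whitespace tokenizer is a stand-in for real lemmatization, the incident texts are invented, and the article's actual weighting variant is not stated, so all of these are assumptions.

```python
import math
from collections import Counter

def tokenize(text):
    """Lowercase and split on whitespace; a stand-in for real lemmatization."""
    return text.lower().split()

def tf_idf(documents):
    """Return one {term: weight} dict per document using tf * log(N / df)."""
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors

# Hypothetical incident descriptions.
incidents = [
    "printer not printing",
    "email server not responding",
    "printer out of paper",
]
weights = tf_idf(incidents)
```

Terms that occur in fewer documents receive higher weights, so in the first incident "printing" outweighs "printer", which also appears in the third incident. The resulting vectors are what a classifier such as gradient boosting would take as input.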

APPLICATION OF MACHINE LEARNING METHODS TO SOLVE THE NLP TEXT CLASSIFICATION PROBLEM BASED ON ANALYSIS OF SEMANTICS OF NATURAL LANGUAGE

Вестник Алтайской академии экономики и права ◽

10.17513/vaael.1187 ◽

2020 ◽

Vol 2 (№6 2020) ◽

pp. 229-235

Author(s):

D.V. Zhel

Keyword(s):

Machine Learning ◽

Natural Language ◽

Text Classification ◽

Classification Problem ◽

Learning Methods ◽

Machine Learning Methods

A practical framework for predicting residential indoor PM2.5 concentration using land-use regression and machine learning methods

Chemosphere ◽

10.1016/j.chemosphere.2020.129140 ◽

2021 ◽

Vol 265 ◽

pp. 129140

Author(s):

Zhiyuan Li ◽

Xinning Tong ◽

Jason Man Wai Ho ◽

Timothy C.Y. Kwok ◽

Guanghui Dong ◽

...

Keyword(s):

Machine Learning ◽

Land Use ◽

Land Use Regression ◽

Learning Methods ◽

Machine Learning Methods ◽

Indoor Pm2.5 ◽

Pm2.5 Concentration

System Analysis of Financial Monitoring Subjects' Activities for the Country's Economic Security Ensuring

KnE Social Sciences ◽

10.18502/kss.v3i2.1575 ◽

2018 ◽

Vol 3 (2) ◽

pp. 444

Author(s):

Prikazchikova A.S. ◽

Prikazchikova G.S.

Keyword(s):

Machine Learning ◽

System Analysis ◽

Binary Classification ◽

Nearest Neighbors ◽

Classification Problem ◽

Economic Security ◽

Learning Methods ◽

K Nearest Neighbors ◽

Machine Learning Methods ◽

Credit Institutions

The article considers the binary classification problem for economic security objects, using credit institutions as an example, and proposes machine learning methods to solve it. The study showed that one such method, k-nearest neighbors, is suitable for this problem, with an efficiency of 84%. Key words: machine learning methods, financial statements, performance indicators, credit institutions, binary classification, k-nearest neighbors method.
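
A minimal sketch of k-nearest neighbors for a binary problem of this kind is shown below. The two indicators per institution, their values, and the labels are hypothetical and do not come from the article. Euclidean distance and k = 3 are also assumptions.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k=3):
    """Return the majority label among the k training points nearest to query."""
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]

# Hypothetical (capital ratio, liquidity ratio) pairs; label 1 = at risk, 0 = stable.
institutions = [
    (0.12, 0.9), (0.15, 1.1), (0.14, 1.0),
    (0.04, 0.3), (0.05, 0.4), (0.03, 0.2),
]
labels = [0, 0, 0, 1, 1, 1]

stable_guess = knn_predict(institutions, labels, (0.13, 0.95))
risky_guess = knn_predict(institutions, labels, (0.04, 0.35))
```

Because the method relies on distances, indicators measured on very different scales would normally be standardized before use.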

Machine learning methods for toxic comment classification: a systematic review

Acta Universitatis Sapientiae Informatica ◽

10.2478/ausi-2020-0012 ◽

2020 ◽

Vol 12 (2) ◽

pp. 205-216

Author(s):

Darko Andročec

Keyword(s):

Machine Learning ◽

Systematic Review ◽

Classification Problem ◽

Future Research ◽

Primary Study ◽

Learning Methods ◽

Data Set ◽

Machine Learning Methods ◽

Research Themes ◽

Evaluation Metric

Abstract Nowadays users leave numerous comments on different social networks, news portals, and forums. Some of the comments are toxic or abusive. Due to the number of comments, it is infeasible to moderate them manually, so most systems use some kind of automatic toxicity detection based on machine learning models. In this work, we performed a systematic review of the state of the art in toxic comment classification using machine learning methods. We extracted data from 31 selected primary relevant studies. First, we investigated when and where the papers were published and their maturity level. In our analysis of every primary study we investigated the data set used, the evaluation metric, the machine learning methods used, the classes of toxicity, and the comment language. We finish our work with a comprehensive list of gaps in current research and suggestions for future research themes related to the online toxic comment classification problem.
