Autoregressive-based outlier algorithm to detect money laundering activities

2017 ◽  
Vol 20 (2) ◽  
pp. 190-202 ◽  
Author(s):  
Kannan S. ◽  
Somasundaram K.

Purpose Due to the large-size, non-uniform transactions per day, the money laundering detection (MLD) is a time-consuming and difficult process. The major purpose of the proposed auto-regressive (AR) outlier-based MLD (AROMLD) is to reduce the time consumption for handling large-sized non-uniform transactions. Design/methodology/approach The AR-based outlier design produces consistent asymptotic distributed results that enhance the demand-forecasting abilities. Besides, the inter-quartile range (IQR) formulations proposed in this paper support the detailed analysis of time-series data pairs. Findings The prediction of high-dimensionality and the difficulties in the relationship/difference between the data pairs makes the time-series mining as a complex task. The presence of domain invariance in time-series mining initiates the regressive formulation for outlier detection. The deep analysis of time-varying process and the demand of forecasting combine the AR and the IQR formulations for an effective outlier detection. Research limitations/implications The present research focuses on the detection of an outlier in the previous financial transaction, by using the AR model. Prediction of the possibility of an outlier in future transactions remains a major issue. Originality/value The lack of prior segmentation of ML detection suffers from dimensionality. Besides, the absence of boundary to isolate the normal and suspicious transactions induces the limitations. The lack of deep analysis and the time consumption are overwhelmed by using the regression formulation.

2021 ◽  
Vol 54 (3) ◽  
pp. 1-33
Author(s):  
Ane Blázquez-García ◽  
Angel Conde ◽  
Usue Mori ◽  
Jose A. Lozano

Recent advances in technology have brought major breakthroughs in data collection, enabling a large amount of data to be gathered over time and thus generating time series. Mining this data has become an important task for researchers and practitioners in the past few years, including the detection of outliers or anomalies that may represent errors or events of interest. This review aims to provide a structured and comprehensive state-of-the-art on unsupervised outlier detection techniques in the context of time series. To this end, a taxonomy is presented based on the main aspects that characterize an outlier detection technique.


2016 ◽  
Vol 50 (1) ◽  
pp. 41-57 ◽  
Author(s):  
Linghe Huang ◽  
Qinghua Zhu ◽  
Jia Tina Du ◽  
Baozhen Lee

Purpose – Wiki is a new form of information production and organization, which has become one of the most important knowledge resources. In recent years, with the increase of users in wikis, “free rider problem” has been serious. In order to motivate editors to contribute more to a wiki system, it is important to fully understand their contribution behavior. The purpose of this paper is to explore the law of dynamic contribution behavior of editors in wikis. Design/methodology/approach – After developing a dynamic model of contribution behavior, the authors employed both the metrological and clustering methods to process the time series data. The experimental data were collected from Baidu Baike, a renowned Chinese wiki system similar to Wikipedia. Findings – There are four categories of editors: “testers,” “dropouts,” “delayers” and “stickers.” Testers, who contribute the least content and stop contributing rapidly after editing a few articles. After editing a large amount of content, dropouts stop contributing completely. Delayers are the editors who do not stop contributing during the observation time, but they may stop contributing in the near future. Stickers, who keep contributing and edit the most content, are the core editors. In addition, there are significant time-of-day and holiday effects on the number of editors’ contributions. Originality/value – By using the method of time series analysis, some new characteristics of editors and editor types were found. Compared with the former studies, this research also had a larger sample. Therefore, the results are more scientific and representative and can help managers to better optimize the wiki systems and formulate incentive strategies for editors.


2017 ◽  
Vol 1 ◽  
pp. 41-54 ◽  
Author(s):  
Amrit Subedi

Background: There are various approaches of modeling on time series data. Most of the studies conducted regarding time series data are based on annual trend whereas very few concerned with data having monthly fluctuation. The data of tourist arrivals is an example of time series data with monthly fluctuation which reveals that there is higher number of tourist arrivals in some months/seasons whereas others have less number. Starting from January, it makes a complete cycle in every 12 months with 3 bends indicating that it can be captured by biquadratic function.Objective: To provide an alternative approach of modeling i.e. combination of Autoregressive model with polynomial (biquadratic) function on time series data with monthly/seasonal fluctuation and compare its adequacy with widely used cyclic autoregressive model i.e. AR (12).Materials and Methods: This study is based on monthly data of tourist arrivals in Nepal. Firstly, usual time series model AR (12) has been adopted and an alternative approach of modeling has been attempted combining AR and biquadratic function. The first part of the model i.e. AR represents annual trend whereas biquadratic part does for monthly fluctuation.Results: The fitted cyclic autoregressive model on monthly data of tourist arrivals is Est. Ym = 3614.33 + 0.9509Ym-12, (R2=0.80); Est. Ym indicates predicted tourist arrivals for mth month and Ym-12 indicates observed tourist arrivals in (m-12)th month and the combined model of AR and biquadratic function is Est. Yt(m) = -46464.6 + 1.000Yt-1 + 52911.56m - 17177m2 + 2043.95m3 - 79.43m4, (R2=0.78); Est. Yt(m) indicates predicted tourist arrivals for mth month of tth year and Yt-1 indicates average tourist arrivals in (t-1)th year. The AR model combined with polynomial function reveals normal and homoscedastic residuals more accurately compared to first one.Conclusion: The use of polynomial function combined with autoregressive model can be useful for time series data having seasonal fluctuation. It can be an alternative approach for picking up a good model for such type of data. Nepalese Journal of Statistics, 2017,  Vol. 1, 41-54


2018 ◽  
Vol 11 (4) ◽  
pp. 486-495
Author(s):  
Ke Yi Zhou ◽  
Shaolin Hu

Purpose The similarity measurement of time series is an important research in time series detection, which is a basic work of time series clustering, anomaly discovery, prediction and many other data mining problems. The purpose of this paper is to design a new similarity measurement algorithm to improve the performance of the original similarity measurement algorithm. The subsequence morphological information is taken into account by the proposed algorithm, and time series is represented by a pattern, so the similarity measurement algorithm is more accurate. Design/methodology/approach Following some previous researches on similarity measurement, an improved method is presented. This new method combines morphological representation and dynamic time warping (DTW) technique to measure the similarities of time series. After the segmentation of time series data into segments, three parameter values of median, point number and slope are introduced into the improved distance measurement formula. The effectiveness of the morphological weighted DTW algorithm (MW-DTW) is demonstrated by the example of momentum wheel data of an aircraft attitude control system. Findings The improved method is insensitive to the distortion and expansion of time axis and can be used to detect the morphological changes of time series data. Simulation results confirm that this method proposed in this paper has a high accuracy of similarity measurement. Practical implications This improved method has been used to solve the problem of similarity measurement in time series, which is widely emerged in different fields of science and engineering, such as the field of control, measurement, monitoring, process signal processing and economic analysis. Originality/value In the similarity measurement of time series, the distance between sequences is often used as the only detection index. The results of similarity measurement should not be affected by the longitudinal or transverse stretching and translation changes of the sequence, so it is necessary to incorporate the morphological changes of the sequence into similarity measurement. The MW-DTW is more suitable for the actual situation. At the same time, the MW-DTW algorithm reduces the computational complexity by transforming the computational object to subsequences.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Najimu Saka ◽  
Abdullahi Babatunde Saka ◽  
Opeoluwa Akinradewo ◽  
Clinton O. Aigbavboa

Purpose The complex interaction of politics and the economy is a critical factor for the sustainable growth and development of the construction sector (CNS). This study aims to investigate the effects of type of political administration including democracy and military on the performance of CNS using the Nigerian Construction Sector (NCS) as a case study. Design/methodology/approach A 48 year (1970–2017) time series data (TSD) on the NCS and the gross domestic product (GDP) based on 2010 constant USD were extracted from the United Nations Statistical Department database. Analysis of variance (ANOVA) and analysis of covariance (ANCOVA) models were used to analyze the TSD. The ANCOVA model includes the GDP as correlational variable or covariate. Findings The estimates of the ANOVA model indicate that democratic administration is significantly better than military administration in construction performance. However, the ANCOVA model indicates that the GDP is more important than political administration in the performance of the CNS. The study recommends for a new national construction policy, favourable fiscal and monetary policy, local content development policy and construction credit guaranty scheme for the rapid growth and development of the NCS. Originality/value Hitherto, little is known about the influence of political administration on the performance of the CNS. This study provides empirical evidence from a developing economy perspective. It presents the relationships and highlights recommendations for driving growth in the construction industry.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Zulkifli Halim ◽  
Shuhaida Mohamed Shuhidan ◽  
Zuraidah Mohd Sanusi

PurposeIn the previous study of financial distress prediction, deep learning techniques performed better than traditional techniques over time-series data. This study investigates the performance of deep learning models: recurrent neural network, long short-term memory and gated recurrent unit for the financial distress prediction among the Malaysian public listed corporation over the time-series data. This study also compares the performance of logistic regression, support vector machine, neural network, decision tree and the deep learning models on single-year data.Design/methodology/approachThe data used are the financial data of public listed companies that been classified as PN17 status (distress) and non-PN17 (not distress) in Malaysia. This study was conducted using machine learning library of Python programming language.FindingsThe findings indicate that all deep learning models used for this study achieved 90% accuracy and above with long short-term memory (LSTM) and gated recurrent unit (GRU) getting 93% accuracy. In addition, deep learning models consistently have good performance compared to the other models over single-year data. The results show LSTM and GRU getting 90% and recurrent neural network (RNN) 88% accuracy. The results also show that LSTM and GRU get better precision and recall compared to RNN. The findings of this study show that the deep learning approach will lead to better performance in financial distress prediction studies. To be added, time-series data should be highlighted in any financial distress prediction studies since it has a big impact on credit risk assessment.Research limitations/implicationsThe first limitation of this study is the hyperparameter tuning only applied for deep learning models. Secondly, the time-series data are only used for deep learning models since the other models optimally fit on single-year data.Practical implicationsThis study proposes recommendations that deep learning is a new approach that will lead to better performance in financial distress prediction studies. Besides that, time-series data should be highlighted in any financial distress prediction studies since the data have a big impact on the assessment of credit risk.Originality/valueTo the best of authors' knowledge, this article is the first study that uses the gated recurrent unit in financial distress prediction studies based on time-series data for Malaysian public listed companies. The findings of this study can help financial institutions/investors to find a better and accurate approach for credit risk assessment.


2020 ◽  
Vol 49 (2) ◽  
pp. 229-248
Author(s):  
Tamson Pietsch

PurposeThe purpose of this paper is to create comparable time series data on university income in Australia and the UK that might be used as a resource for those seeking to understand the changing funding profile of universities in the two countries and for those seeking to investigate how such data were produced and utilised.Design/methodology/approachA statistical analysis of university income from all sources in the UK and Australia.FindingsThe article produces a new time series for Australia and a comparable time series for the UK. It suggests some of the ways these data related to broader patterns of economic change, sketches the possibility of strategic influence, and outlines some of their limitations.Originality/valueThis is the first study to systematically create a time series on Australian university income across the twentieth century and present it alongside a comparable dataset for the UK.


Sign in / Sign up

Export Citation Format

Share Document