Machine Learning

Author(s):  
João Gama ◽  
André C.P.L.F. de Carvalho

Machine learning techniques have been successfully applied to several real world problems in areas as diverse as image analysis, Semantic Web, bioinformatics, text processing, natural language processing,telecommunications, finance, medical diagnosis, and so forth. A particular application where machine learning plays a key role is data mining, where machine learning techniques have been extensively used for the extraction of association, clustering, prediction, diagnosis, and regression models. This text presents our personal view of the main aspects, major tasks, frequently used algorithms, current research, and future directions of machine learning research. For such, it is organized as follows: Background information concerning machine learning is presented in the second section. The third section discusses different definitions for Machine Learning. Common tasks faced by Machine Learning Systems are described in the fourth section. Popular Machine Learning algorithms and the importance of the loss function are commented on in the fifth section. The sixth and seventh sections present the current trends and future research directions, respectively.

2012 ◽  
pp. 13-22 ◽  
Author(s):  
João Gama ◽  
André C.P.L.F. de Carvalho

Machine learning techniques have been successfully applied to several real world problems in areas as diverse as image analysis, Semantic Web, bioinformatics, text processing, natural language processing,telecommunications, finance, medical diagnosis, and so forth. A particular application where machine learning plays a key role is data mining, where machine learning techniques have been extensively used for the extraction of association, clustering, prediction, diagnosis, and regression models. This text presents our personal view of the main aspects, major tasks, frequently used algorithms, current research, and future directions of machine learning research. For such, it is organized as follows: Background information concerning machine learning is presented in the second section. The third section discusses different definitions for Machine Learning. Common tasks faced by Machine Learning Systems are described in the fourth section. Popular Machine Learning algorithms and the importance of the loss function are commented on in the fifth section. The sixth and seventh sections present the current trends and future research directions, respectively.


2020 ◽  
Vol 7 (10) ◽  
pp. 380-389
Author(s):  
Asogwa D.C ◽  
Anigbogu S.O ◽  
Anigbogu G.N ◽  
Efozia F.N

Author's age prediction is the task of determining the author's age by studying the texts written by them. The prediction of author’s age can be enlightening about the different trends, opinions social and political views of an age group. Marketers always use this to encourage a product or a service to an age group following their conveyed interests and opinions. Methodologies in natural language processing have made it possible to predict author’s age from text by examining the variation of linguistic characteristics. Also, many machine learning algorithms have been used in author’s age prediction. However, in social networks, computational linguists are challenged with numerous issues just as machine learning techniques are performance driven with its own challenges in realistic scenarios. This work developed a model that can predict author's age from text with a machine learning algorithm (Naïve Bayes) using three types of features namely, content based, style based and topic based. The trained model gave a prediction accuracy of 80%.


Author(s):  
Rashida Ali ◽  
Ibrahim Rampurawala ◽  
Mayuri Wandhe ◽  
Ruchika Shrikhande ◽  
Arpita Bhatkar

Internet provides a medium to connect with individuals of similar or different interests creating a hub. Since a huge hub participates on these platforms, the user can receive a high volume of messages from different individuals creating a chaos and unwanted messages. These messages sometimes contain a true information and sometimes false, which leads to a state of confusion in the minds of the users and leads to first step towards spam messaging. Spam messages means an irrelevant and unsolicited message sent by a known/unknown user which may lead to a sense of insecurity among users. In this paper, the different machine learning algorithms were trained and tested with natural language processing (NLP) to classify whether the messages are spam or ham.


Author(s):  
Mercedes Barrachina ◽  
Laura Valenzuela López

Sleep disorders are related to many different diseases, and they could have a significant impact in patients' health, causing an economic impact to the society and to the national health systems. In the United States, according to information from the Center for Disease Control and Prevention, those disorders are affecting 50-70 million in the adult population. Sleep disorders are causing annually around 40,000 deaths due to cardiovascular problems, and they cost the health system more than 16 billion. In other countries, such as in Spain, those disorders affect up to 48% of the adult population. The main objective of this chapter is to review and evaluate the different machine learning techniques utilized by researchers and medical professionals to identify, assess, and characterize sleep disorders. Moreover, some future research directions are proposed considering the evaluated area.


2018 ◽  
Vol 2 (3) ◽  
pp. 228-267 ◽  
Author(s):  
Zaidi ◽  
Chandola ◽  
Allen ◽  
Sanyal ◽  
Stewart ◽  
...  

Modeling the interactions of water and energy systems is important to the enforcement of infrastructure security and system sustainability. To this end, recent technological advancement has allowed the production of large volumes of data associated with functioning of these sectors. We are beginning to see that statistical and machine learning techniques can help elucidate characteristic patterns across these systems from water availability, transport, and use to energy generation, fuel supply, and customer demand, and in the interdependencies among these systems that can leave these systems vulnerable to cascading impacts from single disruptions. In this paper, we discuss ways in which data and machine learning can be applied to the challenges facing the energy-water nexus along with the potential issues associated with the machine learning techniques themselves. We then survey machine learning techniques that have found application to date in energy-water nexus problems. We conclude by outlining future research directions and opportunities for collaboration among the energy-water nexus and machine learning communities that can lead to mutual synergistic advantage.


2020 ◽  
Author(s):  
Rory Bunker ◽  
Teo Sunsjak

Over the past two decades, Machine Learning (ML) techniques have been increasingly utilized for the purpose of predicting outcomes in sport. In this paper, we provide a review of studies that have used ML for predicting results in team sport, covering studies from 1996 to 2019. We sought to answer five key research questions while extensively surveying papers in this field. This paper offers insights into which ML algorithms have tended to be used in this field, as well as those that are beginning to emerge with successful outcomes. Our research highlights defining characteristics of successful studies and identifies robust strategies for evaluating accuracy results in this application domain. Our study considers accuracies that have been achieved across different sports and explores the notion that outcomes of some team sports could be inherently more difficult to predict than others. Finally, our study uncovers common themes of future research directions across all surveyed papers, looking for gaps and opportunities, while proposing recommendations for future researchers in this domain.


2020 ◽  
Vol 17 (8) ◽  
pp. 3776-3781
Author(s):  
M. Adimoolam ◽  
Raghav Sharma ◽  
A. John ◽  
M. Suresh Kumar ◽  
K. Ashok Kumar

In the past few decades human beings have knowledgeable tremendous intensification in the interaction in particular micro blogging websites and various social media as online resources. Many kinds of data have been used and classification data to group and store are challenging in this real world scenario. Various machine and Natural Language Processing (NLP) were being applied to analysis the sentiment. A major concentration of this work was on using several machine learning algorithms to perform sentimental analysis and comparing various machine learning models for the sentiment classification. This work analysed various sentimental using multiple classifications. From the evaluation of this experiment, it can be concluded that NLP and machine learning Techniques are efficient for sentimental analysis.


2021 ◽  
Vol 5 (2 (113)) ◽  
pp. 55-65
Author(s):  
Aigerim Yerimbetova ◽  
Madina Tussupova ◽  
Madina Sambetbayeva ◽  
Mussa Turdalyuly ◽  
Bakzhan Sakenov

This research is aimed at identifying the parts of speech for the Kazakh and Turkish languages in an information retrieval system. The proposed algorithms are based on machine learning techniques. In this paper, we consider the binary classification of words according to parts of speech. We decided to take the most popular machine learning algorithms. In this paper, the following approaches and well-known machine learning algorithms are studied and considered. We defined 7 dictionaries and tagged 135 million words in Kazakh and 9 dictionaries and 50 million words in the Turkish language. The main problem considered in the paper is to create algorithms for the execution of dictionaries of the so-called Link Grammar Parser (LGP) system, in particular for the Kazakh and Turkish languages, using machine learning techniques. The focus of the research is on the review and comparison of machine learning algorithms and methods that have accomplished results on various natural language processing tasks such as grammatical categories determination. For the operation of the LGP system, a dictionary is created in which a connector for each word is indicated – the type of connection that can be created using this word. The authors considered methods of filling in LGP dictionaries using machine learning.  The complexities of natural language processing, however, do not exclude the possibility of identifying narrower tasks that can already be solved algorithmically: for example, determining parts of speech or splitting texts into logical groups. However, some features of natural languages significantly reduce the effectiveness of these solutions. Thus, taking into account all word forms for each word in the Kazakh and Turkish languages increases the complexity of text processing by an order of magnitude


2021 ◽  
Vol 54 (5) ◽  
pp. 1-36
Author(s):  
Ishai Rosenberg ◽  
Asaf Shabtai ◽  
Yuval Elovici ◽  
Lior Rokach

In recent years, machine learning algorithms, and more specifically deep learning algorithms, have been widely used in many fields, including cyber security. However, machine learning systems are vulnerable to adversarial attacks, and this limits the application of machine learning, especially in non-stationary, adversarial environments, such as the cyber security domain, where actual adversaries (e.g., malware developers) exist. This article comprehensively summarizes the latest research on adversarial attacks against security solutions based on machine learning techniques and illuminates the risks they pose. First, the adversarial attack methods are characterized based on their stage of occurrence, and the attacker’ s goals and capabilities. Then, we categorize the applications of adversarial attack and defense methods in the cyber security domain. Finally, we highlight some characteristics identified in recent research and discuss the impact of recent advancements in other adversarial learning domains on future research directions in the cyber security domain. To the best of our knowledge, this work is the first to discuss the unique challenges of implementing end-to-end adversarial attacks in the cyber security domain, map them in a unified taxonomy, and use the taxonomy to highlight future research directions.


Author(s):  
Saurabh Gupta ◽  
Vaishali Vaishali ◽  
Raghuvansh Tahlan ◽  
Navya Sanjna Joshi ◽  
Ritvik Agarwal

Stock market prediction is a long-time intriguing topic to researchers from different fields. Stock market data is extremely volatile and hence laborious to model. In particular, innumerable studies have been conducted to predict the movement of stock market using Machine Learning algorithms such as Regression Techniques, Time Series Forecasting, Indices Modelling, Natural Language Processing and more, but there is still room for improvement. Also, Option chain and Options have been the subjects that not many have ventured into, leading us to this subject. Mainly, NIFTY and BANKNIFTY Options account for 70% of total derivatives traded and much more turnover than all stocks combined. This research paper attempts to figure out the utility of Option Chain in predicting the direction of movement in NIFTY. We have tried how different features from Option chain can be extracted, and the resulting problem can be solved using Machine Learning techniques and Deep Learning techniques.


Sign in / Sign up

Export Citation Format

Share Document