Conclusion

Author(s):  
Pradeep Kumar

This chapter summarize and concludes the issues and challenges elaborated in different chapters using machine learning approaches presented by various authors. It identifies the importance of supervised and unsupervised learning algorithms establishing classification, prediction, clustering, security policies along with object recognition and pattern matching structures. A systematic position for future research and practice is also described in detail. This book presents the capabilities of machine learning methods and ideas on how these methods could be used to solve real-world problems related to health, social and engineering applications.

2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Fathima Aliyar Vellameeran ◽  
Thomas Brindha

Abstract Objectives To make a clear literature review on state-of-the-art heart disease prediction models. Methods It reviews 61 research papers and states the significant analysis. Initially, the analysis addresses the contributions of each literature works and observes the simulation environment. Here, different types of machine learning algorithms deployed in each contribution. In addition, the utilized dataset for existing heart disease prediction models was observed. Results The performance measures computed in entire papers like prediction accuracy, prediction error, specificity, sensitivity, f-measure, etc., are learned. Further, the best performance is also checked to confirm the effectiveness of entire contributions. Conclusions The comprehensive research challenges and the gap are portrayed based on the development of intelligent methods concerning the unresolved challenges in heart disease prediction using data mining techniques.


2021 ◽  
Author(s):  
Andreas Christ Sølvsten Jørgensen ◽  
Atiyo Ghosh ◽  
Marc Sturrock ◽  
Vahid Shahrezaei

AbstractThe modelling of many real-world problems relies on computationally heavy simulations. Since statistical inference rests on repeated simulations to sample the parameter space, the high computational expense of these simulations can become a stumbling block. In this paper, we compare two ways to mitigate this issue based on machine learning methods. One approach is to construct lightweight surrogate models to substitute the simulations used in inference. Alternatively, one might altogether circumnavigate the need for Bayesian sampling schemes and directly estimate the posterior distribution. We focus on stochastic simulations that track autonomous agents and present two case studies of real-world applications: tumour growths and the spread of infectious diseases. We demonstrate that good accuracy in inference can be achieved with a relatively small number of simulations, making our machine learning approaches orders of magnitude faster than classical simulation-based methods that rely on sampling the parameter space. However, we find that while some methods generally produce more robust results than others, no algorithm offers a one-size-fits-all solution when attempting to infer model parameters from observations. Instead, one must choose the inference technique with the specific real-world application in mind. The stochastic nature of the considered real-world phenomena poses an additional challenge that can become insurmountable for some approaches. Overall, we find machine learning approaches that create direct inference machines to be promising for real-world applications. We present our findings as general guidelines for modelling practitioners.Author summaryComputer simulations play a vital role in modern science as they are commonly used to compare theory with observations. One can thus infer the properties of a observed system by comparing the data to the predicted behaviour in different scenarios. Each of these scenarios corresponds to a simulation with slightly different settings. However, since real-world problems are highly complex, the simulations often require extensive computational resources, making direct comparisons with data challenging, if not insurmountable. It is, therefore, necessary to resort to inference methods that mitigate this issue, but it is not clear-cut what path to choose for any specific research problem. In this paper, we provide general guidelines for how to make this choice. We do so by studying examples from oncology and epidemiology and by taking advantage of developments in machine learning. More specifically, we focus on simulations that track the behaviour of autonomous agents, such as single cells or individuals. We show that the best way forward is problem-dependent and highlight the methods that yield the most robust results across the different case studies. We demonstrate that these methods are highly promising and produce reliable results in a small fraction of the time required by classic approaches that rely on comparisons between data and individual simulations. Rather than relying on a single inference technique, we recommend employing several methods and selecting the most reliable based on predetermined criteria.


CrystEngComm ◽  
2019 ◽  
Vol 21 (8) ◽  
pp. 1215-1223 ◽  
Author(s):  
Ayana Ghosh ◽  
Lydie Louis ◽  
Kapildev K. Arora ◽  
Bruno C. Hancock ◽  
Joseph F. Krzyzaniak ◽  
...  

This work critically evaluates a number of machine learning approaches for predicting the crystallization propensity of active pharmaceutical ingredients using a real-world dataset.


Author(s):  
Zhixiang Chen ◽  
Binhai Zhu ◽  
Xiannong Meng

In this chapter, machine-learning approaches to real-time intelligent Web search are discussed. The goal is to build an intelligent Web search system that can find the user’s desired information with as little relevance feedback from the user as possible. The system can achieve a significant search precision increase with a small number of iterations of user relevance feedback. A new machine-learning algorithm is designed as the core of the intelligent search component. This algorithm is applied to three different search engines with different emphases. This chapter presents the algorithm, the architectures, and the performances of these search engines. Future research issues regarding real-time intelligent Web search are also discussed.


2022 ◽  
pp. 183-210
Author(s):  
Raquel Amaro ◽  
Susana Correia ◽  
Matilde Gonçalves ◽  
Chiara Barbero ◽  
Miguel Magalhães

This chapter presents research on the teaching-learning of Portuguese as a host language, based on the exploration of authentic informational and institutional texts targeting migrant and refugee people, and considering that successful host language teaching must correspond to the needs of its target audience. The chapter discusses methods of defining and identifying criteria and features to monitor official texts with regard to inclusiveness and bias. It provides insights on how to select real texts to be used in task-based language teaching approaches for inclusive host language teaching. Departing from a real corpus analysis, the potential and the limitations of existing guidelines to inclusiveness for the assessment of real texts are shown, as well as other still neglected issues. Furthermore, this chapter provides future research directions to an effective and reliable assessment of inclusive texts that can serve as inclusive host language teaching materials through NLP and machine learning approaches.


Author(s):  
Neeti Sangwan ◽  
Vishal Bhatnagar

In Big Data analysis, the application of machine learning has proven to be a revolutionary. The systematic review of literature shows that research has been carried out on the domain of big data analytics particularly text analytics with the inclusion of machine learning approaches. This extensive survey deals with the data at hand that provides different ways and issues while combining the machine learning approaches with the text. During the course of the survey, various publications in the field of synchronous application of machine learning in text analytics were searched and studied. Classification framework is proposed as the contribution of machine learning in text analytics. A classification framework represented the various application areas to motivate researchers for future research on the application of two emerging technologies.


2018 ◽  
Vol 255 ◽  
pp. 1191-1210 ◽  
Author(s):  
S. De Vito ◽  
E. Esposito ◽  
M. Salvato ◽  
O. Popoola ◽  
F. Formisano ◽  
...  

2019 ◽  
Vol 8 (3) ◽  
pp. 7071-7081

Current generation real-world data sets processed through machine learning are imbalanced by nature. This imbalanced data enables the researchers with a challenging scenario in the context of perdition for both the machine learning and data mining algorithms. It is observed from the past research studies most of the imbalanced data sets consists of the major classes and minor classes and the major class leads the minor class. Several standards and hybrid prediction algorithms are proposed in various application domains but in most of the real-time data sets analyzed in the studies are imbalanced by nature thereby affecting the accuracy of the prediction. This paper presents a systematic survey of the past research studies to analyze intrinsic data characteristics and techniques utilized for handling class-imbalanced data. In addition, this study reveals the research gaps, trends and patterns in existing studies and discusses briefly on future research directions


Sign in / Sign up

Export Citation Format

Share Document