Analysis of Tumor Disease Patterns Based on Medical Big Data

2021 ◽  
Vol 11 (2) ◽  
pp. 478-486
Author(s):  
Jing Zheng ◽  
Zhongjun Gao ◽  
Lixin Pu ◽  
Mingjie He ◽  
Jipeng Fan ◽  
...  

Using the medical big data mining related technology, the model of tumor disease was analyzed and studied. Using data science methods as a guiding method and idea, analyzing and constructing a medical service model based on big data for oncology diseases, exploring its development strategy; using business process analysis method to analyze the business process and mapping of cancer disease medical services; using serviceoriented architecture analysis and Design methodology to build a highly flexible, configurable, and easily scalable precision medical big data platform. By analyzing the characteristics of medical big data and the shortcomings of the traditional Apriori algorithm, the Hadoop platform is used to improve and optimize the Apriori algorithm. The results show that the improved Apriori algorithm has great improvement in efficiency and performance, and can be adapted to mining medical big data. Through data mining experiments, it is concluded that there is a correlation between tumors and smoking, chronic infection, occupational pathogenic factors, etc. It has certain guiding significance for the prevention and treatment of tumors, thus also demonstrating the improved Apriori algorithm for lung tumors. Clinical research has practical significance.

2020 ◽  
Vol 1 (1) ◽  
pp. 23-26
Author(s):  
Siti Zulaikha ◽  
Martaleli Bettiza ◽  
Nola Ritha

Data on the rainfall is compelling to study as it becomes one of the major factors affecting the weather in a certain region and various aspects of life as well. Generally, predicting rainfall is performed by analyzing data in the past in certain methods. Rainfall is prone to follow repeated pattern in sequence of time. The utilization of big data mining is expected to result in any valuable information that used to be unrevealed in the big data store. Some methods used in data mining are Apriori Algorithm and Improved Apriori Algorithm. Improved Apriori itself is to represent the database in the form of matrix to describe its relation in the database. Data used in this research is the rainfall factor in 2016 in Tanjungpinang city. Based on the test of Improved Apriori Algorithm, it was found out that the relation of the rainfall and weather factors utilizing 2 item sets, that is, if the temperature is low (24,0 - 26,0), the humidity is high (85 - 100), then the rainfall is mild. If the temperature is low (24,0 - 26,0), the light intensity is low (0 – 3), then the rainfall is heavy, and 3 item sets if the temperature is low (24,0 - 26,0), the humidity is high (85 - 100), the sun light intensity is low (0-3), then the rainfall is medium.


Author(s):  
Gurdeep S Hura

This chapter presents this new emerging technology of social media and networking with a detailed discussion on: basic definitions and applications, how this technology evolved in the last few years, the need for dynamicity under data mining environment. It also provides a comprehensive design and analysis of popular social networking media and sites available for the users. A brief discussion on the data mining methodologies for implementing the variety of new applications dealing with huge/big data in data science is presented. Further, an attempt is being made in this chapter to present a new emerging perspective of data mining methodologies with its dynamicity for social networking media and sites as a new trend and needed framework for dealing with huge amount of data for its collection, analysis and interpretation for a number of real world applications. A discussion will also be provided for the current and future status of data mining of social media and networking applications.


2019 ◽  
Vol 25 (2) ◽  
pp. 308-321 ◽  
Author(s):  
Arfan Majeed ◽  
Jingxiang Lv ◽  
Tao Peng

Purpose This paper aims to present an overall framework of big data-based analytics to optimize the production performance of additive manufacturing (AM) process. Design/methodology/approach Four components, namely, big data application, big data sensing and acquisition, big data processing and storage, model establishing, data mining and process optimization were presented to comprise the framework. Key technologies including the big data acquisition and integration, big data mining and knowledge sharing mechanism were developed for the big data analytics for AM. Findings The presented framework was demonstrated by an application scenario from a company of three-dimensional printing solutions. The results show that the proposed framework benefited customers, manufacturers, environment and even all aspects of manufacturing phase. Research limitations/implications This study only proposed a framework, and did not include the realization of the algorithm for data analysis, such as association, classification and clustering. Practical implications The proposed framework can be used to optimize the quality, energy consumption and production efficiency of the AM process. Originality/value This paper introduces the concept of big data in the field of AM. The proposed framework can be used to make better decisions based on the big data during manufacturing process.


2014 ◽  
Vol 568-570 ◽  
pp. 798-801
Author(s):  
Ye Qing Xiong ◽  
Shu Dong Zhang

It occurs time and space performance bottlenecks when traditional association rules algorithms are used to big data mining. This paper proposes a parallel algorithm based on matrix under cloud computing to improve Apriori algorithm. The algorithm uses binary matrix to store transaction data, uses matrix "and" operation to replace the connection between itemsets and combines cloud computing technology to implement the parallel mining for frequent itemsets. Under different conditions, the simulation shows it improves the efficiency, solves the performance bottleneck problem and can be widely used in big data mining with strong scalability and stability.


2016 ◽  
Vol 21 (3) ◽  
pp. 525-547 ◽  
Author(s):  
Scott Tonidandel ◽  
Eden B. King ◽  
Jose M. Cortina

Advances in data science, such as data mining, data visualization, and machine learning, are extremely well-suited to address numerous questions in the organizational sciences given the explosion of available data. Despite these opportunities, few scholars in our field have discussed the specific ways in which the lens of our science should be brought to bear on the topic of big data and big data's reciprocal impact on our science. The purpose of this paper is to provide an overview of the big data phenomenon and its potential for impacting organizational science in both positive and negative ways. We identifying the biggest opportunities afforded by big data along with the biggest obstacles, and we discuss specifically how we think our methods will be most impacted by the data analytics movement. We also provide a list of resources to help interested readers incorporate big data methods into their existing research. Our hope is that we stimulate interest in big data, motivate future research using big data sources, and encourage the application of associated data science techniques more broadly in the organizational sciences.


2008 ◽  
Vol 23 (3) ◽  
pp. 185-202 ◽  
Author(s):  
Hannu Salmela

Increased reliance on computers by many companies and public organisations has generated a business risk for them. For example, problems in systems availability can lead to additional cost in operations, lost revenues and competitive losses. Systematic analyses are needed to avoid such losses. Prior research provides rigorous methods for identifying information systems risks and reducing the likelihood of these risks. The analysis of potential business losses in the user organisations has, however, received only limited attention. The objective in this paper is to investigate methods that can be used in the systematic identification of potential losses in the user organisation. The paper provides a review of prior literature on methods that can be used to analyse and report such losses. One understudied method is business process analysis that entails a systematic analysis of potential losses in different phases of the organisation's core business processes. Action research was used to examine the use of business process analysis as a method to associate information systems availability risk with potential business losses. The analysis was carried out in two different companies, one operating in the paper industry and the other one in the finance sector. The study contributes to research by summarising prior research on the analysis of business losses and by illustrating how the use of one method, business process analysis, assisted the client organisations. Perhaps most importantly, it identifies a new topic that has received very little attention in information systems research, despite its obvious managerial and practical significance.


2019 ◽  
Vol 25 (7) ◽  
pp. 1783-1801 ◽  
Author(s):  
Shu-hsien Liao ◽  
Yi-Shan Tasi

Purpose In the retailing industry, database is the time and place where a retail transaction is completed. E-business processes are increasingly adopting databases that can obtain in-depth customers and sales knowledge with the big data analysis. The specific big data analysis on a database system allows a retailer designing and implementing business process management (BPM) to maximize profits, minimize costs and satisfy customers on a business model. Thus, the research of big data analysis on the BPM in the retailing is a critical issue. The paper aims to discuss this issue. Design/methodology/approach This paper develops a database, ER model, and uses cluster analysis, C&R tree and the a priori algorithm as approaches to illustrate big data analysis/data mining results for generating business intelligence and process management, which then obtain customer knowledge from the case firm’s database system. Findings Big data analysis/data mining results such as customer profiles, product/brand display classifications and product/brand sales associations can be used to propose alternatives to the case firm for store layout and bundling sales business process and management development. Originality/value This research paper is an example to develop the BPM of database model and big data/data mining based on insights from big data analysis applications for store layout and bundling sales in the retailing industry.


2020 ◽  
Author(s):  
Huanhuan Wang ◽  
Xiang Wu ◽  
Yongqi Tan ◽  
Hongsheng Yin ◽  
Xiaochun Cheng ◽  
...  

BACKGROUND Medical data mining and sharing is an important process to realize the value of medical big data in E-Health applications. However, medical data contains a large amount of personal private information of patients, there is a risk of privacy disclosure when sharing and mining. Therefore, how to ensure the security of medical big data in the process of publishing, sharing and mining has become the focus of current researches. OBJECTIVE The objective of our study is to design a framework based on differential privacy protection mechanism to ensure the security sharing of medical data. We developed a privacy Protection Query Language (PQL) that can integrate multiple machine mining methods and provide secure sharing functions for medical data. METHODS This paper adopts a modular design method with three sub-modules, including parsing module, mining module and noising module. Each module encapsulates different computing devices, such as composite parser, noise jammer, etc. In the PQL framework, we apply the differential privacy mechanism to the results of the module collaborative calculation to optimize the security of various mining algorithms. These computing devices operate independently, but the mining results depend on their cooperation. RESULTS Designed and developed a query language framework that provides medical data mining, sharing and privacy preserving functions. We theoretically proved the performance of the PQL framework. The experimental results showed that the PQL framework can ensure the security of each mining result, and the average usefulness of the output results is above 97%. CONCLUSIONS We presented a security framework that enables medical data providers to securely share the health data or treatment data, and developed a usable query language based on differential privacy mechanism that enables researchers to mine potential information securely using data mining algorithms. CLINICALTRIAL


Sign in / Sign up

Export Citation Format

Share Document