scholarly journals Optimal Subsampling Methods in Bike Sharing Data Analysis

Author(s):  
Xiaofeng Zhao

Abstract Bike sharing system are popular around the world. Traditional bike sharing system require the bikes to be returned to fixed stations, while morden system allows users to leave bikes wherever they like, ready for the next user to pick them up. Smartphone use GPS signal to keep track of its bikes and monitor where most bikes are used and where to place them. Smartphone simultaneously collect many other information such as weather condition, temperature and so on, these features have influence on the delivering amount of bikes. Due to the extensive number of smartphone users, big data technique is requried to handle this situation. We apply subsample method to this smartphone collected big data. In this paper, we derive non-uniform sampling distributions and propose optimal subsampling algorithm. We apply the proposed optimal subsampling algorithm to analyze the smartphone collected bike sharing data set, perfrom extensive computer experiments to evaluate the numerical performance of the proposed sampling algorithm. Our results indicated that the proposed optimal algorithm outperformed the uniform method and have faster running time than using the whole data set.

2021 ◽  
Vol 285 ◽  
pp. 116429
Author(s):  
Wen-Long Shang ◽  
Jinyu Chen ◽  
Huibo Bi ◽  
Yi Sui ◽  
Yanyan Chen ◽  
...  

Author(s):  
Andrei M. Bandalouski ◽  
Natalja G. Egorova ◽  
Mikhail Y. Kovalyov ◽  
Erwin Pesch ◽  
S. Armagan Tarim

AbstractIn this paper we present a novel approach to the dynamic pricing problem for hotel businesses. It includes disaggregation of the demand into several categories, forecasting, elastic demand simulation, and a mathematical programming model with concave quadratic objective function and linear constraints for dynamic price optimization. The approach is computationally efficient and easy to implement. In computer experiments with a hotel data set, the hotel revenue is increased by about 6% on average in comparison with the actual revenue gained in a past period, where the fixed price policy was employed, subject to an assumption that the demand can deviate from the suggested elastic model. The approach and the developed software can be a useful tool for small hotels recovering from the economic consequences of the COVID-19 pandemic.


2021 ◽  
Vol 11 (5) ◽  
pp. 2340
Author(s):  
Sanjay Mathrani ◽  
Xusheng Lai

Web data have grown exponentially to reach zettabyte scales. Mountains of data come from several online applications, such as e-commerce, social media, web and sensor-based devices, business web sites, and other information types posted by users. Big data analytics (BDA) can help to derive new insights from this huge and fast-growing data source. The core advantage of BDA technology is in its ability to mine these data and provide information on underlying trends. BDA, however, faces innate difficulty in optimizing the process and capabilities that require merging of diverse data assets to generate viable information. This paper explores the BDA process and capabilities in leveraging data via three case studies who are prime users of BDA tools. Findings emphasize four key components of the BDA process framework: system coordination, data sourcing, big data application service, and end users. Further building blocks are data security, privacy, and management that represent services for providing functionality to the four components of the BDA process across information and technology value chains.


Author(s):  
Yihao Tian

Big data is an unstructured data set with a considerable volume, coming from various sources such as the internet, business organizations, etc., in various formats. Predicting consumer behavior is a core responsibility for most dealers. Market research can show consumer intentions; it can be a big order for a best-designed research project to penetrate the veil, protecting real customer motivations from closer scrutiny. Customer behavior usually focuses on customer data mining, and each model is structured at one stage to answer one query. Customer behavior prediction is a complex and unpredictable challenge. In this paper, advanced mathematical and big data analytical (BDA) methods to predict customer behavior. Predictive behavior analytics can provide modern marketers with multiple insights to optimize efforts in their strategies. This model goes beyond analyzing historical evidence and making the most knowledgeable assumptions about what will happen in the future using mathematical. Because the method is complex, it is quite straightforward for most customers. As a result, most consumer behavior models, so many variables that produce predictions that are usually quite accurate using big data. This paper attempts to develop a model of association rule mining to predict customers’ behavior, improve accuracy, and derive major consumer data patterns. The finding recommended BDA method improves Big data analytics usability in the organization (98.2%), risk management ratio (96.2%), operational cost (97.1%), customer feedback ratio (98.5%), and demand prediction ratio (95.2%).


2021 ◽  
Author(s):  
FENG GUO ◽  
HUI-LIN QIN

With the continuous development of information technology, enterprises have gradually entered the era of big data. How to analyze the complex data and find out the useful information to promote the development of enterprises is becoming more and more important in the modernization of science and technology. This paper expounds the importance and existing problems of big data application in enterprise management, and briefly analyzes and discusses its application in enterprises and its future development direction and trend. With the rapid development of Internet of things, cloud computing and other information technology, the world ushered in the era of big data. It has become a trend to promote the deep integration of Internet, big data, artificial intelligence and real economy. Due to the rapid development of economy, the amount of data information generated in the process of consumption and production is very large. Under the traditional management mode, enterprises can not meet the needs of the current social and economic development. However, the application of big data technology in enterprises can achieve better analysis and Research on these data information, so as to provide reliable data basis for enterprises to carry out various business management decisions.


2019 ◽  
Vol 2 ◽  
pp. 1-6
Author(s):  
Wenjuan Lu ◽  
Aiguo Liu ◽  
Chengcheng Zhang

<p><strong>Abstract.</strong> With the development of geographic information technology, the way to get geographical information is constantly, and the data of space-time is exploding, and more and more scholars have started to develop a field of data processing and space and time analysis. In this, the traditional data visualization technology is high in popularity and simple and easy to understand, through simple pie chart and histogram, which can reveal and analyze the characteristics of the data itself, but still cannot combine with the map better to display the hidden time and space information to exert its application value. How to fully explore the spatiotemporal information contained in massive data and accurately explore the spatial distribution and variation rules of geographical things and phenomena is a key research problem at present. Based on this, this paper designed and constructed a universal thematic data visual analysis system that supports the full functions of data warehousing, data management, data analysis and data visualization. In this paper, Weifang city is taken as the research area, starting from the aspects of rainfall interpolation analysis and population comprehensive analysis of Weifang, etc., the author realizes the fast and efficient display under the big data set, and fully displays the characteristics of spatial and temporal data through the visualization effect of thematic data. At the same time, Cassandra distributed database is adopted in this research, which can also store, manage and analyze big data. To a certain extent, it reduces the pressure of front-end map drawing, and has good query analysis efficiency and fast processing ability.</p>


A large volume of datasets is available in various fields that are stored to be somewhere which is called big data. Big Data healthcare has clinical data set of every patient records in huge amount and they are maintained by Electronic Health Records (EHR). More than 80 % of clinical data is the unstructured format and reposit in hundreds of forms. The challenges and demand for data storage, analysis is to handling large datasets in terms of efficiency and scalability. Hadoop Map reduces framework uses big data to store and operate any kinds of data speedily. It is not solely meant for storage system however conjointly a platform for information storage moreover as processing. It is scalable and fault-tolerant to the systems. Also, the prediction of the data sets is handled by machine learning algorithm. This work focuses on the Extreme Machine Learning algorithm (ELM) that can utilize the optimized way of finding a solution to find disease risk prediction by combining ELM with Cuckoo Search optimization-based Support Vector Machine (CS-SVM). The proposed work also considers the scalability and accuracy of big data models, thus the proposed algorithm greatly achieves the computing work and got good results in performance of both veracity and efficiency.


Author(s):  
Dr. Pasumponpandian A.

The integration of two of the biggest giants in the computing world has resulted in the development and advancement of new methodologies in data processing. Cognitive computing and big data analytics are integrated to give rise to advanced technologically sound algorithms like MOIWO and NSGA. There is an important role played by the E-projects portfolio selection (EPPS) issue in the web development environment that is handled with the help of a decision making algorithm based on big data. The EPPS problem tackles choosing the right projects for investment on the social media in order to achieve maximum return at minimal risk conditions. In order to address this issue and further optimize EPPS probe on social media, the proposed work focuses on building a hybrid algorithm known as NSGA-II-MOIWO. This algorithms makes use of the positive aspects of MOIWO algorithm and NSGA-II algorithm in order to develop an efficient one. The experimental results are recorded and analyzed in order to determine the most optimal algorithm based on the return and risk of investment. Based on the results, it is found that NSGA-II-MOIWO outperforms both MOIWO and NSGA, proving to be a better hybrid alternative.


Author(s):  
Hena Iqbal ◽  
Sujni Paul ◽  
Khaliquzzaman Khan

Evaluation is an analytical and organized process to figure out the present positive influences, favourable future prospects, existing shortcomings and ulterior complexities of any plan, program, practice or a policy. Evaluation of policy is an essential and vital process required to measure the performance or progression of the scheme. The main purpose of policy evaluation is to empower various stakeholders and enhance their socio-economic environment. A large number of policies or schemes in different areas are launched by government in view of citizen welfare. Although, the governmental policies intend to better shape up the life quality of people but may also impact their every day’s life. A latest governmental scheme Saubhagya launched by Indian government in 2017 has been selected for evaluation by applying opinion mining techniques. The data set of public opinion associated with this scheme has been captured by Twitter. The primary intent is to offer opinion mining as a smart city technology that harness the user-generated big data and analyse it to offer a sustainable governance model.


2012 ◽  
Vol 7 (1) ◽  
pp. 174-197 ◽  
Author(s):  
Heather Small ◽  
Kristine Kasianovitz ◽  
Ronald Blanford ◽  
Ina Celaya

Social networking sites and other social media have enabled new forms of collaborative communication and participation for users, and created additional value as rich data sets for research. Research based on accessing, mining, and analyzing social media data has risen steadily over the last several years and is increasingly multidisciplinary; researchers from the social sciences, humanities, computer science and other domains have used social media data as the basis of their studies. The broad use of this form of data has implications for how curators address preservation, access and reuse for an audience with divergent disciplinary norms related to privacy, ownership, authenticity and reliability.In this paper, we explore how the characteristics of the Twitter platform, coupled with an ambiguous and evolving understanding of privacy in networked communication, and divergent disciplinary understandings of the resulting data, combine to create complex issues for curators trying to ensure broad-based and ethical reuse of Twitter data. We provide a case study of a specific data set to illustrate how data curators can engage with the topics and questions raised in the paper. While some initial suggestions are offered to librarians and other information professionals who are beginning to receive social media data from researchers, our larger goal is to stimulate discussion and prompt additional research on the curation and preservation of social media data.


Sign in / Sign up

Export Citation Format

Share Document