Is data science a science? The essence of phenomenon and the role of theory in the emerging field

Kybernetes ◽  
2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Pedro Jácome de Moura Jr

PurposeData science lacks a distinctive identity and a theory-informed approach, both for its own sake and to properly be applied conjointly to the social sciences. This paper’s purposes are twofold: to provide (1) data science an illustration of theory adoption, able to address explanation and support prediction/prescription capacities and (2) a rationale for identification of the key phenomena and properties of data science so that the data speak through a contextual understanding of reality, broader than has been usual.Design/methodology/approachA literature review and a derived conceptual research model for a push–pull approach (adapted for a data science study in the management field) are presented. A real location–allocation problem is solved through a specific algorithm and explained in the light of the adapted push–pull theory, serving as an instance for a data science theory-informed application in the management field.FindingsThis study advances knowledge on the definition of data science key phenomena as not just pure “data”, but interrelated data and datasets properties, as well as on the specific adaptation of the push-pull theory through its definition, dimensionality and interaction model, also illustrating how to apply the theory in a data science theory-informed research. The proposed model contributes to the theoretical strengthening of data science, still an incipient area, and the solution of the location-allocation problem suggests the applicability of the proposed approach to broad data science problems, alleviating the criticism on the lack of explanation and the focus on pattern recognition in data science practice and research.Research limitations/implicationsThe proposed algorithm requires the previous definition of a perimeter of interest. This aspect should be characterised as an antecedent to the model, which is a strong assumption. As for prescription, in this specific case, one has to take complementary actions, since theory, model and algorithm are not detached from in loco visits, market research or interviews with potential stakeholders.Practical implicationsThis study offers a conceptual model for practical location–allocation problem analyses, based on the push–pull theoretical components. So, it suggests a proper definition for each component (the object, the perspective, the forces, its degrees and the nature of the movement). The proposed model has also an algorithm for computational implementation, which visually describes and explains components interaction, allowing further simulation (estimated forces degrees) for prediction.Originality/valueFirst, this study identifies an overlap of push–pull theoretical approaches, which suggests theory adoption eventually as mere common sense, weakening further theoretical development. Second, this study elaborates a definition for the push–pull theory, a dimensionality and a relationship between its components. Third, a typical location–allocation problem is analysed in the light of the refactored theory, showing its adequacy for that class of problems. And fourth, this study suggests that the essence of a data science should be the study of contextual relationships among data, and that the context should be provided by the spatial, temporal, political, economic and social analytical interests.

2019 ◽  
Vol 26 (3) ◽  
pp. 196-203 ◽  
Author(s):  
Lianjie Qin ◽  
Wei Xu ◽  
Xiujuan Zhao ◽  
Yunjia Ma

BackgroundDetermining the locations of disaster emergency shelters and the allocation of impacted residents are key components in shelter planning and emergency management. Various models have been developed to solve this location–allocation problem, but gaps remain regarding the processes of hazards. This study attempts to develop a model based on the change of typhoon track that addresses the location–allocation problem for typhoon emergency shelters.PurposeTo consider the changes in candidate shelters and number of evacuees due to the change in impact area with the progression of a typhoon.MethodsThe proposed model is composed of several static processes and solved by a modified particle swarm optimisation algorithm with a restart strategy.ResultsThe model is illustrated with the case of the evacuation process for Wenchang in Hainan province during Typhoon Rammasun in 2014 and Typhoon Mirinae in 2016. For the case of Typhoon Rammasun in 2014, the residents from east to west need to evacuate in three phases. For the case of Typhoon Mirinae in 2016, residents in the northern communities need not to evacuate to candidate shelters because they are not affected by the typhoon.ConclusionThe proposed model has advantages compared with non-typhoon track change–based model in saving time spent in shelters for residents and saving public resources for the local governments. With the proposed model, a manager could efficiently evacuate residents by considering the typhoon conditions.


1971 ◽  
Vol 3 (1) ◽  
pp. 73-82 ◽  
Author(s):  
A J Scott

The general location-allocation problem is defined. Then various generalizations of the problem are indicated. The dynamic extension of the location-allocation problem is shown to be of especial interest and significance. Two major approaches to the formulation and analysis of this dynamic problem are discussed. The first approach makes no attempt to anticipate future events and thus leads over the long-run to sub-optimal solutions. The second approach attempts fully to anticipate the future, and is formalized as a dynamic program. This second approach guarantees full optimality over the range of definition of the problem. Some numerical examples are presented.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Mehnoosh Soleimani ◽  
Mohammad Khalilzadeh ◽  
Arman Bahari ◽  
Ali Heidary

Purpose One of the practical issues in the area of location and allocation is the location of the hub. In recent years, exchange rates have fluctuated sharply for a number of reasons such as sanctions against the country. Natural disasters that have occurred in recent years caused delays in hub servicing. The purpose of this study is to develop a mathematical programming model to minimize costs, maximize social responsibility and minimize fuel consumption so that in the event of a disruption in the main hub, the flow of materials can be directed to its backup hub to prevent delays in flow between nodes and disruptions in hubs. Design/methodology/approach A multi-objective mathematical programming model is developed considering uncertainty in some parameters, especially cost as fuzzy numbers. In addition, backup hubs are selected for each primary hub to deal with disruption and natural disasters and prevent delays. Then, a robust possibilistic method is proposed to deal with uncertainty. As the hub location-allocation problem is considered as NP-Hard problems so that exact methods cannot solve them in large sizes, two metaheuristic algorithms including a non-dominated sorting genetic algorithm non-dominated sorting genetic algorithm (NSGA-II) and multi-objective particle swarm optimization (MOPSO) are applied to tackle the problem. Findings Numerical results show the proposed model is valid. Also, they demonstrate that the NSGA-II algorithm outperforms the MOPSO algorithm. Practical implications The proposed model was implemented in one of the largest food companies in Iran, which has numerous products manufactured in different cities, to seek the hub locations. Also, due to several reasons such as road traffic and route type the difference in the rate of fuel consumption between nodes, this model helps managers and decision-makers to choose the best locations to have the least fuel consumption. Moreover, as the hub set up increases the employment rate in that city and has social benefits as it requires hiring some staff. Originality/value This paper investigates the hub location problem considering backup hubs with multiple objective functions to deal with disruption and uncertainty. Also, this study examines how non-hub nodes are assigned to hub nodes.


1975 ◽  
Vol 22 (1) ◽  
pp. 57-65 ◽  
Author(s):  
George O. Wesolowsky ◽  
William G. Truscott

ECONOMICS ◽  
2019 ◽  
Vol 7 (1) ◽  
pp. 51-59
Author(s):  
Juraj Pekár ◽  
Ivan Brezina ◽  
Zuzana Čičková

Abstract The paper is focused on specific location-allocation problem aimed to determine a set of scrap yards for vehicle decommissioning in Slovakia. The goal is to determine scrap yards network so that it is not prohibitive to pass old car for dismantling and further processing wherever former owner lives. Two approaches are considered. Once we consider the case when it is necessary to construct a completely new network of scrap yards, which results to setting of their minimum numbers and also their location. In the latter case, the already existing network of scrap yards is considered, while the model provides its extension, in order to achieve the desired values of accessibility for all residents. The results were applied to an existing network of scrap yards identifying locations to build new scrap yards. Areas where whole new network of scrap yards must be built were also identified.


2019 ◽  
Vol 4 (4) ◽  
pp. 388-394
Author(s):  
Viktoriia Hmyria ◽  
Volodymyr Kucherenko

The purpose of this article is to study theoretical approaches to definition of economic essence and directions of economic security formation of agricultural enterprises at the region. Analysis of the theoretical aspects of the research object showed that currently there are different approaches to the economic security interpretation. It is considered as a process of meeting public needs, protection of the state national interests, the economic stability, the state of the national economy, in which the country provides its own sovereignty, a state of protection against a certain type of threats. Certain realities of a market economy require fundamentally new approaches to the formation and increase of efficiency of agrarian enterprises functioning, and thus ensure their economic security. In the context of the latest scientific dimensions, it has been found out that the economic security of agricultural enterprises is a complex of sophisticated branched processes, methods, tools, levers of influence on activity for the purpose of increasing the economic activity efficiency of entities. In this regard, there are many different opinions regarding the components of enterprises economic security. It is proved that the economic security of enterprises is characterized by certain features in the regional context of their formation and application to agricultural production. The proposed model Forsythe technology modernization of a regional mechanism of economic security at agricultural enterprises will allow to determine more efficiently the volume of investments, tax burden on the industry and crediting enterprises, depreciation deductions etc., which will effectively affect the activity of the enterprise, and hence ensure its basic – economic security. Keywords: economic security, agricultural enterprises, natural conditions, fixed assets, information security, financial security.


2019 ◽  
Vol 37 (6) ◽  
pp. 929-951 ◽  
Author(s):  
Laurent Remy ◽  
Dragan Ivanović ◽  
Maria Theodoridou ◽  
Athina Kritsotaki ◽  
Paul Martin ◽  
...  

Purpose The purpose of this paper is to boost multidisciplinary research by the building of an integrated catalogue or research assets metadata. Such an integrated catalogue should enable researchers to solve problems or analyse phenomena that require a view across several scientific domains. Design/methodology/approach There are two main approaches for integrating metadata catalogues provided by different e-science research infrastructures (e-RIs): centralised and distributed. The authors decided to implement a central metadata catalogue that describes, provides access to and records actions on the assets of a number of e-RIs participating in the system. The authors chose the CERIF data model for description of assets available via the integrated catalogue. Analysis of popular metadata formats used in e-RIs has been conducted, and mappings between popular formats and the CERIF data model have been defined using an XML-based tool for description and automatic execution of mappings. Findings An integrated catalogue of research assets metadata has been created. Metadata from e-RIs supporting Dublin Core, ISO 19139, DCAT-AP, EPOS-DCAT-AP, OIL-E and CKAN formats can be integrated into the catalogue. Metadata are stored in CERIF RDF in the integrated catalogue. A web portal for searching this catalogue has been implemented. Research limitations/implications Only five formats are supported at this moment. However, description of mappings between other source formats and the target CERIF format can be defined in the future using the 3M tool, an XML-based tool for describing X3ML mappings that can then be automatically executed on XML metadata records. The approach and best practices described in this paper can thus be applied in future mappings between other metadata formats. Practical implications The integrated catalogue is a part of the eVRE prototype, which is a result of the VRE4EIC H2020 project. Social implications The integrated catalogue should boost the performance of multi-disciplinary research; thus it has the potential to enhance the practice of data science and so contribute to an increasingly knowledge-based society. Originality/value A novel approach for creation of the integrated catalogue has been defined and implemented. The approach includes definition of mappings between various formats. Defined mappings are effective and shareable.


Sign in / Sign up

Export Citation Format

Share Document