A Hierarchical Hadoop Framework to Handle Big Data in Geo-Distributed Computing Environments

Author(s):  
Orazio Tomarchio ◽  
Giuseppe Di Modica ◽  
Marco Cavallo ◽  
Carmelo Polito

Advances in communication technologies, along with the birth of new communication paradigms leveraging the power of the social, have fostered the production of huge amounts of data. Old-fashioned computing paradigms are unfit to handle the volume of data produced daily by countless, worldwide distributed sources of information. So far, MapReduce has kept its promise of speeding up computation over Big Data within a cluster. This article focuses on scenarios in which Big Data is distributed worldwide. After pointing out the poor performance of the Hadoop framework when deployed in such scenarios, it proposes a Hierarchical Hadoop Framework (H2F) to cope with the issues that arise when Big Data are scattered over geographically distant data centers. The article highlights the novelty that H2F introduces with respect to other hierarchical approaches. Tests run on a software prototype are also reported to show the performance increase that H2F achieves in geographical scenarios over a plain Hadoop approach.
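The general idea behind a hierarchical approach can be sketched in a few lines. The abstract does not describe how H2F schedules or combines jobs, so the following is only an illustrative sketch under stated assumptions: each site runs its own local job over the data it holds, and only the compact partial results travel to a top-level step that merges them. The function names (`local_job`, `global_reduce`), the site names, and the word-count task are all hypothetical.

```python
from collections import Counter

def local_job(records):
    """Hypothetical per-site job: map and reduce over one site's local records."""
    counts = Counter()
    for line in records:
        counts.update(line.lower().split())
    return counts

def global_reduce(partials):
    """Hypothetical top-level step: merge the partial results from every site."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total

# Assumed toy data: two geographically distant sites, each holding its own records.
sites = {
    "site_a": ["big data", "data centers"],
    "site_b": ["big clusters", "data"],
}

# Only the small per-site counters cross site boundaries, not the raw records.
partials = [local_job(records) for records in sites.values()]
result = global_reduce(partials)
print(result.most_common())
```

The point of the sketch is the data movement: raw records stay where they are, and only aggregated counts are exchanged between sites.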


Author(s):  
A S Mukhin ◽  
I A Rytsarev ◽  
R A Paringer ◽  
A V Kupriyanov ◽  
D V Kirsh

The article is devoted to the definition of such groups in social networks. The object of the study was data from the social network Vk. Text data were collected, processed and analyzed. To obtain the necessary information, research was conducted on optimizing data collection from Vk. A software tool that collects and subsequently processes the necessary data from the specified resources has been developed. Existing text-analysis algorithms, mainly those for large volumes of text, were investigated and applied.


2021 ◽  
Vol 3 (6) ◽  
pp. 131-145
Author(s):  
Agarwal Harshita ◽  
Poulomi Sen

In the twenty-first century which is regarded as the dawn of the social media age, the disputants, as well as the legal professionals such as advocates and judges, embrace the information available at their disposal on several social media platforms. It has altered the conduct of arbitration by changing the way disputants communicate. Being the modern tool for communication, it has elevated the speed and dissemination of information, which allows audiences to follow the dispute and express their support or dissatisfaction towards the disputants. As a consequence, the parties seeking redressal of their grievances through ADR get influenced due to the formation of ‘unconscious bias’. Communication is the epitome of the dispute resolution process, and the intervention of social media in the process generates a ghost syndrome, thus, resulting in the fading of such epitome. Its impact is not restricted to the parties but has the potential to undermine the independence, integrity, and impartiality of the judge or the mediator. Social Media has become significant within the legal domain as technology penetrates all ambits of individual endeavors. Looking towards the positive contributions, it acts as a source of evidence, especially in employment and labor disputes. Transformations in communication technologies have altered the definition of power in international arbitration, the class of individuals participating in the process, and strategies employed to mediate the conflict. The paper intends to discuss the elite usage and manipulation of social media impacting ADR, the cases influenced by it, and the theoretical framework required for its conduct.


2021 ◽  
Vol 8 (2) ◽  
pp. 160
Author(s):  
Xinzhao Pang

In the context of the vigorous development of big data and network communication technologies, the universality of personal data processing, the openness of concept definitions, and the potential risks in practice have led to theoretical and practical changes in the definition of personal data information. With the widespread use of big data technology in criminal investigations, the theory and practice of big data investigation have gradually taken shape. Big data investigation activities are often accompanied by infringements on citizens' personal data and other legitimate rights and interests. At present, the traditional model of personal data protection cannot constrain conduct in big data investigation activities. For this reason, it is necessary to introduce a comprehensive governance model, which mainly includes relative control by individuals, a balance of multiple interests and dynamic risk adjustment, and to focus on transforming the legal protection model of personal information.


2020 ◽  
Vol 20 (2) ◽  
pp. 239-251
Author(s):  
N. V. Plotichkina

The article considers the media mythologization of the social: the media create representations of the world as a place for power struggle and suggest various versions of sociality which are legitimized in mythical narratives. Academic arguments emphasize the complexity of the social, the necessity of its theorization and dereification in the digital context. The study of the social/media dialectic reveals three myths - of natural collectivity, of the mediated center and of big data. The myth of the mediated center is a double construct: a narrative about the center of knowledge, values and meanings in society which produces normative or descriptive truth; and a mythical narrative of the media as representing society to its members and having a privileged access to the center of the social reality. This myth legitimizes the symbolic power of the mass media and is supported by rituals. The myth of us aims at naturalizing the network sociality, convincing users of the naturalness of its contacts, interactions and communities, providing the sense of cohesion and constructing a digital identity. The myth of us is a narrative about collectivity determined by the joint efforts of users and designers of social platforms. This myth hides the decay of the social in the digital reality and indicates its media-mythical compensation. Big data represent new landscapes of objects, methods of cognition and definition of sociality. Big data is not only a source of knowledge, innovation and change but also a mythology which should be critically examined. The article considers different approaches to the analysis of big data mythology developed within the interpretation of digital artifacts as a reflection of the social context and the best form of social knowledge. 
This myth presents its version of sociology - with new epistemology, ethics and methodology - and hides other sources of knowledge about sociality in the digital media (mythologizes data policy (agency), economies, ethics and epistemologies).


Author(s):  
Rafael Marques ◽  

In this text we propose an allegoric reading of Georg Simmel’s text “The Poor”, concentrating our analysis on the interactional, dialogical and reciprocal dimensions that constitute the poor as a social category and impose certain character traits and social roles on them. Signaling the continuities between alms giving and the modern Welfare State, we propose a cynical look at the social processes that defined the poor as a non-existence or as a pretext for the moral enrichment of the givers and helpers. We trace a long process that started with the definition of poverty as pathology and culminated in the buildup of an identity of the poor according to statistical categories and rational classificatory systems. The poor is characterized as a combinatorial type, receiving its meaning from the association with other concepts and as an absence of a value in itself.


2008 ◽  
Vol 05 (03) ◽  
pp. 449-462 ◽  
Author(s):  
N. B. PETROVSKAYA

The least-squares (LS) method is often used in computational aerodynamics to reconstruct a given function at certain points of a computational grid. In this paper we discuss the accuracy of the LS approximation on the highly stretched meshes that are inherent in computational aerodynamics. A new definition of a distant point in an LS reconstruction stencil is given in order to explain the poor performance of the method in a boundary layer region. Namely, based on the concept of outliers widely used in statistics, we demonstrate that the definition of a distant point in an LS reconstruction stencil should take into account the solution properties and cannot rely on the geometric shape of the stencil alone. Our approach is illustrated with numerical examples.
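For readers unfamiliar with LS reconstruction, the sketch below shows the standard unweighted least-squares gradient fit on a stencil. It is not the paper's method and does not implement its distant-point definition. The function name `ls_gradient`, the stencil coordinates, and the linear test function are assumptions chosen for illustration. The fit minimises the sum of squared residuals of u_i - u_0 - a*dx_i - b*dy_i over the stencil and solves the resulting 2x2 normal equations.

```python
def ls_gradient(center, u_center, neighbours):
    """Unweighted least-squares gradient (a, b) at `center`.

    `neighbours` is a list of ((x, y), u) pairs. The normal equations
        Sxx*a + Sxy*b = Sxu
        Sxy*a + Syy*b = Syu
    are solved with Cramer's rule.
    """
    x0, y0 = center
    sxx = sxy = syy = sxu = syu = 0.0
    for (x, y), u in neighbours:
        dx, dy, du = x - x0, y - y0, u - u_center
        sxx += dx * dx
        sxy += dx * dy
        syy += dy * dy
        sxu += dx * du
        syu += dy * du
    det = sxx * syy - sxy * sxy
    if det == 0.0:
        raise ValueError("degenerate stencil: neighbours are collinear")
    a = (sxu * syy - sxy * syu) / det
    b = (sxx * syu - sxy * sxu) / det
    return a, b

def u_linear(x, y):
    """Assumed test function with known gradient (2, 3)."""
    return 2.0 * x + 3.0 * y

# Assumed stretched stencil: spacing 1 in x, spacing 1e-3 in y.
points = [(1.0, 0.0), (-1.0, 0.0), (0.0, 1e-3), (0.0, -1e-3), (1.0, 1e-3)]
stencil = [(p, u_linear(*p)) for p in points]
grad = ls_gradient((0.0, 0.0), u_linear(0.0, 0.0), stencil)
print(grad)
```

For a linear function the fit recovers the gradient up to rounding error, even on a stretched stencil. The accuracy issues the abstract refers to concern solutions that are not linear across the stencil, which this sketch does not attempt to reproduce.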


E-Management ◽  
2019 ◽  
Vol 2 (3) ◽  
pp. 29-37
Author(s):  
V. Dovzhik ◽  
G. Dovzhik ◽  
T. Fedyanina

The concept of “Big data” as a way of processing huge amounts of data has been considered, the principles of working with big data have been formulated, the techniques of electronic marketing have been reviewed. The definition of targeting has been given, its possibilities have been described. The sources of information for the General targeting field have been presented. The concept of hyperlocal targeting has been interpreted. The specificity of the use of e-marketing technologies depending on the goals of promotion and the ways of creating individual sales tunnels, the so-called “narrow entry” technique, have been investigated. The content essence and possibilities of using the technology of detailed marketing have been revealed. On the example of using the technology of detailed targeting by means of automatic redirection of users from one URL to another (redirect) and on the basis of the obtained data the formation of a portrait of a potential client and segmentation of target audiences an algorithm for the use of detailed targeting technology has been developed. The process of segmentation and creation of a typical portrait of a representative of the target audience has been shown. On the basis of the developed portraits of representatives of the target audience, the client’s path design was created. Herewith, the standard logical chains of customer involvement in the process of making a purchase, namely, attracting a potential buyer, increasing his level of interest in the product and the final sale, have been described. The most significant technological difficulties have been studied, consisting primarily in the simultaneous connection of a variety of platforms and modules necessary for the automation of processes and the collection of statistics for analytical work. 
The list of tools used in the implementation of detailed targeting technology has been presented: LeeLoo.ai, AmoCRM, LPGenerator, Yandex.Metrics. The logic of expanding the applied tools of electronic marketing for different target audiences, with the aim of increasing conversion and, accordingly, saving the advertising budget, has been described.


Author(s):  
Kiran Kumar S V N Madupu

Big Data has a tremendous influence on scientific discovery and on value creation. This paper presents approaches to data mining and modern Big Data technologies. The difficulties of data mining, and of data mining with big data in particular, are discussed. Some technological developments in data mining, and in data mining with big data, are also presented.

