hadoop ecosystem
Recently Published Documents


TOTAL DOCUMENTS

88
(FIVE YEARS 21)

H-INDEX

8
(FIVE YEARS 3)

Author(s):  
Nibareke Thérence ◽  
Laassiri Jalal ◽  
Lahrizi Sara

2020 ◽  
pp. 153-165
Author(s):  
Rathinaraja Jeyaraj ◽  
Ganeshkumar Pugalendhi ◽  
Anand Paul
Keyword(s):  

Algorithms ◽  
2020 ◽  
Vol 13 (3) ◽  
pp. 71 ◽  
Author(s):  
Athanasios Alexopoulos ◽  
Georgios Drakopoulos ◽  
Andreas Kanavos ◽  
Phivos Mylonas ◽  
Gerasimos Vonitsanos

At the dawn of the 10V or big data data era, there are a considerable number of sources such as smart phones, IoT devices, social media, smart city sensors, as well as the health care system, all of which constitute but a small portion of the data lakes feeding the entire big data ecosystem. This 10V data growth poses two primary challenges, namely storing and processing. Concerning the latter, new frameworks have been developed including distributed platforms such as the Hadoop ecosystem. Classification is a major machine learning task typically executed on distributed platforms and as a consequence many algorithmic techniques have been developed tailored for these platforms. This article extensively relies in two ways on classifiers implemented in MLlib, the main machine learning library for the Hadoop ecosystem. First, a vast number of classifiers is applied to two datasets, namely Higgs and PAMAP. Second, a two-step classification is ab ovo performed to the same datasets. Specifically, the singular value decomposition of the data matrix determines first a set of transformed attributes which in turn drive the classifiers of MLlib. The twofold purpose of the proposed architecture is to reduce complexity while maintaining a similar if not better level of the metrics of accuracy, recall, and F 1 . The intuition behind this approach stems from the engineering principle of breaking down complex problems to simpler and more manageable tasks. The experiments based on the same Spark cluster indicate that the proposed architecture outperforms the individual classifiers with respect to both complexity and the abovementioned metrics.


Author(s):  
Arvind Panwar ◽  
Vishal Bhatnagar

Internet, & more unambiguously the creation of WWW in the early 1990s, helped people to build an interconnected global platform where information can be stored, shared, and consumed by anyone with an electronic device which has the ability to connect to the Web. This provides a way of putting together lots of information, ideas, and opinion. An interactive platform was born to post content, messages, and opinions under one roof, and the platform is known as social media. Social media has acquired massive popularity and importance that why today almost everyone can't stay away from it. Social media is not only a medium for people to express their thoughts, moreover, but it is also a very powerful tool which can be used by businesses to focus on new and existing customers and increase profit with the help of social media analytics. This paper starts with a discussion on social media with its significance & pitfalls. Later on, this paper presents a brief introduction of sentiment analysis in social media and give an experimental work on sentiment analysis in a social game review.


Sign in / Sign up

Export Citation Format

Share Document