The Design and Implementation of Script Authoring Assistant System of Film and Television Big Data

Author(s):  
Mengyu Liu ◽  
Wenqian Shang ◽  
Jianxiang Cao ◽  
Chan Pan ◽  
Weiguo Lin ◽  
...  
2019 ◽  
Vol 4 (2) ◽  
pp. 207-220
Author(s):  
김기수 ◽  
Yukun Hahm ◽  
장유림 ◽  
Jaejin Yi ◽  
HONGHOI KIM

2018 ◽  
Vol 18 (03) ◽  
pp. e23 ◽  
Author(s):  
María José Basgall ◽  
Waldo Hasperué ◽  
Marcelo Naiouf ◽  
Alberto Fernández ◽  
Francisco Herrera

The volume of data in today's applications has meant a change in the way Machine Learning issues are addressed. Indeed, the Big Data scenario involves scalability constraints that can only be achieved through intelligent model design and the use of distributed technologies. In this context, solutions based on the Spark platform have established themselves as a de facto standard. In this contribution, we focus on a very important framework within Big Data Analytics, namely classification with imbalanced datasets. The main characteristic of this problem is that one of the classes is underrepresented, and therefore it is usually more complex to find a model that identifies it correctly. For this reason, it is common to apply preprocessing techniques such as oversampling to balance the distribution of examples in classes. In this work we present SMOTE-BD, a fully scalable preprocessing approach for imbalanced classification in Big Data. It is based on one of the most widespread preprocessing solutions for imbalanced classification, namely the SMOTE algorithm, which creates new synthetic instances according to the neighborhood of each example of the minority class. Our novel development is made to be independent of the number of partitions or processes created to achieve a higher degree of efficiency. Experiments conducted on different standard and Big Data datasets show the quality of the proposed design and implementation.


The demand for energy is increasing rapidly and, after a few years, it may surpass the available energy, which may lead the energy providers to increase the cost of energy consumption to compensate the cost for the production. This paper provides design and implementation details of a prototype big data application developed to help large buildings to automatically manage their energy consumption by setting energy consumption targets, collecting periodic energy consumption data, storing the data streams, displaying the energy consumption graphically in real-time, analyzing the consumption patterns, and generating energy consumption graphs and reports. The application is connected to Mongo NoSQL backend database to handle the large and continuously changing data. This big data energy consumption management system is expected to help the users in managing energy consumption by analyzing the patterns to see if it is within or above the desired consumption targets and displaying the data graphically.


Sign in / Sign up

Export Citation Format

Share Document