scholarly journals IR-TEx: An Open Source Data Integration Tool for Big Data Transcriptomics Designed for the Malaria Vector Anopheles gambiae

Author(s):  
Victoria A. Ingham ◽  
Andrew Bennett ◽  
Duo Peng ◽  
Simon C. Wagstaff ◽  
Hilary Ranson
2016 ◽  
Vol 07 (03) ◽  
pp. 31-33
Author(s):  
ATIF AZIZ ◽  
◽  
RAJEEV ARYA ◽  
SANA SHAFIQUE ◽  
◽  
...  

Big Data is a term used to represent huge volume of both unstructured and structured data which cannot be processed by the traditional data processing techniques. This data is too huge, grows exponentially and doesn't fit into the structure of the traditional database systems. Analyzing Big Data is a very challenging task since it involves the processing of huge amount of data. As the industry or its business grows, the data related to the industries also tend to grow on a larger scale. Prominent data analysis tools are required to analyze the data in order to gain value out of it. Hadoop is a sought-after open source framework that uses MapReduce techniques to store and process huge datasets. However, the programs written using MapReduce techniques are not flexible and also require maintenance. This problem is overcome by making use of HiveQL. In order to execute queries in HiveQL, the platform required is Hive. It is an open-source data warehousing set-up built on Hadoop. HiveQL queries are compiled into MapReduce jobs that are executed utilizing Hadoop. In this paper we have analyzed the Indian Premier League dataset using HiveQL and compared its execution time with that of traditional SQL queries. It was found that the HiveQL provided better performance with larger dataset while SQL performed better with smaller datasets


Author(s):  
Richard S. Segall

This chapter discusses Open Source Software and associated technologies for the processing of Big Data. This includes discussions of Hadoop-related projects, the current top open source data tools and frameworks such as SMACK that is acronym for open source technologies Spark, Mesos, Akka, Cassandra, and Kafka that together compose the ingestion, aggregation, analysis, and storage layers for Big Data processing. Tabular summaries and categories for 38 Open Source Statistical Software (OSSS) are provided that include for each listing of features and URLs for free downloads. The current challenges of Big Data and Open Source Software are also discussed.


Author(s):  
Richard S. Segall

This chapter discusses Open Source Software and associated technologies for the processing of Big Data. This includes discussions of Hadoop-related projects, the current top open source data tools and frameworks such as SMACK that is acronym for open source technologies Spark, Mesos, Akka, Cassandra, and Kafka that together compose the ingestion, aggregation, analysis, and storage layers for Big Data processing. Tabular summaries and categories for 38 Open Source Statistical Software (OSSS) are provided that include for each listing of features and URLs for free downloads. The current challenges of Big Data and Open Source Software are also discussed.


2018 ◽  
Vol 80 (6) ◽  
pp. 457-461
Author(s):  
Carlos A. Morales-Ramirez ◽  
Pearlyn Y. Pang

Open-source data are information provided free online. It is gaining popularity in science research, especially for modeling species distribution. MaxEnt is an open-source software that models using presence-only data and environmental variables. These variables can also be found online and are generally free. Using all of these open-source data and tools makes species distribution modeling (SDM) more accessible. With the rapid changes our planet is undergoing, SDM helps understand future habitat suitability for species. Due to increasing interest in biogeographic research, SDM has increased for marine species, which were previously not commonly found in this modeling. Here we provide examples of where to obtain the data and how the modeling can be performed and taught.


2018 ◽  
Vol 231 ◽  
pp. 1100-1108 ◽  
Author(s):  
Alaa Alhamwi ◽  
Wided Medjroubi ◽  
Thomas Vogt ◽  
Carsten Agert

Sign in / Sign up

Export Citation Format

Share Document