The Big Data Era

Author(s):  
Maria K. Krommyda ◽  
Verena Kantere

Large datasets pertaining to many scientific fields and everyday activities are becoming available at an increasing rate. Processing, analyzing, and understanding the information that they offer poses significant technical challenges. There are many efforts dedicated to the development of big data exploration, analysis, and visualization applications that will improve the value of the information extracted from these datasets. An analysis of the state-of-the-art in these applications is presented here along with open research challenges that have not yet been tackled sufficiently. Also, specific domains where big data applications are needed are presented, and unique challenges are identified.

2021 ◽  
Vol 16 (2) ◽  
pp. 111-135
Author(s):  
Emilio M. Sanfilippo

Information entities are used in ontologies to represent engineering technical specifications, health records, pictures, or librarian data about, e.g., narrative fictions, among others. The literature in applied ontology lacks a comparison of the state of the art, and foundational questions on the nature of information entities remain open for research. The purpose of the paper is twofold. First, to compare existing ontologies both with each other and with theories proposed in philosophy, semiotics, librarianship, and literary studies in order to understand how the ontologies conceive and model information entities. Second, to discuss some open research challenges that can lead to principled approaches for the treatment of information entities, possibly by taking into account the variety of information entity types found in the literature.


2021 ◽  
Vol 54 (5) ◽  
pp. 1-34
Author(s):  
Maya Dotan ◽  
Yvonne-Anne Pignolet ◽  
Stefan Schmid ◽  
Saar Tochner ◽  
Aviv Zohar

Blockchains in general, and cryptocurrencies such as Bitcoin in particular, are realized using distributed systems and hence critically rely on the performance and security of the interconnecting network. The requirements on these networks and their usage, however, can differ significantly from traditional communication networks, with implications for all layers of the protocol stack. This article is motivated by these differences and, in particular, by the observation that many fundamental design aspects of these networks are not well understood today. To help the networking community contribute to this emerging application domain, we present a structured overview of the field, from topology and neighbor discovery, through block and transaction propagation, to sharding and off-chain networks, also reviewing existing empirical results from different measurement studies. In particular, for each of these domains, we provide the context, highlighting differences and commonalities with traditional networks, review the state of the art, and identify open research challenges. Our article can hence also be seen as a call to arms to improve the foundation on top of which blockchains are built.


Author(s):  
Georgios Skourletopoulos ◽  
Constandinos X. Mavromoustakis ◽  
George Mastorakis ◽  
Jordi Mongay Batalla ◽  
Ciprian Dobre ◽  
...  

Author(s):  
Akrati Saxena ◽  
Harita Reddy

Online informal learning and knowledge-sharing platforms, such as Stack Exchange, Reddit, and Wikipedia, have been a great source of learning. Millions of people access these websites to ask questions, answer questions, view answers, or check facts. However, one question that has long attracted researchers is whether all users contribute equally on these portals and, if not, how the contribution varies and is distributed across users. Do different users focus on different kinds of activities and play specific roles? In this work, we present a survey of users’ social roles that have been identified on online discussion and Q&A platforms, including Usenet newsgroups, Reddit, Stack Exchange, and MOOC forums, as well as on crowdsourced encyclopedias, such as Wikipedia and Baidu Baike, where users interact with each other through talk pages. We discuss the state of the art on capturing the variety of users’ roles through different methods, including the construction of user networks, analysis of content posted by users, temporal analysis of user activity, posting frequency, and so on. We also discuss the available datasets and APIs to collect the data from these platforms for further research. The survey concludes with open research questions.


Author(s):  
Jing Yang ◽  
Quan Zhang ◽  
Kunpeng Liu ◽  
Peng Jin ◽  
Guoyi Zhao

In recent years, electricity big data has found extensive applications in grid companies across the provinces. However, certain problems are encountered, including the inability to generate an ideal model using the isolated data possessed by each company, and priority concerns for data privacy and safety during big data application and sharing. To address these, the present research explores the application of federated learning to protect the local data and to build a uniform model for the different companies affiliated with the State Grid. Federated learning can serve as an essential means for realizing the grid-wide promotion of the achievements of big data applications while ensuring data safety.


Author(s):  
Rafal Cupek ◽  
Marek Drewniak ◽  
Marcin Fojcik ◽  
Erik Kyrkjebø ◽  
Jerry Chun-Wei Lin ◽  
...  

2018 ◽  
Vol 44 (4) ◽  
pp. 651-658
Author(s):  
Ralph Weischedel ◽  
Elizabeth Boschee

Though information extraction (IE) research has more than a 25-year history, F1 scores remain low. Thus, one could question continued investment in IE research. In this article, we present three applications where information extraction of entities, relations, and/or events has been used, and note the common features that seem to have led to success. We also identify key research challenges whose solution seems essential for broader successes. Because a few practical deployments already exist and because breakthroughs on particular challenges would greatly broaden the technology’s deployment, further R&D investments are justified.


2019 ◽  
Vol 18 (5) ◽  
pp. 3049-3082
Author(s):  
Nelly Bencomo ◽  
Sebastian Götz ◽  
Hui Song
