heterogeneous data Latest Research Papers

Hyperspherical Variational Co-embedding for Attributed Networks

ACM Transactions on Information Systems ◽

10.1145/3478284 ◽

2022 ◽

Vol 40 (3) ◽

pp. 1-36

Author(s):

Jinyuan Fang ◽

Shangsong Liang ◽

Zaiqiao Meng ◽

Maarten De Rijke

Keyword(s):

Euclidean Space ◽

Poor Performance ◽

Building Blocks ◽

Heterogeneous Data ◽

User Profiling ◽

Network Embedding ◽

Stream Network ◽

Attributed Network ◽

Von Mises ◽

Attributed Networks

Network-based information has been widely explored and exploited in the information retrieval literature. Attributed networks, consisting of nodes, edges, and attributes describing properties of nodes, are a basic type of network-based data, and are especially useful for many applications. Examples include user profiling in social networks and item recommendation in user-item purchase networks. Learning useful and expressive representations of entities in attributed networks can provide more effective building blocks for downstream network-based tasks such as link prediction and attribute inference. Practically, input features of attributed networks are normalized as unit directional vectors. However, most network embedding techniques ignore the spherical nature of inputs and focus on learning representations in a Gaussian or Euclidean space, which, we hypothesize, might lead to less effective representations. To obtain more effective representations of attributed networks, we investigate the problem of mapping an attributed network with unit normalized directional features into a non-Gaussian and non-Euclidean space. Specifically, we propose a hyperspherical variational co-embedding for attributed networks (HCAN), which is based on generalized variational auto-encoders for heterogeneous data with multiple types of entities. HCAN jointly learns latent embeddings for both nodes and attributes in a unified hyperspherical space such that the affinities between nodes and attributes can be captured effectively. We argue that this is a crucial feature in many real-world applications of attributed networks. Previous Gaussian network embedding algorithms break the assumption of an uninformative prior, which leads to unstable results and poor performance. In contrast, HCAN embeds nodes and attributes as von Mises-Fisher distributions, and allows one to capture the uncertainty of the inferred representations. Experimental results on eight datasets show that HCAN yields better performance in a number of applications compared with nine state-of-the-art baselines.
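
To make the hyperspherical setting in this abstract concrete, the following minimal sketch normalizes feature vectors to unit length and evaluates a von Mises-Fisher log-density on the unit sphere in three dimensions, where the normalizing constant has a simple closed form. It is not HCAN and not the authors' code; the function names `unit_normalize` and `vmf_log_density_3d`, the toy vectors, and the concentration value are assumptions chosen for illustration.

```python
import numpy as np

def unit_normalize(x):
    """Scale each vector (last axis) to unit L2 norm; vectors must be non-zero."""
    x = np.asarray(x, dtype=float)
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

def vmf_log_density_3d(x, mu, kappa):
    """Log-density of a von Mises-Fisher distribution on the unit sphere in R^3.

    Uses the closed-form constant C(kappa) = kappa / (4 * pi * sinh(kappa)),
    valid for kappa > 0, rewritten to avoid overflow for large kappa.
    """
    log_c = (np.log(kappa) - np.log(2.0 * np.pi) - kappa
             - np.log1p(-np.exp(-2.0 * kappa)))
    return log_c + kappa * np.dot(x, mu)

# Hypothetical toy inputs: two feature vectors and one mean direction.
features = unit_normalize([[3.0, 0.0, 4.0], [0.0, 2.0, 0.0]])
mu = unit_normalize([1.0, 0.0, 1.0])
log_p = vmf_log_density_3d(features, mu, kappa=5.0)
print(log_p)  # the first vector lies closer to mu, so its log-density is higher
```

A larger `kappa` concentrates the density around `mu`, which is how a distribution of this kind can express more or less uncertainty about a direction.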

Efficient Training with Heterogeneous Data Distribution

10.1017/9781108955959.008 ◽

2022 ◽

pp. 98-111

Keyword(s):

Data Distribution ◽

Heterogeneous Data

Energy-efficient client selection in federated learning with heterogeneous data on edge

Peer-to-Peer Networking and Applications ◽

10.1007/s12083-021-01254-8 ◽

2022 ◽

Author(s):

Jianxin Zhao ◽

Yanhao Feng ◽

Xinyu Chang ◽

Chi Harold Liu

Keyword(s):

Energy Efficient ◽

Heterogeneous Data

Research on the Promotion of New Energy Vehicles Based on Multi-Source Heterogeneous Data

10.21203/rs.3.rs-1185117/v1 ◽

2022 ◽

Author(s):

Bing Sun ◽

Zhuofang Ju

Keyword(s):

Network Analysis ◽

Latent Dirichlet Allocation ◽

Topic Model ◽

Heterogeneous Data ◽

Collaboration Network ◽

Power Performance ◽

Cooperative Research ◽

Econometric Methods ◽

New Energy ◽

New Energy Vehicles

Against the background of green development, new energy vehicles (NEVs), as an important strategic emerging industry, play a crucial role in energy conservation and emission reduction. In the post-epidemic era, steadily advancing the adoption of NEVs will be a hot topic. Based on multi-source heterogeneous data, combined with the Latent Dirichlet Allocation (LDA) topic model, Social Network Analysis (SNA), and econometric methods, this paper explores whether individual purchase decisions and company-level cooperative research and development promote the adoption of new energy vehicles. The results show that, whether for BEV, HEV, or PHEV models, users are more concerned about the space dimension, power performance, and design style; patent collaboration network analysis indicates that NEV enterprises are establishing close partnerships, which will accelerate the promotion of NEVs; and for BEV and HEV models, new energy vehicle companies will invest in more patents, and R&D investment will better expedite the advancement of NEVs.

Peridynamic simulation of the mechanical responses and fracturing behaviors of granite subjected to uniaxial compression based on CT heterogeneous data

Engineering With Computers ◽

10.1007/s00366-021-01549-7 ◽

2022 ◽

Author(s):

Kai Feng ◽

Xiao-Ping Zhou

Keyword(s):

Uniaxial Compression ◽

Heterogeneous Data ◽

Mechanical Responses

Handling qualitative preferences in SPARQL over virtual ontology-based data access

Semantic Web ◽

10.3233/sw-212895 ◽

2022 ◽

pp. 1-24

Author(s):

Marlene Goncalves ◽

David Chaves-Fraga ◽

Oscar Corcho

Keyword(s):

Open Data ◽

Scoring Function ◽

Data Access ◽

Heterogeneous Data ◽

Database Management System ◽

Query Complexity ◽

Distribution Data ◽

Preference Queries ◽

Preference Criteria ◽

Qualitative Preferences

With the increase of data volume in heterogeneous datasets that are being published following Open Data initiatives, new operators are necessary to help users find the subset of data that best satisfies their preference criteria. Quantitative approaches such as top-k queries may not be the most appropriate, as they require the user to assign weights to a scoring function, and these weights may not be known beforehand. Under the qualitative approach, which includes the well-known skyline, preference criteria are more intuitive in certain cases and can be expressed more naturally. In this paper, we address the problem of evaluating SPARQL qualitative preference queries over an Ontology-Based Data Access (OBDA) approach, which provides uniform access over multiple and heterogeneous data sources. Our main contribution is Morph-Skyline++, a framework for processing SPARQL qualitative preferences by directly querying relational databases. Our framework implements a technique that translates SPARQL qualitative preference queries directly into queries that can be evaluated by a relational database management system. We evaluate our approach over different scenarios, reporting the effects of data distribution, data size, and query complexity on the performance of our proposed technique in comparison with state-of-the-art techniques. The results suggest that execution time can be reduced by up to two orders of magnitude compared with current techniques, and that the approach scales to larger datasets while identifying the result set precisely.
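
The skyline mentioned in this abstract can be illustrated with a small in-memory sketch: a point belongs to the skyline when no other point is at least as good on every criterion and strictly better on at least one. This shows only the preference semantics, not Morph-Skyline++ or any SPARQL-to-SQL translation; the helper names and the hotel data are hypothetical, and lower values are assumed to be preferred on both criteria.

```python
def dominates(a, b):
    """True if a is no worse than b on every criterion and strictly better
    on at least one (lower values are preferred here)."""
    return (all(x <= y for x, y in zip(a, b))
            and any(x < y for x, y in zip(a, b)))

def skyline(points):
    """Return the points that no other point dominates (naive O(n^2) scan)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

# Hypothetical (price, distance_km) pairs; both criteria are minimized.
hotels = [(50, 8), (60, 5), (80, 2), (70, 6), (90, 3)]
result = skyline(hotels)
print(result)  # [(50, 8), (60, 5), (80, 2)]
```

Unlike a top-k query, no weights are needed: (70, 6) drops out because (60, 5) is both cheaper and closer, and (90, 3) drops out because of (80, 2).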

Microestimates of wealth for all low- and middle-income countries

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.2113658119 ◽

2022 ◽

Vol 119 (3) ◽

pp. e2113658119

Author(s):

Guanghua Chi ◽

Han Fang ◽

Sourav Chatterjee ◽

Joshua E. Blumenstock

Keyword(s):

Survey Data ◽

Household Survey ◽

Heterogeneous Data ◽

Machine Learning Algorithms ◽

Middle Income ◽

Middle Income Countries ◽

Economic Development And Growth ◽

Household Survey Data ◽

Nationally Representative ◽

Low And Middle Income

Many critical policy decisions, from strategic investments to the allocation of humanitarian aid, rely on data about the geographic distribution of wealth and poverty. Yet many poverty maps are out of date or exist only at very coarse levels of granularity. Here we develop microestimates of the relative wealth and poverty of the populated surface of all 135 low- and middle-income countries (LMICs) at 2.4 km resolution. The estimates are built by applying machine-learning algorithms to vast and heterogeneous data from satellites, mobile phone networks, and topographic maps, as well as aggregated and deidentified connectivity data from Facebook. We train and calibrate the estimates using nationally representative household survey data from 56 LMICs and then validate their accuracy using four independent sources of household survey data from 18 countries. We also provide confidence intervals for each microestimate to facilitate responsible downstream use. These estimates are provided free for public use in the hope that they enable targeted policy response to the COVID-19 pandemic, provide the foundation for insights into the causes and consequences of economic development and growth, and promote responsible policymaking in support of sustainable development.

Hierarchical Federated Learning for Edge-Aided Unmanned Aerial Vehicle Networks

Applied Sciences ◽

10.3390/app12020670 ◽

2022 ◽

Vol 12 (2) ◽

pp. 670

Author(s):

Jamshid Tursunboev ◽

Yong-Sung Kang ◽

Sung-Bum Huh ◽

Dong-Woo Lim ◽

Jae-Mo Kang ◽

...

Keyword(s):

Unmanned Aerial Vehicle ◽

Critical Issue ◽

Heterogeneous Data ◽

Base Stations ◽

High Performing ◽

Shared Data ◽

Private Data ◽

Machine Learning Model ◽

Vehicle Networks ◽

Aerial Vehicle

Federated learning (FL) allows unmanned aerial vehicles (UAVs) to collaboratively train a globally shared machine learning model while keeping their private data local. Recently, FL in edge-aided UAV networks has drawn an upsurge of research interest due to a rapid increase in heterogeneous data acquired by UAVs and the need to build the global model while preserving privacy; however, a critical issue is how to deal with the non-independent and identically distributed (non-i.i.d.) nature of heterogeneous data while ensuring the convergence of learning. To address this issue, this paper proposes a novel and high-performing FL scheme, namely, the hierarchical FL algorithm, for the edge-aided UAV network, which exploits the edge servers located in base stations as intermediate aggregators while employing commonly shared data. Experimental results demonstrate that the proposed hierarchical FL algorithm outperforms several baseline FL algorithms and exhibits better convergence behavior.
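
The idea of edge servers acting as intermediate aggregators can be sketched as two rounds of sample-weighted averaging: first within each edge server's group of clients, then across edge servers. This is a generic averaging sketch under stated assumptions, not the algorithm proposed in the paper; it omits local training and the use of commonly shared data, and the function names and toy parameters are hypothetical.

```python
import numpy as np

def weighted_average(models, sizes):
    """Average parameter vectors, weighting each by its number of samples."""
    sizes = np.asarray(sizes, dtype=float)
    stacked = np.stack([np.asarray(m, dtype=float) for m in models])
    return (stacked * sizes[:, None]).sum(axis=0) / sizes.sum()

def hierarchical_aggregate(edge_groups):
    """Aggregate client models at each edge server, then across edge servers.

    edge_groups: list of groups; each group is a list of (params, n_samples).
    """
    edge_models, edge_sizes = [], []
    for group in edge_groups:
        params = [p for p, _ in group]
        sizes = [n for _, n in group]
        edge_models.append(weighted_average(params, sizes))
        edge_sizes.append(sum(sizes))
    return weighted_average(edge_models, edge_sizes)

# Hypothetical toy parameters: three UAV clients behind two edge servers.
edge_a = [([1.0, 1.0], 10), ([3.0, 3.0], 30)]
edge_b = [([5.0, 5.0], 60)]
global_model = hierarchical_aggregate([edge_a, edge_b])
print(global_model)  # [4. 4.]
```

Because each edge model carries the total sample count of its clients, the two-level result matches a single flat weighted average over all clients in this simple setting.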

Industrial analytics – An overview

it - Information Technology ◽

10.1515/itit-2021-0066 ◽

2022 ◽

Vol 0 (0) ◽

Author(s):

Christoph Gröger

Keyword(s):

Value Chain ◽

Data Science ◽

Heterogeneous Data ◽

Subject Area ◽

Essential Elements ◽

Sensor Data ◽

Future Research ◽

Success Factor ◽

Industrial Enterprises ◽

Industrial Value Chain

The digital transformation generates huge amounts of heterogeneous data across the industrial value chain, from simulation data in engineering, through sensor data in manufacturing, to telemetry data on product use. Extracting insights from these data constitutes a critical success factor for industrial enterprises, e.g., to optimize processes and enhance product features. This is referred to as industrial analytics, i.e., data analytics for industrial value creation. Industrial analytics is an interdisciplinary subject area between data science and industrial engineering and is at the core of Industry 4.0. Yet, existing literature on industrial analytics is fragmented and specialized. To address this issue, this paper presents a holistic overview of the field of industrial analytics, integrating both current research and industry experiences from real-world industrial analytics projects. We define key terms, describe typical use cases, and discuss characteristics of industrial analytics. Moreover, we present a conceptual framework for industrial analytics that structures essential elements, e.g., data platforms and data roles. Finally, we conclude and highlight future research directions.

Neural Matrix Factorization Recommendation for User Preference Prediction Based on Explicit and Implicit Feedback

Computational Intelligence and Neuroscience ◽

10.1155/2022/9593957 ◽

2022 ◽

Vol 2022 ◽

pp. 1-12

Author(s):

Huazhen Liu ◽

Wei Wang ◽

Yihan Zhang ◽

Renqian Gu ◽

Yaqi Hao

Keyword(s):

Neural Network ◽

Matrix Factorization ◽

Recommendation System ◽

Heterogeneous Data ◽

User Preference ◽

Personalized Recommendation ◽

Implicit Feedback ◽

Network Training ◽

Feedback Data ◽

Explicit Feedback

Explicit feedback and implicit feedback are two important types of heterogeneous data for constructing a recommendation system. Combining the two can effectively improve the performance of the recommendation system. However, most current deep learning recommendation models fail to fully exploit the complementary advantages of the two types of data and usually use only binary implicit feedback data. Thus, this paper proposes a neural matrix factorization recommendation algorithm (EINMF) based on explicit-implicit feedback. First, a neural network is used to learn nonlinear features of the explicit-implicit feedback from user-item interactions. Second, combined with traditional matrix factorization, explicit feedback is used to accurately reflect the explicit and potential preferences of users in order to build a recommendation model; a new loss function is designed based on explicit-implicit feedback to obtain the best parameters through neural network training and to predict users' preferences for items. Finally, according to the prediction results, a personalized recommendation list is pushed to the user. The feasibility, validity, and robustness are fully demonstrated in comparison with multiple baseline models on two real datasets.
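
The abstract does not state the form of the EINMF loss, so the following is only a generic stand-in showing one way explicit and implicit feedback might be combined in a single objective: a squared-error term on ratings plus a binary cross-entropy term on click signals, both computed from a matrix-factorization dot product. The function `combined_loss`, the mixing weight `alpha`, and the toy data are all assumptions for illustration, not the paper's method.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def combined_loss(user_vecs, item_vecs, ratings, clicks, alpha=0.5):
    """Blend an explicit-feedback loss with an implicit-feedback loss.

    user_vecs, item_vecs: (n, d) embeddings for n observed user-item pairs.
    ratings: explicit scores; clicks: binary implicit signals (0 or 1).
    alpha: hypothetical mixing weight between the two terms.
    """
    scores = np.sum(user_vecs * item_vecs, axis=1)   # dot product per pair
    mse = np.mean((ratings - scores) ** 2)           # explicit term
    probs = np.clip(sigmoid(scores), 1e-12, 1.0 - 1e-12)
    bce = -np.mean(clicks * np.log(probs)
                   + (1 - clicks) * np.log(1.0 - probs))  # implicit term
    return alpha * mse + (1.0 - alpha) * bce

# Hypothetical toy data: two user-item pairs with 2-dimensional embeddings.
users = np.zeros((2, 2))
items = np.zeros((2, 2))
loss = combined_loss(users, items,
                     ratings=np.array([2.0, 2.0]),
                     clicks=np.array([1.0, 0.0]))
print(loss)
```

In a trained model the embeddings would be learned by minimizing such a loss; here they are fixed at zero only so the two terms are easy to check by hand.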
