Keyword Extraction Based on Selectivity and Generalized Selectivity

Author(s):  
Slobodan Beliga ◽  
Ana Meštrović ◽  
Sanda Martinčić-Ipšić

This chapter presents a novel Selectivity-Based Keyword Extraction (SBKE) method, which extracts keywords from the source text represented as a network. The node selectivity value is calculated from a weighted network as the average weight distributed on the links of a single node and is used in the procedure of keyword candidate ranking and extraction. The selectivity slightly outperforms an extraction based on the standard centrality measures. Therefore, the selectivity and its modification – generalized selectivity as the node centrality measures are included in the SBKE method. Selectivity-based extraction does not require linguistic knowledge as it is derived purely from statistical and structural information of the network and it can be easily ported to new languages and used in a multilingual scenario. The true potential of the proposed SBKE method is in its generality, portability and low computation costs, which positions it as a strong candidate for preparing collections which lack human annotations for keyword extraction.

Author(s):  
Slobodan Beliga ◽  
Ana Meštrović ◽  
Sanda Martinčić-Ipšić

In this work the authors propose a novel Selectivity-Based Keyword Extraction (SBKE) method, which extracts keywords from the source text represented as a network. The node selectivity value is calculated from a weighted network as the average weight distributed on the links of a single node and is used in the procedure of keyword candidate ranking and extraction. The authors show that selectivity-based keyword extraction slightly outperforms an extraction based on the standard centrality measures: in/out-degree, betweenness and closeness. Therefore, they include selectivity and its modification – generalized selectivity as node centrality measures in the SBKE method. Selectivity-based extraction does not require linguistic knowledge as it is derived purely from statistical and structural information of the network. The experimental results point out that selectivity-based keyword extraction has a great potential for the collection-oriented keyword extraction task.


2021 ◽  
Vol 11 (4) ◽  
pp. 498
Author(s):  
Marcello Zanghieri ◽  
Giulia Menichetti ◽  
Alessandra Retico ◽  
Sara Calderoni ◽  
Gastone Castellani ◽  
...  

Autism spectrum disorders (ASDs) are a heterogeneous group of neurodevelopmental conditions characterized by impairments in social interaction and communication and restricted patterns of behavior, interests, and activities. Although the etiopathogenesis of idiopathic ASD has not been fully elucidated, compelling evidence suggests an interaction between genetic liability and environmental factors in producing early alterations of structural and functional brain development that are detectable by magnetic resonance imaging (MRI) at the group level. This work shows the results of a network-based approach to characterize not only variations in the values of the extracted features but also in their mutual relationships that might reflect underlying brain structural differences between autistic subjects and healthy controls. We applied a network-based analysis on sMRI data from the Autism Brain Imaging Data Exchange I (ABIDE-I) database, containing 419 features extracted with FreeSurfer software. Two networks were generated: one from subjects with autistic disorder (AUT) (DSM-IV-TR), and one from typically developing controls (TD), adopting a subsampling strategy to overcome class imbalance (235 AUT, 418 TD). We compared the distribution of several node centrality measures and observed significant inter-class differences in averaged centralities. Moreover, a single-node analysis allowed us to identify the most relevant features that distinguished the groups.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Douglas Guilbeault ◽  
Damon Centola

AbstractThe standard measure of distance in social networks – average shortest path length – assumes a model of “simple” contagion, in which people only need exposure to influence from one peer to adopt the contagion. However, many social phenomena are “complex” contagions, for which people need exposure to multiple peers before they adopt. Here, we show that the classical measure of path length fails to define network connectedness and node centrality for complex contagions. Centrality measures and seeding strategies based on the classical definition of path length frequently misidentify the network features that are most effective for spreading complex contagions. To address these issues, we derive measures of complex path length and complex centrality, which significantly improve the capacity to identify the network structures and central individuals best suited for spreading complex contagions. We validate our theory using empirical data on the spread of a microfinance program in 43 rural Indian villages.


BMC Medicine ◽  
2020 ◽  
Vol 18 (1) ◽  
Author(s):  
Tobias R. Spiller ◽  
Ofir Levi ◽  
Yuval Neria ◽  
Benjamin Suarez-Jimenez ◽  
Yair Bar-Haim ◽  
...  

Abstract Background In the network approach to psychopathology, psychiatric disorders are considered networks of causally active symptoms (nodes), with node centrality hypothesized to reflect symptoms’ causal influence within a network. Accordingly, centrality measures have been used in numerous network-based cross-sectional studies to identify specific treatment targets, based on the assumption that deactivating highly central nodes would proliferate to other nodes in the network, thereby collapsing the network structure and alleviating the overall psychopathology (i.e., the centrality hypothesis). Methods Here, we summarize three types of evidence pertaining to the centrality hypothesis in psychopathology. First, we discuss the validity of the theoretical assumptions underlying the centrality hypothesis in psychopathology. We then summarize the methodological aspects of extant studies using centrality measures as predictors of symptom change following treatment, while delineating their main findings and several of their limitations. Finally, using a specific dataset of 710 treatment-seeking patients with posttraumatic stress disorder (PTSD) as an example, we empirically examine node centrality as a predictor of therapeutic change, replicating the approach taken by previous studies, while addressing some of their limitations. Specifically, we investigated whether three pre-treatment centrality indices (strength, predictability, and expected influence) were significantly correlated with the strength of the association between a symptom’s change and the change in the severity of all other symptoms in the network from pre- to post-treatment (Δnode-Δnetwork association). Using similar analyses, we also examine the predictive validity of two simple non-causal node properties (mean symptom severity and infrequency of symptom endorsement). Results Of the three centrality measures, only expected influence successfully predicted how strongly changes in nodes/symptoms were associated with change in the remainder of the nodes/symptoms. Importantly, when excluding the amnesia node, a well-documented outlier in the phenomenology of PTSD, none of the tested centrality measures predicted symptom change. Conversely, both mean symptom severity and infrequency of symptom endorsement, two standard non-network-derived indices, were found to be more predictive than expected influence and remained significantly predictive also after excluding amnesia from the network analyses. Conclusions The centrality hypothesis in its current form is ill-defined, showing no consistent supporting evidence in the context of cross-sectional, between-subject networks.


2019 ◽  
Vol 2019 ◽  
pp. 1-10 ◽  
Author(s):  
Weiwei Cao ◽  
Xiangnan Feng ◽  
Jianmin Jia ◽  
Hong Zhang

Understanding the structure of the Chinese railway network (CRN) is crucial for maintaining its efficiency and planning its future development. To advance our knowledge of CRN, we modeled CRN as a complex weighted network and explored the structural characteristics of the network via statistical evaluations and spatial analysis. Our results show CRN as a small-world network whose train flow obeys power-law decaying, demonstrating that CRN is a mature transportation infrastructure with a scale-free structure. CRN also shows significant spatial heterogeneity and hierarchy in its regionally uneven train flow distribution. We then examined the nodal centralities of CRN using four topological measures: degree, strength, betweenness, and closeness. Nodal degree is positively correlated with strength, betweenness, and closeness. Unlike the common feature of a scale-free network, the most connected nodes in CRN are not necessarily the most central due to underlying geographical, political, and socioeconomic factors. We proposed an integrated measure based on the four centrality measures to identify the global role of each node and the multilayer structure of CRN and confirm that stable connections hold between different layers of CRN.


2016 ◽  
Vol 8 (2) ◽  
pp. 154 ◽  
Author(s):  
Jason Christopher Jones ◽  
Nadine Normand-Marconnet

Author of a dozen reputable works on wine, M. Dovaz composed the foreword for the French translation of the Japanese graphic novel, Kami no Shizuku (2005), released in France as Les Gouttes de Dieu (2008). This manga has become a best-seller in its genre in France while the Japanese television adaptation has also reached a French audience through fansubs, allowing a new generation to gain access to that which had hitherto been seen as its own cultural patrimony. Integral to this wine culture is the ability to “talk about” its central object, in spite of linguistic or geographical separation. The central challenge being to efficiently associate words to a fleeting sensation provoked by visual, olfactory and gustative experience, a specific linguistic knowledge is necessary for those who wish to claim proficiency in wine. The critical narrative arc and didacticism of wine manga rests in the mastery of lexical sophistication as well as cultural knowledge, a posture also shared by most French experts. The language of wine, the power center of which once resided in France, has been brought into Japan through the act of translation. This very act has allowed for a shift in power—and thus the potential to represent the wine world—from France to Japan. We will show in this paper that there is an interplay occurring between French and Japanese media, producing a cultural space bridged through wine lexicon used in two series of manga recently translated into French (Sommelier in 2004 and Les Gouttes de Dieu in 2008). For this purpose, we will proceed to a comparative analysis of the Japanese source text with the French target text, highlighting metaphors used in wine culture. Through the analysis of the texts, we will demonstrate that the Japanese-French translations of these metaphors allow a new way for the French to see their culture through a lense provided by the Japanese sommelier.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Orsolya Kardos ◽  
András London ◽  
Tamás Vinkó

AbstractIdentifying key actors or nodes in a network is a relevant task regarding many applications. In general, the real-valued function that evaluates the nodes is called node centrality measure. Performing a relevance-based ranking on the list of nodes is also of high practical importance, since the most central nodes by a measure usually provide the highest contribution in explaining the behavior of the whole network. Stability of centrality measures against graph perturbation is an important concept, especially in the analysis of real world—often noise contaminated—datasets from different domains. In this paper, with the utilization of the formal definition of stability introduced by Segarra and Ribeiro (IEEE Trans Signal Process 64(3):543–555, 2015), we discuss three main perturbation categories and experimentally analyze the stability of several node centrality measures.


Sign in / Sign up

Export Citation Format

Share Document