citation graph
Recently Published Documents


TOTAL DOCUMENTS

38
(FIVE YEARS 20)

H-INDEX

6
(FIVE YEARS 2)

2021 ◽  
Vol 24 (5) ◽  
pp. 923-943
Author(s):  
Андрей Анатольевич Печников ◽  
Дмитрий Евгеньевич Чебуков

According to the portal Math-Net.Ru a graph of journal citation is constructed. To increase the reliability of the model, the citation time interval was chosen from 2010 to 2021, when the distribution of citation articles stabilized at the level of 3500-4500 citations per year. The structure of link aging is studied and it is shown that their half-life is equal to 8 years. Therefore, the publication date of the cited articles was limited to 2002. For the constructed citation graph, the main properties, such as a small diameter and a high density, are obtained, indicating a high level of scientific communication in the Math-Net.Ru. The absence of the “Matthew effect” as a pronounced advantage in quoting established journals in relation to less well-known ones is shown. Adequacy of the journal citation graph Math-Net.Ru as a model of scientific communication confirmed by comparing the ranking of journals in the citation graph with their SCIENCE INDEX rating in eLIBRARY.RU. A direct moderate relationship between the two rankings is shown. A number of meaningful conclusions are drawn from the analysis of the citation graph.


2021 ◽  
pp. 1-46
Author(s):  
Peter Buneman ◽  
Dennis Dosso ◽  
Matteo Lissandrini ◽  
Gianmaria Silvello

Abstract The citation graph is a computational artifact that is widely used to represent the domain of published literature. It represents connections between published works, such as citations and authorship. Among other things, the graph supports the computation of bibliometric measures such as h-indexes and impact factors. There is now an increasing demand that we should treat the publication of data in the same way that we treat conventional publications. In particular, we should cite data for the same reasons that we cite other publications. In this paper we discuss what is needed for the citation graph to represent data citation. We identify two challenges: (i) to model the evolution of credit appropriately (through references) over time and (ii) to model data citation not only to a dataset treated as a single object but also to parts of it. We describe an extension of the current citation graph model that addresses these challenges. It is built on two central concepts: citable units and reference subsumption. We discuss how this extension would enable data citation to be represented within the citation graph and how it allows for improvements in current practices for bibliometric computations both for scientific publications and for data.


2021 ◽  
pp. 1-19
Author(s):  
Tirthankar Ghosal ◽  
Piyush Tiwary ◽  
Robert Patton ◽  
Christopher Stahl

Abstract Finding the lineage of a research topic is crucial for understanding the prior state of the art and advancing scientific displacement. The deluge of scholarly articles makes it difficult to locate the most relevant previous work. It causes researchers to spend a considerable amount of time building up their literature list. Citations play a crucial role in discovering relevant literature. However, not all citations are created equal. The majority of the citations that a paper receives provide contextual and background information to the citing papers. In those cases, the cited paper is not central to the theme of citing papers. However, some papers build upon a given paper, further the research frontier. In those cases, the concerned cited paper plays a pivotal role in the citing paper. Hence, the nature of citation the former receives from the latter is significant. In this work, we discuss our investigations towards discovering significant citations of a given paper. We further show how we can leverage significant citations to build a research lineage via a significant citation graph. We demonstrate the efficacy of our idea with two real-life case studies. Our experiments yield promising results with respect to the current state-of-the-art in classifying significant citations, outperforming the earlier ones by a relative margin of 20 points in terms of precision. We hypothesize that such an automated system can facilitate relevant literature discovery and help identify knowledge flow for a particular category of papers.


2021 ◽  
Vol 9 ◽  
Author(s):  
Jerrold Soh Tsin Howe

We propose and evaluate generative models for case law citation networks that account for legal authority, subject relevance, and time decay. Since Common Law systems rely heavily on citations to precedent, case law citation networks present a special type of citation graph which existing models do not adequately reproduce. We describe a general framework for simulating node and edge generation processes in such networks, including a procedure for simulating case subjects, and experiment with four methods of modelling subject relevance: using subject similarity as linear features, as fitness coefficients, constraining the citable graph by subject, and computing subject-sensitive PageRank scores. Model properties are studied by simulation and compared against existing baselines. Promising approaches are then benchmarked against empirical networks from the United States and Singapore Supreme Courts. Our models better approximate the structural properties of both benchmarks, particularly in terms of subject structure. We show that differences in the approach for modelling subject relevance, as well as for normalizing attachment probabilities, produce significantly different network structures. Overall, using subject similarities as fitness coefficients in a sum-normalized attachment model provides the best approximation to both benchmarks. Our results shed light on the mechanics of legal citations as well as the community structure of case law citation networks. Researchers may use our models to simulate case law networks for other inquiries in legal network science.


2021 ◽  
Author(s):  
Bahareh Kazemi

Surfing data mining techniques for representing data sources have specifically attracted much attention among researchers. Given the curse of dimensionality in representing text using the traditional Bag-of-words models, lower-dimensional representation of text has been an important line of research due to its impact on many prediction, and recommendation tasks. This thesis studies two main different viewpoints in text representation using content and citation information and then, different existing approaches along with their advantages, limitations and drawbacks are reviewed. A novel hybrid distributed technique for text representation is proposed where the textual content of documents is projected into a vector representation using an artificial neural network . To test the performance of the new proposed technique, the well known link-prediction problem is selected to serve as a benchmark. A comparison is performed with other common techniques by predicting the existence of citation links between tuple of papers in a large citation graph.


2021 ◽  
Author(s):  
Bahareh Kazemi

Surfing data mining techniques for representing data sources have specifically attracted much attention among researchers. Given the curse of dimensionality in representing text using the traditional Bag-of-words models, lower-dimensional representation of text has been an important line of research due to its impact on many prediction, and recommendation tasks. This thesis studies two main different viewpoints in text representation using content and citation information and then, different existing approaches along with their advantages, limitations and drawbacks are reviewed. A novel hybrid distributed technique for text representation is proposed where the textual content of documents is projected into a vector representation using an artificial neural network . To test the performance of the new proposed technique, the well known link-prediction problem is selected to serve as a benchmark. A comparison is performed with other common techniques by predicting the existence of citation links between tuple of papers in a large citation graph.


2021 ◽  
Author(s):  
Hanwen Liu ◽  
Jun Hou ◽  
Qianmu Li ◽  
Jian Jiang

Abstract Currently, readers often prefer to search for their interested papers based on a set of typed query keywords. As the keywords of a paper is often limited, paper recommender systems often need to recommend a set of papers which collectively satisfy the readers’ keyword query. However, the topics of recommended papers are probably not correlated with each other, which fail to meet the readers’ requirements on in-depth and continuous academic research. Furthermore, although existing paper citation graphs can model the papers’ correlations, they often face the data sparse problem which blocks accurate paper recommendations. To address these issues, we propose a keywords-driven and weight-aware paper recommendation approach, named LP-PRk+w (link prediction-paper recommendation), based on a weighted paper correlation graph. Concretely, we firstly optimize the existing paper citation graph modes by introducing a weighted similarity, after which we obtain a weighted paper correlation graph. Then we recommend a set of correlated papers based on the weighted paper correlation graph and the query keywords from readers. At last, we conduct large-scale experiments on a real-world Hep-Th dataset. Experimental results demonstrate that our proposal can improve the paper recommendation performances considerably, compared to other related solutions.


2021 ◽  
Vol 20 (4) ◽  
pp. 50-64
Author(s):  
Bissan Audeh ◽  
Michel Beigbeder ◽  
Christine Largeron ◽  
Diana Ramírez-Cifuentes

Digital libraries have become an essential tool for researchers in all scientific domains. With almost unlimited storage capacities, current digital libraries hold a tremendous number of documents. Though some efforts have been made to facilitate access to documents relevant to a specific information need, such a task remains a real challenge for a new researcher. Indeed neophytes do not necessarily use appropriate keywords to express their information need and they might not be qualified enough to evaluate correctly the relevance of documents retrieved by the system. In this study, we suppose that to better meet the needs of neophytes, the information retrieval system in a digital library should take into consideration features other than content-based relevance. To test this hypothesis, we use machine learning methods and build new features from several metadata related to documents. More precisely, we propose to consider as features for machine learning: content-based scores, scores based on the citation graph and scores based on metadata extracted from external resources. As acquiring such features is not a trivial task, we analyze their usefulness and their capacity to detect relevant documents. Our analysis concludes that the use of these additional features improves the performance of the system for a neophyte. In fact, by adding the new features we find more documents suitable for neophytes within the results returned by the system than when using content-based features alone.


2021 ◽  
pp. 631-647
Author(s):  
Ke Yuan ◽  
Liangcai Gao ◽  
Zhuoren Jiang ◽  
Zhi Tang

2021 ◽  
Author(s):  
Andrey Anatolievich Pechnikov ◽  
Dmitry Evgenievch Chebukov

According to the portal Math-Net.Ru a graph of journal citation is constructed. To increase the reliability of the model, the citation time interval was chosen from 2010 to 2021, when the distribution of citation articles stabilized at the level of 3500-4500 citations per year. The structure of link aging is studied and it is shown that their half-life is equal to 8 years. Therefore, the publication date of the cited articles was limited to 2002. For the constructed citation graph, the main properties, such as a small diameter and a high density, are obtained, indicating a high level of scientific communication in the Math-Net.Ru. Adequacy of the journal citation graph Math-Net.Ru as a model of scientific communication confirmed by comparing the ranking of journals in the citation graph with their SCIENCE INDEX rating in eLIBRARY.RU. A direct moderate relationship between the two rankings is shown. A number of meaningful conclusions are drawn from the analysis of the citation graph.


Sign in / Sign up

Export Citation Format

Share Document