scholarly journals Text mining and social network analysis of data curation literature

10.9776/16510 ◽  
2016 ◽  
Author(s):  
Jeonghyun Kim
2019 ◽  
Vol 8 (1) ◽  
pp. 009
Author(s):  
Carlos G. Figuerola ◽  
Tamar Groves ◽  
Francisco J. Rodríguez

The practice of historical research in recent years has been substantially affected by the emergence of the so-called digital humanities. New computer tools have been appearing, software systems capable of processing vast quantities of information in ways that until recently were inconceivable. Text mining and social network analysis techniques are sophisticated instruments that can help render a more enriching reading of the available data and draw useful conclusions. We reflect on this in the first part of this article, and then apply these tools to a practical case: quantifying and identifying the women who appear in university-related articles in the newspaper El País from its founding until 2011.


Author(s):  
Matías Borba Eguren

El presente trabajo analiza la participación de Carlos Pastore – intelectual y político paraguayo, exiliado en Montevideo en 1942 – en los homenajes a Artigas, realizados por el Instituto Histórico y Geográfico del Uruguay en 1950. Se busca establecer su papel como articulador entre el Instituto y otros paraguayos – en su mayoría exiliados –, aplicando técnicas del social network analysis y de text mining, para graficar la red político-historiográfica a su alrededor. Se pretende señalar cómo los homenajes a Artigas fueron una oportunidad para los exiliados paraguayos de utilizar el pasado como herramienta de militancia contra la situación política de Paraguay.


2018 ◽  
Vol 12 (2) ◽  
pp. 390-408
Author(s):  
Qian Zhang ◽  
Yang Cao ◽  
Qiwen Wang ◽  
Duc Vu ◽  
Priyaa Thavasimani ◽  
...  

We illustrate how combining retrospective and prospectiveprovenance can yield scientifically meaningful hybrid provenancerepresentations of the computational histories of data produced during a script run. We use scripts from multiple disciplines (astrophysics, climate science, biodiversity data curation, and social network analysis), implemented in Python, R, and MATLAB, to highlight the usefulness of diverse forms of retrospectiveprovenance when coupled with prospectiveprovenance. Users provide prospective provenance, i.e., the conceptual workflows latent in scripts, via simple YesWorkflow annotations, embedded as script comments. Runtime observables can be linked to prospective provenance via relational views and queries. These observables could be found hidden in filenames or folder structures, be recorded in log files, or they can be automatically captured using tools such as noWorkflow or the DataONE RunManagers. The YesWorkflow toolkit, example scripts, and demonstration code are available via an open source repository.


2019 ◽  
Vol 39 ◽  
pp. 93-112 ◽  
Author(s):  
Mark Warschauer ◽  
Soobin Yim ◽  
Hansol Lee ◽  
Binbin Zheng

AbstractThis paper will review the role of data mining in research on second language learning. Following a general introduction to the topic, three areas of data mining research will be summarized—clustering techniques, text-mining, and social network analysis—with examples from both the broader field and studies conducted by the authors. The application of data mining in second language learning research is relatively new, and more theoretical and empirical support is needed in the appropriate collection, use, and interpretation of data for specific research and pedagogical objectives. The three examples that we introduce illustrate how new data sources accessible in online environments can be analyzed to better understand the optimal instructional context for corpus-based vocabulary learning (clustering technique), characteristics and patterns of collaborative written interaction using Google Docs (text mining and visualizations), and issues of access and community in computer-mediated discussion (social network analysis). Implications of these new techniques for L2 research will be discussed.


Sign in / Sign up

Export Citation Format

Share Document