A Profile-Based Authorship Attribution Approach to Forensic Identification in Chinese Online Messages

Author(s):  
Jianbin Ma ◽  
Bing Xue ◽  
Mengjie Zhang
2020 ◽  
Vol 140 (1) ◽  
Author(s):  
Chiara Turchi ◽  
Filomena Melchionda ◽  
Mauro Pesaresi ◽  
Eleonora Ciarimboli ◽  
Carla Bini ◽  
...  

2015 ◽  
Author(s):  
Upendra Sapkota ◽  
Steven Bethard ◽  
Manuel Montes ◽  
Thamar Solorio

Author(s):  
Gülbanu K. Zorba ◽  
Theodora Eleftheriou ◽  
İstenç Engin ◽  
Sophia Hartsioti ◽  
Christiana Zenonos

2019 ◽  
Vol 35 (4) ◽  
pp. 812-825 ◽  
Author(s):  
Robert Gorman

Abstract How to classify short texts effectively remains an important question in computational stylometry. This study presents the results of an experiment involving authorship attribution of ancient Greek texts. These texts were chosen to explore the effectiveness of digital methods as a supplement to the author’s work on text classification based on traditional stylometry. Here it is crucial to avoid confounding effects of shared topic, etc. Therefore, this study attempts to identify authorship using only morpho-syntactic data without regard to specific vocabulary items. The data are taken from the dependency annotations published in the Ancient Greek and Latin Dependency Treebank. The independent variables for classification are combinations generated from the dependency label and the morphology of each word in the corpus and its dependency parent. To avoid the effects of the combinatorial explosion, only the most frequent combinations are retained as input features. The authorship classification (with thirteen classes) is done with standard algorithms—logistic regression and support vector classification. During classification, the corpus is partitioned into increasingly smaller ‘texts’. To explore and control for the possible confounding effects of, e.g. different genre and annotator, three corpora were tested: a mixed corpus of several genres of both prose and verse, a corpus of prose including oratory, history, and essay, and a corpus restricted to narrative history. Results are surprisingly good as compared to those previously published. Accuracy for fifty-word inputs is 84.2–89.6%. Thus, this approach may prove an important addition to the prevailing methods for small text classification.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Daisuke Miyamori ◽  
Takeshi Uemura ◽  
Wenliang Zhu ◽  
Kei Fujikawa ◽  
Takaaki Nakaya ◽  
...  

AbstractThe recent increase of the number of unidentified cadavers has become a serious problem throughout the world. As a simple and objective method for age estimation, we attempted to utilize Raman spectrometry for forensic identification. Raman spectroscopy is an optical-based vibrational spectroscopic technique that provides detailed information regarding a sample’s molecular composition and structures. Building upon our previous proof-of-concept study, we measured the Raman spectra of abdominal skin samples from 132 autopsy cases and the protein-folding intensity ratio, RPF, defined as the ratio between the Raman signals from a random coil an α-helix. There was a strong negative correlation between age and RPF with a Pearson correlation coefficient of r = 0.878. Four models, based on linear (RPF), squared (RPF2), sex, and RPF by sex interaction terms, were examined. The results of cross validation suggested that the second model including linear and squared terms was the best model with the lowest root mean squared error (11.3 years of age) and the highest coefficient of determination (0.743). Our results indicate that the there was a high correlation between the age and RPF and the Raman biological clock of protein folding can be used as a simple and objective forensic age estimation method for unidentified cadavers.


Sign in / Sign up

Export Citation Format

Share Document