Finding Efficient Linguistic Feature Set for Authorship Verification
DOI:
https://doi.org/10.31357/jcs.v1i1.1616Abstract
Authorship verification rely on identification of a given document is written by a particular author or not. Internally analyzing the document itself with respect to the variations in writing style of the author and identification of the author’s own idiolect is the main context of the authorship verification. Mainly, the detection performance depends on the used feature set for clustering the document. Linguistic features and stylistic features have been utilized for author identification according to the writing style of a particular author. Disclose the shallow changes of the author’s writing style is the major problem which should be addressed in the domain of authorship verification. It motivates the computer science researchers to do research on authorship verification in the field of computer forensics and this research also focuses this problem. The contributions from the research are two folded: Former is introducing a new feature extracting method with Natural Language Processing (NLP) and later is propose a new more efficient linguistic feature set for verification of author of the given document. Experiments on a corpus composed of freely downloadable genuine 19th century English Books and Self Organizing Maps has been used as the classifier to cluster the documents. Proper word segmentation also introduced in this work and it helps to demonstrate that the proposed strategy can produced promising results. Finally, it is realized that more accurate classification is generated by the proposed strategy with extracted linguistic feature set.Downloads
Published
2013-10-07
Issue
Section
Articles
License
With each "accepted" manuscript, the corresponding author must submit a copyright form which transfers the copyright of the published article to the University of Sri Jayewardenepura that warrants the article is the original work of the author and does not infringe the copyright of any other parties. Before publication, the Editorial Office must receive a signed soft/hard copy of the copyright form. There is no need to send the copyright form until a manuscript is accepted.
Click here to download the copyright transfer form (doc)
The copyright form can be emailed to jcs@sjp.ac.lk