Academic journal article Informatica Economica

Stylometry Metrics Selection for Creating a Model for Evaluating the Writing Style of Authors According to Their Cultural Orientation

Academic journal article Informatica Economica

Stylometry Metrics Selection for Creating a Model for Evaluating the Writing Style of Authors According to Their Cultural Orientation

Article excerpt

The present paper starts from a short introduction of the major aspects debated regarding plagiarism and author identification, along with the principles that are at the base of forming the property rights laws within the European community and the Anglo-American one. Regardless of the community involved, plagiarism is a form of using others research, as it is or modified, and presenting it as a personal creation. The terms of creativity and plagiarism are described in an antithesis analysis, reaching to the concept of originality, defined as a property that a creative research paper has when the ideas presented within in are different from the ones already published by different authors. A metric is implemented in order to obtain a measurable value in determining the level of originality of a paper. The main ways of testing a paper of plagiarism, intrinsic and external analysis, are described for choosing the proper methodology for determining originality of scientific papers. The research leads to the stylometric analysis, a field found at the crossroad of plagiarism, originality and author identification. This stylometric analysis is done within the intrinsic plagiarism detection and is formed on the bases of a number of metrics that describe unique a writing style of a specific author. The testing platform implies using a set of research papers written by European authors and extracting the values of eight writing style metrics. A clustering is applied and the best combination of metrics is resulted.

Keywords: Stylometry, Plagiarism, Metrics, Clustering analysis, Cultural orientation

(ProQuest: ... denotes formulae omitted.)

Introduction

R esearches on the intellectual property rights face determining the level of originality of a research paper, in contract to the action of plagiarism which is defined as the full or partial ownership of ideas, expressions, methods or procedures and their presentation as a personal creation. In the AngloAmerican laws, the economic considerations and those that refer to public politics are prevailed in the elaboration and development of the property rights laws while, in the European point of view, the moral and civil arguments based the elaboration of the same laws.

The legislative framework does not resolve identifying plagiarism and level of originality of a scientific work. The present paper aims to apply the legislative property rights in the context of publishing scientific research papers.

In practice, there are different types of plagiarism, the most common being: copy-paste, paraphrase, plagiarism through translation in different languages, artistic plagiarism, ideas plagiarism, source code and not using the proper citations. Article [1] presents the fact that plagiarism through paraphrase is analyzed, reaching to a classification of the major known types, along with a testing using software detection of plagiarism at the level of percentage of correctness by identifying the paraphrase within a text document.

The present paper is consisted in five chapters, starting from a short introduction of the major aspects debated, along with the principles that are at the base of forming the property rights laws within the European community and the Anglo-American one. Regardless of the community involved, plagiarism is a form of using others research, as it is or modified, and presenting it as a personal creation.

Chapter 2 describes the terms of creativity and plagiarism in an antithesis analysis, reaching to the concept of originality, defined as a property that a creative research paper has when the ideas presented within in are different from the ones already published by different authors. A metric is implemented in order to obtain a measurable value in determining the level of originality of a paper. The main ways of testing a paper of plagiarism, intrinsic and external analysis, are described for choosing the proper methodology for determining originality of scientific papers. …

Search by... Author
Show... All Results Primary Sources Peer-reviewed

Oops!

An unknown error has occurred. Please click the button below to reload the page. If the problem persists, please try again in a little while.