AI-enabled ChatGPT or LLM: a new algorithm is required for plagiarism-free scientific writing

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Dear Editor, A recent article by Shafiee demonstrated the use of ChatGPT for scientific writing, especially the writing of a research paper 1 . Several other researchers have also informed ChatGPT’s capability for scientific writing and argued about the same 2,3 . Therefore, Shafiee’s article in this journal is very important and timely. Plagiarism is a significant concern within academic and scientific writing. Plagiarism and errors may have dire consequences for the credibility of researchers, and it creates adverse effects on the undertaken research. Therefore, academicians, researchers, and scientists always avoid plagiarism in scientific writing. The advent of artificial intelligence (AI)-generated content has only exacerbated the issue by facilitating the ease with which individuals can copy and paste information. Kleebayoon and Wiwanitkit 4 examined the ethical ramifications of AI technologies that generate text which evades plagiarism detectors. This group reported that while generative AI chatbots may not possess the capability to produce wholly referenced text, they have the potential to create blog posts or student essays. Therefore, it threatens the credibility of evaluations and scholarly work. AI-generated output harbors inaccuracies, which can prove problematic in scientific and medical contexts. For example, ChatGPT, a human-like conversational chatbot developed by OpenAI in November 2022, was examined for its medical report generation capabilities. It was discovered that ChatGPT’s responses had errors, and researchers have highlighted these 5 . ChatGPT may struggle to generate precise information in areas with little existing research. If AI-generated content is perceived as less trustworthy due to issues related to plagiarism or errors, it may be less likely to be understood as accurate. In academic and scientific contexts, the presence of plagiarism and errors in AI-generated content has a considerable impact on the credibility of information. Interestingly, several anti-plagiarism services have been developed in academic writings, namely Turnitin., iThenticate, PlagiarismDetect, etc. However, it is crucial to acknowledge that AI-generated content can also be used for cheating purposes, and the current plagiarism detection methods may not possess the necessary efficacy to detect all instances of plagiarism. Since ChatGPT sources its content from pre-existing online content, inadvertent similarities with the original content may be present. Most plagiarism detection scores, when utilizing ChatGPT-generated content, have a considerable amount of plagiarism. Therefore, it is of utmost need that while utilizing ChatGPT, the generated content remains original. The present algorithms utilized by ChatGPT operate on the generative pre-trained transformer (GPT) format of a large language model (LLM), demonstrating its competence in various natural language processing tasks without necessitating adaptation on downstream data 6 . LLM uses the transformer model, which uses deep learning-based architecture. This model works faster compared to the previous neural network model. Numerous academic datasets have been used to assess ChatGPT’s effectiveness in various tasks, and the results reveal that it is proficient in achieving different objectives 7 . However, the current algorithms utilized to tackle the issues of plagiarism and errors in AI-generated content are not entirely immune to limitations. In the specific context of ChatGPT, apprehensions have been expressed regarding the potential for bias and plagiarism when utilizing the tool 8 . ChatGPT may reproduce and reinforce pre-existing biases in the data on which it is trained, resulting in flawed and inequitable predictions. Moreover, there exists a danger of intellectual theft if individuals directly replicate and paste data created by ChatGPT without offering appropriate citation or recognition of its usage. Despite efforts to produce solutions for recognizing factual inaccuracies in summarization models, the continuously changing nature of summarization systems, metrics, and annotated benchmarks leads to factual evaluation becoming a shifting concept, thus hindering clear-cut comparisons among its measures 9 . Furthermore, most recent algorithms in factual detection have been made on synopses from older (pre-transformer) models rather than more current summarization models. Designing a new algorithm for ChatGPT, which generates output that is free of error and plagiarism, demands a fusion of diverse techniques and approaches. To achieve this objective, some steps must be followed. Before initiating algorithmic training, it is crucial to preprocess and sanitize the data. This phase entails eliminating irrelevant information, such as stop words, and rectifying spelling and grammatical inaccuracies. The textual data must undergo scrutiny for plagiarism detection using dependable tools. The algorithm can be trained by employing machine learning and deep learning techniques. One such approach is utilizing the Naive Bayes classification algorithm, which can categorize text into distinct groupings, including positive or negative sentiment 10 . Another technique is to utilize natural language generation methods, which involve creating a natural language text generation system capable of generating human-readable languages such as English and Chinese through AI and linguistic methodologies 11 . To optimize the precision of the algorithm, calibration before deployment is imperative. The calibration process necessitates the adjustment of the output probabilities of the algorithm to guarantee the uniformity of the predictions across all responses 12 . To ensure that the output is devoid of any discrepancies and plagiarism, the utilization of a verifier module is indispensable. This module can be trained to function as a verifier for ChatGPT’s output, thereby enhancing its performance iteratively by utilizing fine-grained corrective instructions 13 . For evaluating the algorithm’s performance, appropriate evaluation metrics need to be employed. One of the viable approaches for this end is the adoption of exact match as the evaluation metric, which entails comparing the model’s produced output with the ground-truth data. In conclusion, a new plagiarism-free algorithm is urgently needed to generate plagiarism-free text for scientific writing. Developing a new plagiarism-free and error-free algorithm for ChatGPT provides numerous benefits. Although ChatGPT can generate responses that resemble those of humans, incomplete or biased data may result in errors. By eliminating errors and plagiarism, a new algorithm can enhance the quality and reliability of ChatGPT’s output. The implications of relying on current versions of ChatGPT for high-stakes decision-making can be severe, but a new algorithm can ensure that the output is error-free and plagiarism-free, thereby enhancing user trust. Although ChatGPT has been successfully implemented for writing in education, healthcare, and other domains, a new algorithm can expand its applicability in these areas. For instance, in education, ChatGPT can generate plagiarism-free and high-quality essays and research papers; while in healthcare, it can produce accurate medical reports, treatment suggestions, and plagiarism-free high-quality other documents. Therefore, all professionals in education, healthcare, and other domains should come together with computer engineers and computer scientists and get involved immediately in developing the next-generation algorithm, which will help students, teachers, researchers, and scientists for plagiarism-free scientific writing. Ethical approval Not applicable. Sources of funding Not applicable. Author contribution S.P.: writing – review and editing, investigation, and validation; M.B. and M.A.I.: validation; C.C.: conceptualization, data curation, writing – original draft, and review and editing. All authors critically reviewed and approved the final version of the manuscript. Conflicts of interest disclosure All authors report no conflicts of interest relevant to this article. Research registration unique identifying number (UIN) Name of the registry: not applicable. Unique identifying number or registration ID: not applicable. Hyperlink to your specific registration (must be publicly accessible and will be checked): not applicable. Guarantor Md. Aminul Islam, COVID-19 Diagnostic Lab, Department of Microbiology, Noakhali Science and Technology University, Noakhali 3814, Bangladesh; E-mail: aminulmbg@gmail.com Data availability statement The data in this correspondence article are not sensitive in nature and are accessible in the public domain. The data are therefore available and not of a confidential nature. Provenance and peer review Not commissioned, internally peer-reviewed.

Related collections

Most cited references 11

Record: found
Abstract: found
Article: found

Is Open Access

Can artificial intelligence help for scientific writing?

Michele Salvagno, ChatGPT, Fabio Silvio Taccone … (2023)

This paper discusses the use of Artificial Intelligence Chatbot in scientific writing. ChatGPT is a type of chatbot, developed by OpenAI, that uses the Generative Pre-trained Transformer (GPT) language model to understand and respond to natural language inputs. AI chatbot and ChatGPT in particular appear to be useful tools in scientific writing, assisting researchers and scientists in organizing material, generating an initial draft and/or in proofreading. There is no publication in the field of critical care medicine prepared using this approach; however, this will be a possibility in the next future. ChatGPT work should not be used as a replacement for human judgment and the output should always be reviewed by experts before being used in any critical decision-making or application. Moreover, several ethical issues arise about using these tools, such as the risk of plagiarism and inaccuracies, as well as a potential imbalance in its accessibility between high- and low-income countries, if the software becomes paying. For this reason, a consensus on how to regulate the use of chatbots in scientific writing will soon be required.

0 comments Cited 183 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Artificial Intelligence, Chatbots, Plagiarism and Basic Honesty: Comment.

Amnuay Kleebayoon, Viroj Wiwanitkit (2023)

0 comments Cited 51 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Is Open Access

ChatGPT Challenges Blended Learning Methodologies in Engineering Education: A Case Study in Mathematics

Luis M. Sánchez-Ruiz, Santiago Moll-López, Adolfo Nuñez-Pérez … (2023)

This research aims to explore the potential impact of the ChatGPT on b-learning methodologies in engineering education, specifically in mathematics. The study focuses on how the use of these artificial intelligence tools can affect the acquisition of critical thinking, problem-solving, and group work skills among students. The research also analyzes the students’ perception of the reliability, usefulness, and importance of these tools in academia. The study collected data through a survey of 110 students enrolled in a Mathematics I course in BEng Aerospace Engineering where a blended methodology, including flipped teaching, escape room gamification, problem-solving, and laboratory sessions and exams with a computer algebraic system were used. The data collected were analyzed using statistical methods and tests for significance. Results indicate students have quickly adopted ChatGPT tool, exhibiting high confidence in their responses (3.4/5) and general usage in the learning process (3.61/5), alongside a positive evaluation. However, concerns arose regarding the potential impact on developing lateral competencies essential for future engineers (2.8/5). The study concludes that the use of ChatGPT in blended learning methodologies poses new challenges for education in engineering, which requires the adaptation of teaching strategies and methodologies to ensure the development of essential skills for future engineers.

0 comments Cited 18 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Soumen Pal

Manojit Bhattacharya

Md. Aminul Islam

Chiranjib Chakraborty

Journal

Journal ID (nlm-ta): Int J Surg

Journal ID (iso-abbrev): Int J Surg

Journal ID (publisher-id): JS9

Title: International Journal of Surgery (London, England)

Publisher: Lippincott Williams & Wilkins (Hagerstown, MD )

ISSN (Print): 1743-9191

ISSN (Electronic): 1743-9159

Publication date Collection: February 2024

Publication date (Electronic): 23 November 2023

Volume: 110

Issue: 2

Pages: 1329-1330

Affiliations

[a ]School of Mechanical Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India

[b ]Department of Zoology, Fakir Mohan University, Vyasa Vihar, Balasore, Odisha, India

[c ]Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Kolkata, West Bengal, India

[d ]Advanced Molecular Lab, Department of Microbiology, President Abdul Hamid Medical College, Karimganj, Kishoreganj

[e ]COVID-19 Diagnostic Lab, Department of Microbiology, Noakhali Science and Technology University, Noakhali, Bangladesh

Author notes

[* ]Corresponding author. Address: COVID-19 Diagnostic Lab, Department of Microbiology, Noakhali Science and Technology University, Noakhali 3814, Bangladesh. E-mail: aminulmbg@ 123456gmail.com (A. Islam).

Article

Publisher ID: IJS-D-23-02524 Accession ID: 00092

DOI: 10.1097/JS9.0000000000000939

PMC ID: 10871629

PubMed ID: 38000076

SO-VID: cfcc0286-d6d8-4a1a-a07b-c5e99a4347e2

License:

This is an open access article distributed under the Creative Commons Attribution-ShareAlike License 4.0, which allows others to remix, tweak, and build upon the work, even for commercial purposes, as long as the author is credited and the new creations are licensed under the identical terms. http://creativecommons.org/licenses/by-sa/4.0/

History

Date received : 8 November 2023

Date accepted : 9 November 2023

Custom metadata

OPEN-ACCESS TRUE

ScienceOpen disciplines: Surgery

Data availability:

ScienceOpen disciplines: Surgery

Comments

Comment on this article

scite_

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.