Where there's a will there's a way: ChatGPT is used more for science in countries where it is prohibited

2406.11583

Published 6/28/2024 by Honglin Bao, Mengyi Sun, Misha Teplitskiy

Abstract

Regulating AI is a key societal challenge, but which regulation methods are effective is unclear. This study measures the effectiveness of restricting AI services geographically, focusing on ChatGPT. OpenAI restricts ChatGPT access in several countries, including China and Russia. If restrictions are effective, ChatGPT use should be minimal in these countries. We measured use with a classifier based on distinctive word usage found in early versions of ChatGPT, e.g. delve. We trained the classifier on pre- and post-ChatGPT polished abstracts and found it outperformed GPTZero and ZeroGPT on validation sets, including papers with self-reported AI use. Applying the classifier to preprints from Arxiv, BioRxiv, and MedRxiv showed ChatGPT was used in about 12.6% of preprints by August 2023, with 7.7% higher usage in restricted countries. The gap appeared before China's first major legal LLM became widely available. To test the possibility that, due to high demand, use in restricted countries would have been even higher without restrictions, we compared Asian countries with high expected demand (where English is not an official language) and found that use was higher in those with restrictions. ChatGPT use was correlated with higher views and downloads, but not citations or journal placement. Overall, restricting ChatGPT geographically has proven ineffective in science and possibly other domains, likely due to widespread workarounds.

Overview

  • Researchers investigate the effectiveness of geographic restrictions on the use of the AI chatbot ChatGPT, particularly in the context of scientific research.
  • They develop a machine learning model to detect the use of ChatGPT in preprint publications and analyze its usage patterns across different countries.
  • The findings suggest that geographic restrictions on ChatGPT have been largely ineffective, with significant use of the chatbot even in countries where it is prohibited.

Plain English Explanation

Researchers wanted to understand how well efforts to restrict access to the AI chatbot ChatGPT were working, particularly in the world of science and research. They developed a machine learning model that could detect when ChatGPT was used to write scientific preprints (early versions of research papers).

The team found that ChatGPT was used in around 12.6% of preprints by August 2023, and its use was 7.7% higher in countries where ChatGPT is officially prohibited, like China and Russia. This suggests that the geographic restrictions on ChatGPT have not been very effective, as people have likely found ways around the bans.

The researchers also found that papers flagged as using ChatGPT tended to get more views and downloads, but not more citations or better journal placements. This suggests that while ChatGPT may make writing easier, it doesn't necessarily improve the quality or impact of the research.

Overall, the study shows that attempts to limit the use of powerful AI tools like ChatGPT are facing significant challenges, as people find ways to access and use them regardless of geographic restrictions. This is an important consideration as policymakers and regulators grapple with how to manage the rise of AI technology.

Technical Explanation

The researchers used a machine learning approach to detect the use of ChatGPT in scientific preprints. They trained an ensemble classifier on abstracts before and after polishing with ChatGPT, leveraging the finding that early versions of ChatGPT favored distinctive words like "delve." [1] On validation sets, including papers with self-reported AI use, this classifier outperformed the off-the-shelf detectors GPTZero and ZeroGPT.
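To make the idea of detection by distinctive word usage concrete, here is a minimal sketch. It is not the authors' ensemble classifier: it only ranks words by how much more frequent they are in polished text than in original text, then flags an abstract that contains any of the top-ranked words. The function names, the smoothing constant, the thresholds, and the toy abstracts are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def distinctive_words(original_texts, polished_texts, top_k=5, smoothing=1.0):
    """Rank words by smoothed log-ratio of frequency in polished vs. original text.

    Hypothetical helper: a simple stand-in for whatever features the
    paper's classifier actually uses.
    """
    orig_counts = Counter(w for t in original_texts for w in tokenize(t))
    pol_counts = Counter(w for t in polished_texts for w in tokenize(t))
    vocab = set(orig_counts) | set(pol_counts)
    orig_total = sum(orig_counts.values()) + smoothing * len(vocab)
    pol_total = sum(pol_counts.values()) + smoothing * len(vocab)
    scores = {
        w: math.log((pol_counts[w] + smoothing) / pol_total)
        - math.log((orig_counts[w] + smoothing) / orig_total)
        for w in vocab
    }
    # Highest score first; ties are broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:top_k]


def flag_abstract(text, marker_words, min_hits=1):
    """Flag a text if it contains at least `min_hits` distinct marker words."""
    tokens = set(tokenize(text))
    hits = sum(1 for w in marker_words if w in tokens)
    return hits >= min_hits


# Toy data (invented for this sketch, not taken from the paper).
original = [
    "We study the effect of heat on cells.",
    "We look at the role of noise in data.",
]
polished = [
    "We delve into the intricate effect of heat on cells.",
    "We delve into the pivotal role of noise in data.",
]

markers = distinctive_words(original, polished, top_k=1)
print(markers)
print(flag_abstract("In this paper we delve into graph structure.", markers))
print(flag_abstract("We measure heat flow in cells.", markers))
```

A real detector would need far more training text, a held-out validation set, and a decision threshold tuned against false positives, since human authors also use words like "delve".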

Applying this classifier to preprints from arXiv, bioRxiv, and medRxiv, the researchers estimated that ChatGPT was used in approximately 12.6% of preprints by August 2023. Crucially, estimated use was 7.7% higher in countries without legal access to the chatbot, such as China and Russia. This gap emerged before the first major legal large language model (LLM) became widely available in China, the largest producer of preprints among restricted countries.
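The comparison between restricted and unrestricted countries comes down to comparing the share of flagged preprints in each group. The sketch below shows that arithmetic on invented toy records. It reports both the absolute gap (in percentage points) and the relative gap, because the summary above does not say which of the two the "7.7% higher" figure refers to. The group labels, function names, and counts are assumptions for illustration only.

```python
def usage_rate(flags):
    """Share of preprints flagged as using ChatGPT (0.0 for an empty group)."""
    return sum(flags) / len(flags) if flags else 0.0


def rates_by_group(records):
    """Compute the flagged share per group from (group, flagged) pairs."""
    by_group = {}
    for group, flagged in records:
        by_group.setdefault(group, []).append(bool(flagged))
    return {group: usage_rate(flags) for group, flags in by_group.items()}


# Toy records (invented): 3 of 10 flagged in one group, 2 of 10 in the other.
toy_records = (
    [("restricted", True)] * 3
    + [("restricted", False)] * 7
    + [("unrestricted", True)] * 2
    + [("unrestricted", False)] * 8
)

rates = rates_by_group(toy_records)
absolute_gap = rates["restricted"] - rates["unrestricted"]  # percentage-point gap
relative_gap = absolute_gap / rates["unrestricted"]  # gap relative to baseline

print(f"restricted: {rates['restricted']:.1%}")
print(f"unrestricted: {rates['unrestricted']:.1%}")
print(f"absolute gap: {absolute_gap:.1%} points, relative gap: {relative_gap:.1%}")
```

The two readings can differ a lot: in this toy case the same data gives a gap of 10 percentage points but a relative increase of 50%.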

The analysis also revealed that preprints flagged as using ChatGPT received more views and downloads, but did not differ significantly in citations or journal placement. This suggests that while ChatGPT may make writing more accessible, it does not necessarily improve the quality or impact of the research.

Critical Analysis

The research provides valuable insights into the effectiveness of geographic restrictions on AI tools like ChatGPT. However, the study is limited to the specific context of scientific preprints, and the findings may not generalize to other domains where ChatGPT is used.

Additionally, the study does not delve into the potential implications of widespread ChatGPT use in research, such as concerns around academic integrity, the ethics of AI-assisted writing, or the long-term impacts on the scientific community. [2][3][4][5]

Further research is needed to understand the broader societal and ethical implications of the growing use of AI tools in academic and professional settings. Policymakers and regulators will need to carefully consider the nuances and challenges of regulating transformative technologies like ChatGPT.

Conclusion

This study highlights the significant challenges in effectively restricting the use of powerful AI chatbots like ChatGPT, even when geographic access is limited. The findings suggest that such restrictions have been largely ineffective in the context of scientific research, with widespread use of ChatGPT observed even in countries where it is officially prohibited.

These insights have important implications for how policymakers and regulators approach the governance of transformative AI technologies. As AI tools become increasingly ubiquitous, understanding the limitations of geographic restrictions and exploring alternative regulatory approaches will be crucial in shaping the responsible development and use of these technologies.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

A Perspective Study on Chinese Social Media regarding LLM for Education and Beyond

Yao Tian, Chengwei Tong, Lik-Hang Lee, Reza Hadi Mogavi, Yong Liao, Pengyuan Zhou

The application of AI-powered tools has piqued the interest of many fields, particularly in the academic community. This study uses ChatGPT, currently the most powerful and popular AI tool, as a representative example to analyze how the Chinese public perceives the potential of large language models (LLMs) for educational and general purposes. Although facing accessibility challenges, we found that the number of discussions on ChatGPT per month is 16 times that of Ernie Bot developed by Baidu, the most popular alternative product to ChatGPT in the mainland, making ChatGPT a more suitable subject for our analysis. The study also serves as the first effort to investigate the changes in public opinion as AI technologies become more advanced and intelligent. The analysis reveals that, upon first encounters with advanced AI that was not yet highly capable, some social media users believed that AI advancements would benefit education and society, while others feared that advanced AI, like ChatGPT, would make humans feel inferior and lead to problems such as cheating and a decline in moral principles. The majority of users remained neutral. Interestingly, with the rapid development and improvement of AI capabilities, public attitudes have tended to shift in a positive direction. We present a thorough analysis of the trending shift and a roadmap to ensure the ethical application of ChatGPT-like models in education and beyond.

6/3/2024

Using ChatGPT for Thematic Analysis

Aleksei Turobov, Diane Coyle, Verity Harding

The utilisation of AI-driven tools, notably ChatGPT, within academic research is increasingly debated from several perspectives including ease of implementation, and potential enhancements in research efficiency, as against ethical concerns and risks such as biases and unexplained AI operations. This paper explores the use of the GPT model for initial coding in qualitative thematic analysis using a sample of UN policy documents. The primary aim of this study is to contribute to the methodological discussion regarding the integration of AI tools, offering a practical guide to validation for using GPT as a collaborative research assistant. The paper outlines the advantages and limitations of this methodology and suggests strategies to mitigate risks. Emphasising the importance of transparency and reliability in employing GPT within research methodologies, this paper argues for a balanced use of AI in supported thematic analysis, highlighting its potential to elevate research efficacy and outcomes.

5/16/2024

Delving into ChatGPT usage in academic writing through excess vocabulary

Dmitry Kobak, Rita González Márquez, Emőke-Ágnes Horvát, Jan Lause

Recent large language models (LLMs) can generate and revise text with human-level performance, and have been widely commercialized in systems like ChatGPT. These models come with clear limitations: they can produce inaccurate information, reinforce existing biases, and be easily misused. Yet, many scientists have been using them to assist their scholarly writing. How wide-spread is LLM usage in the academic literature currently? To answer this question, we use an unbiased, large-scale approach, free from any assumptions on academic LLM usage. We study vocabulary changes in 14 million PubMed abstracts from 2010-2024, and show how the appearance of LLMs led to an abrupt increase in the frequency of certain style words. Our analysis based on excess words usage suggests that at least 10% of 2024 abstracts were processed with LLMs. This lower bound differed across disciplines, countries, and journals, and was as high as 30% for some PubMed sub-corpora. We show that the appearance of LLM-based writing assistants has had an unprecedented impact in the scientific literature, surpassing the effect of major world events such as the Covid pandemic.

6/12/2024

Beyond the Hype: A Cautionary Tale of ChatGPT in the Programming Classroom

Grant Oosterwyk, Pitso Tsibolane, Popyeni Kautondokwa, Ammar Canani

Due to the proliferation of Large Language Models research and the use of various Artificial Intelligence (AI) tools, the field of information systems (IS) and computer science (CS) has evolved. The use of tools such as ChatGPT to complete various student programming exercises (e.g., in Python) and assignments has gained prominence amongst various academic institutions. However, recent literature has suggested that the use of ChatGPT in academia is problematic and the impact on teaching and learning should be further scrutinized. More specifically, little is known about how ChatGPT can be practically used with code (programming) writing to complete programming exercises amongst IS and CS undergraduate university students. Furthermore, the paper provides insights for academics who teach programming to create more challenging exercises and how to engage responsibly in the use of ChatGPT to promote classroom integrity. In this paper, we used Complex Adaptive Systems (CAS) theory as a theoretical guide to understand the various dynamics through classroom code demonstrations. Using ChatGPT 3.5, we analyzed the various practical programming examples from past IS exercises and compared those with memos created by tutors and lecturers in a university setting. This paper highlights common ways of assessment, programming errors created by ChatGPT and the potential consideration for IS academics to ensure the development of critical programming skills among students.

6/18/2024