AI Classifiers — What’s the problem with detection tools?

AI classifiers don’t work!

Natural language processor AIs are meant to be convincing. They are creating content that “sounds plausible because it’s all derived from things that humans have said” (Marcus, 2023). The intent is to produce outputs that mimic human writing. The result: The world’s leading AI companies can’t reliably distinguish the products of their own machines from the work of humans.

In January, OpenAI released its own AI text classifier. According to OpenAI “Our classifier is not fully reliable. In our evaluations on a “challenge set” of English texts, our classifier correctly identifies 26% of AI-written text (true positives) as “likely AI-written,” while incorrectly labeling human-written text as AI-written 9% of the time (false positives).”

A bit about how AI classifiers identify AI-generated content

GPTZero, a commonly used detection tool, identifies AI created works based on two factors: perplexity and burstiness.

Perplexity measures the complexity of text. Classifiers identify text that is predictable and lacking complexity as AI-generated and highly complex text as human-generated.

Burstiness compares variation between sentences. It measures how predictable a piece of content is by the homogeneity of the length and structure of sentences throughout the text. Human writing tends to be variable, switching between long and complex sentences and short, simpler ones. AI sentences tend to be more uniform with less creative variability.

The lower the perplexity and burstiness score, the more likely it is that text is AI generated.

Turnitin is a plagiarism-prevention tool that helps check the originality of student writing. On April 4th, Turnitin released an AI-detection feature.

According to Turnitin, its detection tool works a bit differently.

When a paper is submitted to Turnitin, the submission is first broken into segments of text that are roughly a few hundred words (about five to ten sentences). Those segments are then overlapped with each other to capture each sentence in context.

The segments are run against our AI detection model, and we give each sentence a score between 0 and 1 to determine whether it is written by a human or by AI. If our model determines that a sentence was not generated by AI, it will receive a score of 0. If it determines the entirety of the sentence was generated by AI it will receive a score of 1.

Using the average scores of all the segments within the document, the model then generates an overall prediction of how much text (with 98% confidence based on data that was collected and verified in our AI innovation lab) in the submission we believe has been generated by AI. For example, when we say that 40% of the overall text has been AI-generated, we’re 98% confident that is the case.

Currently, Turnitin’s AI writing detection model is trained to detect content from the GPT-3 and GPT-3.5 language models, which includes ChatGPT. Because the writing characteristics of GPT-4 are consistent with earlier model versions, our detector is able to detect content from GPT-4 (ChatGPT Plus) most of the time. We are actively working on expanding our model to enable us to better detect content from other AI language models.

The Issues

AI detectors cannot prove conclusively if text is AI generated. With minimal editing, AI-generated content evades detection.

L2 writers tend to write with less “burstiness.” Concern about bias is one of the reasons for UBC chose not to enable Turnitins’ AI-detection feature.

ChatGPT’s writing style may be less easy to spot than some think.

Privacy violations are a concern with both generators and detectors as both collect data.

Now what?

Langara’s EdTech, TCDC, and SCAI departments are working together to offer workshops on four potential approaches: Embrace it, Neutralize it, Ban it, Ignore it. Interested in a bespoke workshop for your department? Complete the request form.


References
Marcus, G. (2023, January 6). Ezra Klein interviews Gary Marcus [Audio podcast episode]. In The Ezra Klein Show. https://www.nytimes.com/2023/01/06/podcasts/transcript-ezra-klein-interviews-gary-marcus.html

Fowler, G.A. (2023, April 3). We tested a new ChatGPT-detector for teachers. If flagged an innocent student. Washington Post. https://www.washingtonpost.com/technology/2023/04/01/chatgpt-cheating-detection-Turnitin/

Podcast Playlist – Podcast recommendations from your Ed Tech team

Looking for inspiration? Podcasts are a convenient and approachable way to pick up some new tools for your teaching toolkit. In this new feature, we’ll share a few of our favorite episodes with a teaching and learning focus.

Maybe It Doesn’t Need to be a Video

In this episode of Think UDL Clea and host Lillian Nave talk about multiple ways of representing information in online classes, customizing the display of information, offering alternatives for text or auditory information, and guiding information processing and visualization for students

In this episode of Teaching in Higher Ed, Dan Levy, faculty director of the Public Leadership Credential, the Harvard Kennedy School’s flagship online learning initiative, talks about his book, Teaching Effectively with Zoom.

Talking Tech

In this episode of tea for teaching Michelle Miller, author of Minds Online: Teaching Effectively with Technology, examines how we can talk to students about technology in ways that will help them become more efficient in their learning and professional lives.

How to Use Audio Lessons in Your Course to Engage Students and Improve Learning

In this episode of Lecture Breakers Yehoshua Zlotogorski the power of audio for learning, especially when the audio lesson or audio course is intentionally designed based on cognitive science and pedagogy.

Equity-Enhancing Data Tools

In this episode of Teaching in Higher Ed Viji Sathy, award-winning Professor of the Practice in the Department of Psychology and Neuroscience at The University of North Carolina at Chapel Hill, and Kelly Hogan, Teaching Professor of Biology and Associate Dean of Instructional Innovation at The University of North Carolina at Chapel Hill, share two equity-enhancing data tools.

Podcast Playlist

""Podcast recommendations from your Ed Tech and TCDC team

Looking for inspiration? Podcasts are a convenient and approachable way to pick up some new tools for your teaching toolkit. In this new feature, we’ll share a few of our favorite shows with a teaching and learning focus.

Trends and Issues in Instructional Design, Educational Technology and Learning Science is a bi-monthly podcast presented by Abbie Brown (East Carolina University) and Tim Green (California State University). Episodes are short at around 10-15 minutes and cover news on a wide range of topics connected to technology enhanced learning. Accompanying the podcast is a Flipboard magazine.

Hosted by Thomas Cavanagh and Kelvin Thompson, the monthly The Teaching Online Podcast focuses on issues related to online and blended learning. Episodes clock in at about 30 minutes. Recent topics explored in the show include OER adoption, blended learning course design, community engagement, and the role of synchronous online teaching post-COVID.

ThinkUDL host Lillian Nave interviews guests about their experiences implementing Universal Design for Learning. Recent guests include Kirsten Behling, co-author of Reach Everyone, Teach Everyone: Universal Design for Learning in Higher Education, Flower Darby, co-author of Small Teaching Online: Applying Learning Science in Online Classes, and Kevin Kelly and Todd Zakrajsek, authors of Advancing Online Teaching: Creating Equity-Based Digital Learning Environments.

 

The Vancouver Podcast Festival: Report

Photograph of panel at CBC live podcast 2050: Degrees of Change
The Panel at 2050: Degrees of Change (CBC live podcast)

The Vancouver Podcast Festival 

Karen Budra and Julian Prior attended the inaugural Vancouver Podcast Festival, sponsored by the Justice Institute, CBC and the VPL and presented by DOXA, between Thursday, Nov 8 and Saturday, November 10. We attended a number of panels, workshops, social events & live podcasts. Here are our takeaways: 

Purpose 

In the panel, Politics & Podcasting, Charlie Demers pointed out that podcasts “fulfill… the promise of the internet” as opposed to social platforms such as Twitter and Facebook, because podcasts are “more thoughtful.” This resonated with us, as one of the primary functions of academia is to encourage students to demonstrate deep learning and we would encourage faculty to learn how to use podcasts both to deliver course material and to provide students with another modality with which to express their ideas. 

Technology 

Most of the kit recommended by the senior sound CBC sound engineer, Cesil Fernandes in Sonic Sorcery: The Magic Tricks of Sound Design, such as the Zoom, Shure and Sennheiser microphones and portable recorders, are already available through EdTech or AVIT. Additionally, of course, smartphones (with or without attached microphones) can be used as a “safe” adjunct, should another recording device fail. 

EdTech also has an insulated studio in which to record audio, available to be booked by Langara faculty. 

Networking 

In the course of the three days, we met a variety of people from a variety of backgrounds, including Johanna Wagstaff, Lisa Kristiansen, Ian Hanomansing, & other CBC luminaries; well-established podcasters Karina Longworth and Helen Zalzberg; neophyte podcasters and students. 

These connections were both informative and inspiring, especially as one of the CBC producers is the parent of one of Karen’s current students and was able to talk knowledgably about Langara. We also spent time with two recent UBC film grads who run a podcast and learned much about how they set it up and the best way to deliver podcasts to students. More importantly, we learned how they created this student podcast and gained valuable insight into how we might support Langara instructors to help their students establish one based on this model. One of the great strengths of podcasts as a learning tool is that they can be delivered directly to students’ mobile devices, allowing them to study on the move.

16 Education Podcasts to check out in 2017

Following on from a session on podcasting that we delivered at the recent EdTech Instructor Gathering, here is a nice summary from EdSurge News of 16 podcasts on education to look out for in 2017.

It’s a golden age of education podcasts. Teachers, professors, education innovators, and tech skeptics have switched on their microphones to share their insights and analysis—and you’ll find plenty of lively characters and fresh voices via your earbuds. After all, let’s face it, teachers can be great talkers (we mean that in a good way), and they’re also seasoned storytellers.

EdSurge News: 16 education podcasts to check out in 2017