How Do AI Detectors Work?

AI

11th September 2024

Speak right now to our live team of English staff

AI detectors are tools designed to determine whether a piece of writing has been created by an artificial intelligence, like ChatGPT. These detectors are often used by educators to help spot AI-generated content in assignments or dissertations. However, it’s important to note that these tools are still evolving, and their accuracy can sometimes be questionable. In this article, we’ll explore how AI detectors function and provide some insights into their effectiveness.

How Do AI Detectors Work?

The purpose of AI detectors is to analyse text in order to identify whether it was written by a person or generated by an AI. They do this by looking for particular patterns and unusual features within the text that may hint at AI involvement. Just like AI systems themselves, detectors rely on machine learning models trained on vast datasets to make their assessments. The core idea is to evaluate whether the text seems to follow typical human writing patterns or whether it displays characteristics that suggest AI authorship. To achieve this, AI detectors focus on two main elements: perplexity and burstiness.

Perplexity Explained

Perplexity measures how predictable or logical a sentence appears within a text. In simple terms, it looks at how likely the sentence is to make sense to a reader. If a sentence flows smoothly and reaches an expected conclusion, it is considered more predictable, and thus the text receives a lower perplexity score. For example, a sentence like, “I decided to go for a walk in the park today…” that ends with “…because the weather was nice,” is seen as predictable. When a detector encounters many such sentences, it may conclude that the text could be AI-generated, as AI models are often designed to produce polished and predictable content.

On the other hand, a sentence that ends in an unexpected way or uses less common phrasing, such as “I went for a walk today because the wind whispered stories of the past,” would receive a higher perplexity score. This kind of text feels more complex and creative, leading AI detectors to believe it might have been written by a human. Human writing is often less predictable, with more unique sentence structures and creative language choices, which perplexity scores can reflect.

Understanding Burstiness

Burstiness refers to the variation in sentence structure and length throughout a piece of writing. It’s about whether the text maintains a consistent rhythm or displays a mix of longer and shorter sentences. AI-generated texts often exhibit low burstiness, meaning they tend to use similar sentence lengths and patterns throughout the text. This consistency can make AI-generated writing seem mechanical or formulaic, which detectors use as a clue for AI involvement.

In contrast, human writing usually shows high burstiness, with a natural ebb and flow. A human writer might vary sentence length and complexity, going from a concise, punchy sentence to a more elaborate one in the next line. This variation is typical of human writing, where creativity and spontaneity introduce shifts in the flow of text. As a result, when detectors encounter high burstiness, they are more likely to identify the text as human-authored.

Main Takeaways

In summary, AI detectors are designed to distinguish between human and AI-generated writing based on the predictability (perplexity) and variability (burstiness) of the text. Writing that is highly polished, consistent, and follows predictable patterns is often flagged as AI-generated. Conversely, writing that displays creativity, variety in sentence length and structure, and slight imperfections tends to be labelled as human-produced.

However, it’s worth noting that these detectors are far from perfect. Skilled human writers can produce flawless, predictable text, and some academic writing requires a more rigid structure, which might inadvertently trigger AI detection. Therefore, while AI detectors can be useful, they are not always reliable indicators of whether a text was created by AI.