Empowering Artists: Innovative Tool to Safeguard Artwork from AI Intrusion through ‘Poisoning’

Robot and young woman face to face.

Imaginima | E+ | Getty Images

Artists who want to share their artwork often face a tough choice: keep it offline or post it on social media and risk having it used to train data-hungry AI image generators.

But a new tool may soon be able to help artists deter AI companies from using their artwork without permission.

It’s called “Nightshade” and was developed by a team of researchers at the University of Chicago. It works by “poisoning” an artist’s creation by subtly changing the pixels of the image so that AI models aren’t able to accurately determine what the image is depicting, according to MIT Technology Review.

While the human eye isn’t able to detect these small changes, they aim to cause a machine-learning model to mislabel the picture as something other than what it is. Since these AI models rely on accurate data, this “poisoning” process would essentially render the image useless for the purposes of training.

If enough of these “poisoned” images are scraped from the web and used to train an AI image generator, the AI model itself may no longer be able to produce accurate images.

For example, researchers fed Stable Diffusion, an AI image generator, and an AI model they created themselves 50 “poisoned” images of dogs, then asked it to generate new pictures of dogs. The generated images featured animals with too many limbs or cartoonish faces that only somewhat resembled a dog, per MIT Technology Review.

After researchers fed Stable Diffusion 300 “poisoned” images of dogs, it eventually began producing images of cats. Stable Diffusion did not respond to CNBC Make It’s request for comment.

On the surface, AI art generators appear to create images out of thin air based on whatever prompt someone gives them.

But it’s not magic helping these generative AI models create realistic looking images of a pink giraffe or an underwater castle — it’s training data, and lots of it.

AI companies train their models on massive sets of data, which helps the models determine what images are associated with which words. In order for an AI model to correctly produce an image of a pink giraffe, it would need to be trained to correctly identify images of giraffes and the color pink.

A lot of the data used to train many generative AI systems is scraped from the web. Although it’s legal in the U.S. for companies to collect data from publicly accessible websites and use it for various purposes, that gets complicated when it comes to works of art since artists typically own the copyright for their pieces and sometimes don’t want their art being used to train an AI model.

While artists can sign up for “opt-out lists” or “do-not-scrape directives,” it’s often difficult to force companies to comply with those, Glaze at UChicago, the team of researchers who created Nightshade, said in an Oct. 24 thread on X, formerly known as Twitter.

“None of these mechanisms are enforceable, or even verifiable. Companies have shown that they can disregard opt-outs without a thought,” they said in the Oct. 24 thread

Reference

Denial of responsibility! Vigour Times is an automatic aggregator of Global media. In each content, the hyperlink to the primary source is specified. All trademarks belong to their rightful owners, and all materials to their authors. For any complaint, please reach us at – [email protected]. We will take necessary action within 24 hours.
DMCA compliant image

Leave a Comment