OpenAI's ChatGPT-5: A Safety Experiment Gone Sideways | The San Francisco Frontier

The San Francisco Frontier | Est. 2025
© 2025 dpi Media Group. All rights reserved.

OpenAI's ChatGPT-5: A Safety Experiment Gone Sideways

Instagram - @andrewtneel | Donations - paypal.me/AndrewNeel

Photo by Andrew Neel on Unsplash

OpenAI’s latest iteration of ChatGPT, GPT-5, promised safer interactions and more nuanced responses, but early testing reveals significant challenges in content moderation. The new model aims to transform how AI handles potentially inappropriate requests by shifting focus from input analysis to output evaluation.

Researchers like Saachi Jain from OpenAI’s safety systems team emphasize a more sophisticated approach to content filtering. Instead of simply refusing prompts, the new system now provides explanatory context about why certain requests cannot be fulfilled and sometimes suggests alternative conversation paths.

However, initial investigations suggest the safety mechanisms aren’t foolproof. By manipulating custom instruction settings, researchers discovered ways to generate explicit content and offensive language. In one test, using a deliberate misspelling of a suggestive term enabled the AI to engage in graphic sexual role-play and produce inappropriate slurs.

The findings highlight ongoing challenges in AI content moderation. While OpenAI continues refining its models, the current version demonstrates that seemingly robust safety protocols can have unexpected vulnerabilities. Custom instruction features, designed to personalize user experiences, potentially create loopholes in content filtering systems.

OpenAI acknowledges these issues as an “active area of research,” indicating ongoing efforts to improve AI safety mechanisms. The company recognizes that instruction hierarchies and safety policies require continuous refinement to prevent unintended content generation.

For users and tech enthusiasts, these revelations underscore the complexity of developing truly safe and responsible AI systems. As personalization features expand, maintaining appropriate boundaries becomes increasingly challenging.

The GPT-5 experiment reveals that while AI technology advances rapidly, human oversight and sophisticated content moderation remain critical in preventing potentially harmful outputs.

AUTHOR: rjv

SOURCE: Wired

technology

Waymo is Taking Over Bay Area Freeways: Your Future Ride is Here

Waymo/Google self-driving car at the Computer History Museum. Mountain View, CA. June 2024

San Francisco Gets Its First Glimpse of Amazon's Futuristic Robotaxis

a car that is driving down the street

Valve's Steam Frame: A Game-Changing Leap into Mobile Gaming

A machine in a silk factory.

AI Jesus: The Chatbot Trying to Slide into Your Spiritual DMs

two hands touching each other in front of a blue background

Inside OpenAI's Bold Plan to Make AI Actually Useful for Everyone

a close up of a laptop on a desk

Your Data Privacy Just Got a Major Upgrade, Thanks to California

Welcome to California, California 2020

Claude AI Chatbot Weaponized: Inside the Groundbreaking Chinese Cyber Espionage Attack

a close up of a computer keyboard with red light

ChatGPT's Group Chat Feature: A New Way to Collaborate with AI

a person holding a cell phone in their hand

Is AI About to Pop? Silicon Valley's Biggest Minds Are Worried

ChatGPT

Battery Power Revolution: How California Is Leading the Clean Energy Charge

aerial photography of grass field with blue solar panels

Rural California's Lifeline is Under Threat: How AT&T Wants to Cut Your Emergency Communication

AT&T icon in 3D. My 3D work may be seen in the section titled "3D Render."

California's Bold Plan to Save Science from Federal Budget Cuts

Headquarters of SpaceX in Hawthorne, California, in March 2024.

Streaming Revolution: MLS Goes All-In on Apple TV Without Extra Cost

Soccer fans, rejoice! Apple TV is making a game-changing move that'll have you cheering from ...

Driverless Cars Hit the Fast Lane: Waymo's Bay Area Freeway Expansion

Software box arch at the Computer History Museum. Mountain View, CA. August 2023

Vine's Legendary Six-Second Videos Are Back from the Digital Graveyard

Social Media Logos in 3D. Facebook, Instagram, Twitter, TikTok, YouTube, LinkedIn. Feel free to contact me through email mariia@shalabaieva.com

news

North Coast Democrat Mike McGuire Challenges Trump Ally in Congressional Race

Jello Biafra talks about California politics

Oakland Community Mourns: Football Coach John Beam's Tragic Murder Shakes the City

Activité sportive avec les enfants

Data Center Battles: How Local Communities Are Pushing Back Against Tech Giants

Data Servers

Breaking Silence: SF Archbishop Challenges Trump's Harsh Immigration Policies

Immigrants make America Great

SF Supervisor's Shocking Downfall: A Political Implosion in the Sunset District

People hold up signs stating "I STAND AGAINST HATE & ANTISEMITISM" AND "THE JEWISH PEOPLE WILL NOT BE BULLIED BY ANTISEMITES" at the Unity Rally, a march against antisemitism in San Francisco.

Kim Guilfoyle's Surprising Diplomacy: From Bay Area Politics to Greek Ambassadorship

Donald Trump, Jr. & Kimberly Guilfoyle

Love, Politics, and Diplomacy: Kimberly Guilfoyle's Complex Web of Relationships

Kimberly Guilfoyle

Independent Candidate Shakes Up Northern California's Political Landscape in 2026 Race

The upcoming 2026 congressional election in California's 2nd District is heating up as Humboldt County ...

Neighbors Sound Alarm: Inside San Jose's Controversial Homeless Housing Transition

Homeless

Protest, Joy, and Resistance: How SF Fought Back Against Authoritarianism

people walking on street during daytime

BART's New Urban Helpers Are Cutting Down Emergency Calls

A subway station with two trains parked in it

Political Power Play: Newsom's Ex-Chief of Staff Caught in Massive Fraud Scheme

Caution Tape at the United States Capitol in Washington D.C.

finance

Silicon Valley Heavyweight Peter Thiel Dumps Nvidia Stocks - What It Means for Tech

Supreme Court Could Force Trump to Refund Billions in Controversial Tariffs

Supreme Court of the United States located in Washington DC which interprets laws. It is the Judicial Branch of the US government.

Tech's New Rockstar CEO is Betting Big on Fixing Housing's Broken System

RCZ_0901

San Francisco's Real Estate Renaissance: From Stalled to Sold

Credit to infiniteviewsllc.com

Elon Musk's Wild Ride: How a Trillion-Dollar Payday Could Reshape Tesla's Future

Elon Musk Taking Questions

Elon Musk's Massive Pay Package: A Controversial Win at Tesla's Shareholder Meeting

Tesla 3D Icon Concept in Dark Mode. It is for you, Elon and fans 🖤

lifestyle

Remembering Alice Wong: A Disability Justice Icon Who Reshaped Our Understanding of Accessibility

Alice Wong: Richmond

Empty-Nesters Trade Texas Suburbs for San Francisco's Urban Adventure

When Laura and David Cole reached a crossroads in their mid-60s, they didn't just downsize, ...

Sake Lovers, Rejoice: Michael Mina's New Intimate Tasting Spot Is a Bay Area Gem

Row of japanese sake bottles with labels

Silicon Valley's Wild Real Estate Game: A $57.5 Million Atherton Mansion Drops Jaws

Large Italian style Villa

K-Pop Culture Takes Over Excelsior: A Night of Music, Dance, and Community

A Bad Boy ~ ~ ~ ~ 📸: Yours Truly ❤️

A Rare Piece of Music History: Taylor Swift's Heartfelt Note to Liam Payne Goes to Auction

An iPad 9th generation playing "Is It Over Now? (Taylor's Version) (From The Vault)" by Taylor Swift on Spotify lying on the white bed sheets next to a mug of hot chocolate || 📸 Photo by: Jovan Vasiljević

A YouTuber's Mysterious Disappearance: The Vanishing of Mikey Rijavec

Motor yacht

Cardi B's New Chapter: Welcoming a Baby Boy and Dropping Fire Music

MADA

Chili's 'Wicked' Margaritas Are Seriously No Joke - Millennials Beware

Girls Party

Bay Area Music Scene Takes a Hit: Live 105's Not So Silent Night Vanishes for 2025

Live 105 BFD 2012

Silicon Valley Engineer Saves Beloved BBQ Joint From Extinction

a plate of food with chopsticks sticking out of it

Bay Area Animation Studio's Dreamy Debut Promises Heartfelt Family Adventure

Mini Figure Kusakabe Mei from My Neighbor Totoro the Movie

startups

Silicon Valley's New Dress Code: When Tech Founders Go to Charm School

smiling woman standing beside smiling man pointing MacBook

Young Bay Area Entrepreneurs Are Rewriting the Rules of Work in the AI Era

woman in black top using Surface laptop