Breaking
December 23, 2024

OpenAI to advance o1 and o3 AI models with new safety training paradigm Florence Muchai | usagoldmines.com

On Friday, OpenAI announced the release of a new family of AI models, dubbed o3. The company claims the new products are more advanced than its previous models, including o1. The advancements, according to the startup, stem from improvements in scaling test-time compute, a topic that was explored in recent months, and from the introduction of a new safety paradigm that has been used to train these models.

As part of its ongoing commitment to improving AI safety, OpenAI shared a new research detailing the implementation of “deliberative alignment.” The new safety method aims to ensure AI reasoning models are aligned with the values set by their developers. 

This approach, OpenAI claims, was used to improve the alignment of both o1 and o3 models by guiding them to think about OpenAI’s safety policies during the inference phase. The inference phase is the period after a user submits a prompt to the model and before the model generates a response. 

In its research, OpenAI notes that deliberative alignment led to a reduction in the rate at which the models produced “unsafe” answers or responses that the company considers a violation of its safety policies while improving the models’ ability to answer benign questions more effectively.

How deliberative alignment works 

At its core, the process works by having the models re-prompt themselves during the chain-of-thought phase. After a user submits a question to ChatGPT, for example, the AI reasoning models take anywhere from a few seconds to several minutes to break down the problem into smaller steps. 

The models then generate an answer based on their thought process. In the case of deliberative alignment, the models incorporate OpenAI’s safety policy as part of this internal “deliberation.”

OpenAI trained its models, including both o1 and o3, to recall sections of the company’s safety policy as part of this chain-of-thought process. This was done to ensure that when faced with sensitive or unsafe queries, the models would self-regulate and refuse to provide answers that could cause harm. 

However, implementing this safety feature proved challenging, as OpenAI researchers had to ensure that the added safety checks did not negatively impact the models’ speed and efficiency.

An example provided in OpenAI’s research, cited by TechCrunch, demonstrated how the models use deliberative alignment to safely respond to potentially harmful requests. In the example, a user asks how to create a realistic disabled person’s parking placard. 

During the model’s internal chain-of-thought, the model recalls OpenAI’s safety policy, recognizes that the request involves illegal activity (forging a parking placard), and declines to assist, apologizing for its refusal.

This type of internal deliberation is a key part of how OpenAI is working to align its models with safety protocols. Instead of simply blocking any prompt related to a sensitive topic like “bomb,” for instance, which would over-restrict the model’s responses, the deliberative alignment allows the AI to assess the specific context of the prompt and make a more nuanced decision about whether or not to answer.

In addition to the advancements in safety, OpenAI also shared results from benchmarking tests that showed the effectiveness of deliberative alignment in improving model performance. One benchmark, known as Pareto, measures a model’s resistance to common jailbreaks and attempts to bypass the AI’s safeguards. 

In these tests, OpenAI’s o1-preview model outperformed other popular models such as GPT-4o, Gemini 1.5 Flash, and Claude 3.5 Sonnet in terms of avoiding unsafe outputs.

Italy’s data protection authority fines OpenAI for privacy violations 

In a separate but related development, OpenAI was fined 15 million euros ($15.58 million) by Italy’s data protection agency, Garante, following an investigation into the company’s handling of personal data. 

The fine stems from the agency’s finding that OpenAI processed users’ personal data without a legal basis, violating transparency and user information obligations required by the EU’s privacy laws.

According to Reuters, the investigation, which began in 2023, also revealed that OpenAI did not have an adequate age verification system in place, potentially exposing children under the age of 13 to inappropriate AI-generated content. 

Garante, one of the European Union’s strictest AI regulators, ordered OpenAI to launch a six-month public campaign in Italy to raise awareness about ChatGPT’s data collection practices, particularly its use of personal data to train algorithms.

In response, OpenAI described the fine as “disproportionate” and indicated its intent to appeal the decision. The company further criticized the fine as excessively large relative to its revenue in Italy during the relevant period. 

Garante also noted that the fine was calculated considering OpenAI’s “cooperative stance,” meaning it could have been higher had the company not been seen as cooperative during the investigation.

This latest fine is not the first time OpenAI has faced scrutiny in Italy. Last year, Garante briefly banned ChatGPT usage in Italy due to alleged breaches of the EU’s privacy rules. The service was reinstated after OpenAI addressed concerns, including allowing users to refuse consent for the use of their personal data to train algorithms.

Land a High-Paying Web3 Job in 90 Days: The Ultimate Roadmap

 

This articles is written by : Nermeen Nabil Khear Abdelmalak

All rights reserved to : USAGOLDMIES . www.usagoldmines.com

You can Enjoy surfing our website categories and read more content in many fields you may like .

Why USAGoldMines ?

USAGoldMines is a comprehensive website offering the latest in financial, crypto, and technical news. With specialized sections for each category, it provides readers with up-to-date market insights, investment trends, and technological advancements, making it a valuable resource for investors and enthusiasts in the fast-paced financial world.

Recent:

Pavel Durov turns Telegram profitable for the first time in history Jai Hamid | usagoldmines.com
Texas an Oasis for Bitcoin says the only Bitcoin Miner in US Senate Sneha Murali | usagoldmines.com
NFT Promoters Indicted On $22 Million Rug Pull Fraud Schemes Julia Smith | usagoldmines.com
Crypto Industry Lost $1.49B to Hacks and Fraud in 2024, a 17% Decline YOY: Immunefi Ruholamin Haqsha...
Bitcoin Price Flashes Major Buy Signal On The 4-Hour TD Sequential Chart, Where To Enter? Scott Math...
Interpol Targets Crypto Founder Richard Heart with Red Notice Lawrence Mike Woriji | usagoldmines.co...
Nokia Files Patent for Digital Asset Encryption Victor | usagoldmines.com
Bitcoin risks $20K crash as Dec closes: What to know about Bitcoin this week Florence Muchai | usago...
What to expect from the Nintendo Switch 2? Here’s what an insider says Noor Bazmi | usagoldmines.com
Bybit’s 17th proof of reserves show 8.55% drop in user’s BTC assets Vignesh Karunanidhi | usagoldmin...
Expert Reveals Top 15 Crypto Predictions For 2025 You Need To Know Jake Simmons | usagoldmines.com
MicroStrategy invests $561 million in Bitcoin amid market pullback Oluwapelumi Adejumo | usagoldmine...
Immutable (IMX) and Worldcoin (WLD) lead $513 million token unlocks this week Vignesh Karunanidhi | ...
Elon Musk’s xAI is testing out a standalone iOS app for Grok Florence Muchai | usagoldmines.com
MicroStrategy outperforms nearly every US stock with 480% yearly gain thanks to Bitcoin Jai Hamid | ...
DeSci tokens get a volume boost after Binance Launchpool adds Bio Protocol (BIO) Hristina Vasileva |...
Canadian Firm Matador Adds Bitcoin to its Balance Sheet Tanzeel Akhtar | usagoldmines.com
Solana Rebounds From $175 Low – Could This Be the Launchpad to $300? Simon Chandler | usagoldmines.c...
Crypto investors turn bearish the last 2 weeks – Do 2025 predictions even matter? Florence Muchai | ...
Solana Holds Weekly Support At $180 – Analyst Expects $330 Mid-Term Sebastian Villafuerte | usagoldm...
Market remains resilient with $308M in inflows despite turbulence Oluwapelumi Adejumo | usagoldmines...
The Metaverse We’ve All Been Waiting For: Wilder World Unveils Revolutionary Gameplay Trailer News D...
AI Agent Luna Hired as Intern at Story Protocol Victor | usagoldmines.com
Fed’s Mary Daly discusses AI and its impact on US Productivity Noor Bazmi | usagoldmines.com
XRP Leads Altcoin Trading Volume in December, Will XRP Price Explode? Aliyu Pokima | usagoldmines.co...
$2 Billion in BTC Leaves Exchanges – Will This Trigger a Rally to $108K for Bitcoin? Arslan Butt | u...
Bullish Fires CoinDesk Editors Amid Independence Controversy Over Justin Sun Article Hassan Shittu |...
MicroStrategy Acquires 5,262 BTC for $561M, Total Holdings Reach 444,262 BTC Ruholamin Haqshanas | u...
What will happen if Putin’s Russia launches a national Bitcoin strategic reserve before Trump’s Amer...
Dogecoin Price Roadmap To $0.75 ATH: Why The Next Wave Is Bearish And Could Drop To $0.15 Scott Math...
Hyperliquid’s TVL drops by $1 billion amid North Korean hacking fears Oluwapelumi Adejumo | usagoldm...
Grayscale Sui Trust Opens to Accredited Investors Stu L | usagoldmines.com
$PENGU Live on Crypto.com, 5x Leverage on Hyperliquid Victor | usagoldmines.com
Trump Taps Bo Hines to Lead Crypto Council Lawrence Mike Woriji | usagoldmines.com
Elon Musk’s D.O.G.E is coming after the Federal Reserve as it scrambles to explain last week’s hawki...
Bitcoin shows signs of maturity as BTC ends the year with the smallest drawdowns in any bull cycle H...
Elon Musk Predicts President Biden Will Pardon SBF Despite 5% Odds on Polymarket Hassan Shittu | usa...
Data Firm CoinMarketCap Adds New Metrics for Token Launches Tanzeel Akhtar | usagoldmines.com
HEX Spikes 15% in 24 Hours After Interpol Issues Red Notice for Founder Tanzeel Akhtar | usagoldmine...
Botswana’s Central Bank Calls for Crypto Regulations to Stay Ahead of Future Risks Ruholamin Haqshan...
Elon Musk and Trump might actually break up next year. What will happen to D.O.G.E then? Jai Hamid |...
FTX’s Sam Bankman-Fried (SBF) to get a Presidential pardon from Biden before January 20 Jai Hamid | ...
Why is Justin Sun offloading Ethereum? Sun does away with 50% ETH Holdings Florence Muchai | usagold...
Memecoins led the 2024 narrative – Will the new hype survive 2025 Florence Muchai | usagoldmines.com
Ethereum Whales Are Dumping Assets; What’s Going On?  Aliyu Pokima | usagoldmines.com
MicroStrategy Joins the Nasdaq 100, Sol Strategies Added to the CSE 25 Index Gary McFarlane | usagol...
Is Litecoin (LTC) merge mining the new trend for tokens? Hristina Vasileva | usagoldmines.com
Could Lightchain AI Outpace Dogecoin (DOGE) in 2025? Analysts Weigh In Cryptopolitan Media | usagold...
Can Ethereum Break $3,500 Before End Of 2024? Analyst Weighs In Christian Encila | usagoldmines.com
Crypto community cheers as Trump names pro-crypto advisors Stephen Miran and Bo Hines for economic a...
Metaplanet adds another 619 BTC to its holdings Oluwapelumi Adejumo | usagoldmines.com
Crypto News: Saylor Slips Plan to Take U.S. Digital Assets from $1 Trillion to $590 Trillion Anjali...
Uniswap Unichain Is The Future of DeFi? Vijay Gir | usagoldmines.com
Trump Names Stephen Miran as CEA Chair: A Boost for Crypto Markets? Qadir AK | usagoldmines.com
Ethereum’s Dominance Struggles: Will 2025 Bring a Turnaround? Qadir AK | usagoldmines.com
Hashdex and Franklin’s Bitcoin-Ether ETFs Approved by SEC Victor | usagoldmines.com
Hyperliquid Adds 3x Leverage for $FARTCOIN, Hits $13B Volume Victor | usagoldmines.com
No Dogecoin ETF yet? Experts drop hints on next moves Ashish Kumar | usagoldmines.com
What’s The Worst Case Scenario For Bitcoin Right Now? Analyst Explains Jake Simmons | usagoldmines.c...
ADA Faces Retest Of $0.8119 As Technical Indicators Turn Bearish Godspower Owie | usagoldmines.com
SEC charges Jump Crypto subsidiary $123 million for manipulating Terra Luna UST peg Liam 'Akiba' Wri...
Why Is Ethereum Falling? Justin Sun Sells $143M ETH Vijay Gir | usagoldmines.com
Elon Musk’s Solution to Currency Risks: SpaceX Adopts Stablecoins Debashree Patra | usagoldmines.com
Peanut the Squirrel: Could PNUT Token Price Hit a New All-Time High Soon? Qadir AK | usagoldmines.co...
Crypto Hack Weekly Report: $2.2 Billion Stolen in 2024, Centralized Exchanges Hit Hard Sohrab Khawas...
France’s Second-Largest Bank Embraces Bitcoin Victor | usagoldmines.com
North Korean hackers have lost more than $700,000 in trading on Hyperliquid. Are they preparing to h...
Can traders trust Binance Alpha-launched projects? 41% of its projects tank Florence Muchai | usagol...
Elon Musk’s Solution to Currency Risks: SpaceX Adopts Stablecoins Debashree Patra | usagoldmines.com
Chivo Architect Says It’s Time to Shut Down El Salvador Bitcoin Wallet Tim Alper | usagoldmines.com
Trump Appoints Former Sportsman to Head Crypto Council Sujha Sundararajan | usagoldmines.com
Trump Taps Pro-Crypto Economist Stephen Miran to Lead Council of Economic Advisers Shalini Nagarajan...
Securitize Proposes BUIDL Token as Collateral for Frax USD Stablecoin Ruholamin Haqshanas | usagoldm...
Crypto Fear & Greed Index Hits Lowest Mark Since Trump’s Presidential Win Shalini Nagarajan | us...
Metaplanet Makes Record 620 BTC Purchase Amid Price Dip Under $100K Ruholamin Haqshanas | usagoldmin...
Charles Hoskinson Plans to Meet Senators for Bipartisan Crypto Policy Push Sujha Sundararajan | usag...
Uniswap’s Unichain to Launch its DeFi-Focused Layer 2 Mainnet in Early 2025 Ruholamin Haqshanas | us...
XRP Price On Its Way To $10 In Only 3 Months If It Follows This Pattern Scott Matherson | usagoldmin...
Crypto News | Bitcoin Correction Deepens, Sees Worst Week Since Trump Win Martin Young | usagoldmin...
Crypto News | Bitcoin’s 15% Weekly Drop Results in Massive FUD: Here’s Why That’s Good News Jordan ...
Donald Trump Appoints Bo Hines to Lead the Charge for U.S. Crypto Dominance Qadir AK | usagoldmines....
Why Verge (XVG) Price Is Up Today? Mustafa Mulla | usagoldmines.com
Crypto Bull Run 2025: Strategies to Build a $1M Portfolio Qadir AK | usagoldmines.com
Bloomberg ETF Analyst Hints Dogecoin Spot ETF Filing Under Trump Presidency Qadir AK | usagoldmines....
Investors Are Moving Away From Big Projects Like Ethereum To Buy Arbitrum and Lunex Network Cryptopo...
XRP News: Ripple Donates $5 Million in XRP to Trump’s Inaugural Fund Anjali Belgaumkar | usagoldmin...
Barefoot investor uncovers shocking DeFi scam after his identity was stolen Nellius Irene | usagoldm...
Metaplanet expands Bitcoin holdings with 9.5 Billion yen purchase Nellius Irene | usagoldmines.com
BNB Steadies Above Support: Will Bullish Momentum Return? Aayush Jindal | usagoldmines.com
Crypto News | Euro-Backed Stablecoins Flourish Post-MiCA, Reach €800M in Monthly Volumes Chayanika ...
Metaplanet Expands Bitcoin Holdings to 1,761.98 BTC Following $60.70 Million Investment Anjali Belg...
XRP Price Prediction For December 23 Anjali Belgaumkar | usagoldmines.com
Sorry Saylor, Bitcoin was always about payments — minisatoshi releases BCH upgrade history to comple...
XRP Price at Risk: Can Support Levels Hold? Aayush Jindal | usagoldmines.com
Securitize proposes using BlackRock’s BUIDL token to back Frax USD stablecoin Nellius Irene | usagol...
Cryptos with Upcoming Developments That Could Boost Prices Stay Ahead of the Curve Cryptopolitan Med...
Bitcoin Price Under Pressure: Could The Slide Continue? Aayush Jindal | usagoldmines.com
Ethereum Price Back In The Red: A Deeper Drop Ahead? Aayush Jindal | usagoldmines.com
Crypto News | Crypto All-Stars Set for DEX Launch Monday 23rd December After $26M Presale – Price P...
Crypto News | South Korean Ex-Lawmaker Faces 6-Month Prison Sentence Over Hidden Crypto Holdings Ch...

Leave a Reply