Breaking
November 22, 2024

AI2’s open source Tulu 3 lets anyone play the AI post-training game Gaylord Contreras | usagoldmines.com

Ask anyone in the open source AI community, and they will tell you the gap between them and the big private companies is more than just computing power. AI2 is working to fix that, first with fully open source databases and models, and now with an open and easily adapted post-training regimen to turn “raw” large language models into usable ones.

Contrary to what many think, “foundation” language models don’t come out of the training process ready to put to work. The pre-training process is necessary, of course, but far from sufficient. Some even believe that pre-training may soon no longer be the most important part at all.

That’s because the post-training process is increasingly being shown to be where real value can be created. That’s where the model is molded from a giant, know-it-all network that will as readily produce Holocaust denial talking points as it will cookie recipes. You generally don’t want that!

Companies are secretive about their post-training regimens because, while everyone can scrape the web and make a model using state-of-the-art methods, making that model useful to, say, a therapist or research analyst is a completely different challenge.

AI2 (formerly known as the Allen Institute for AI) has spoken out about the lack of openness in ostensibly “open” AI projects, like Meta’s Llama. While the model is indeed free for anyone to use and tweak, the sources and process of making the raw model and the method of training it for general use remain carefully guarded secrets. It’s not bad — but it also isn’t really “open.”

AI2, on the other hand, is committed to being as open as it can possibly be, from exposing its data collection, curation, cleaning, and other pipelines to the exact training methods it used to produce LLMs like OLMo.

But the simple truth is that few developers have the chops to run their own LLMs to begin with, and even fewer can do post-training the way Meta, OpenAI, or Anthropic does — partly because they don’t know, but also because it’s technically complex and time-consuming.

Fortunately, AI2 wants to democratize this aspect of the AI ecosystem as well. That’s where Tulu 3 comes in. It’s a huge improvement over an earlier, more rudimentary post-training process (called, you guessed it, Tulu 2); in the nonprofit’s tests, this resulted in scores on par with the most advanced “open” models out there. It’s based on months of experimentation, reading, and interpreting what the big guys are hinting at, and lots of iterative training runs.

a diagram doesn’t really capture it all, but you see the general shape of it.Image Credits:AI2

Basically, Tulu 3 covers everything from choosing which topics you want your model to care about — for instance, downplaying multilingual capabilities but dialing up math and coding — then takes it through a long regimen of data curation, reinforcement learning, fine tuning and preference tuning, plus tweaking a bunch of other meta-parameters and training processes that I couldn’t adequately describe to you. The result is, hopefully, a far more capable model focused on the skills you need it to have.

The real point, though, is taking one more toy out of the private companies’ toybox. Previously, if you wanted to build a custom-trained LLM, it was very hard to avoid using a major company’s resources one way or the other, or hiring a middleman who would do the work for you. That’s not only expensive, but it introduces risks that some companies are loath to take.

For instance, medical research and service companies: sure, you could use OpenAI’s API, or talk to Scale or whoever to customize an in-house model, but both of these involve outside companies in sensitive user data. If it’s unavoidable, you just have to bite the bullet — but if it isn’t? Like if, for instance, a research organization released a soup-to-nuts pre- and post-training regimen that you could implement on-premises? That may well be a better alternative.

AI2 is using this itself, which is the best endorsement one can give. Even though the test results its publishing today use Llama as a foundation model, they’re planning to put out an OLMo-based, Tulu-3-trained model soon that should offer even more improvements over the baseline and also be fully open source, tip to tail.

If you’re curious how the model performs currently, give the live demo a shot.

 

This articles is written by : Nermeen Nabil Khear Abdelmalak

All rights reserved to : USAGOLDMIES . www.usagoldmines.com

You can Enjoy surfing our website categories and read more content in many fields you may like .

Why USAGoldMines ?

USAGoldMines is a comprehensive website offering the latest in financial, crypto, and technical news. With specialized sections for each category, it provides readers with up-to-date market insights, investment trends, and technological advancements, making it a valuable resource for investors and enthusiasts in the fast-paced financial world.

Recent:

Durabook and parent company, Twinhead International Corp., celebrate 40 Years of innovation in compu...
Three mystery whales have each spent $10 billion-plus on Nvidia’s AI chips so far this year Gaylord ...
Nvidia, Snowflake and BEN Ride AI Surge in Q3 Gaylord Contreras | usagoldmines.com
Reimagining the Future of Data Computing with Compute Express Link (CXL) Interconnects Ali Guerra | ...
The Amazing Rise Of 1-Bit LLMs For Building Faster And Slimer Generative AI Apps Gaylord Contreras |...
AI in healthcare drives computing innovation forward Ali Guerra | usagoldmines.com
Steelers vs. Browns NFL props, Thursday Night Football picks, AI prediction: Russell Wilson tops 187...
Nova Ltd. Expands Innovative Nova Fit® Machine Learning Capabilities to Enhance VeraFlex® Platform G...
Elon Musk told X users to upload their medical information to train AI bot Grok Gaylord Contreras | ...
Deus in machina: Swiss church installs AI-powered Jesus | Artificial intelligence (AI) Gaylord Contr...
Google has an unfair advantage on AI, DOJ alleges Gaylord Contreras | usagoldmines.com
Even Nvidia’s CEO is obsessed with Google’s NotebookLM AI tool Macky Briones | usagoldmines.com
Even Nvidia’s CEO is obsessed with Google’s NotebookLM AI tool Macky Briones | usagoldmines.com
AI Is Helping Brands Reach More Audiences Across Social Media Gaylord Contreras | usagoldmines.com
National Cloud Computing Policy to be finalised by year end Ali Guerra | usagoldmines.com
Will AI replace humans? Yoshua Bengio warns of artificial intelligence risks Gaylord Contreras | usa...
DOJ wants Google to sell Chrome and possibly Android, more Hallie Frederick | usagoldmines.com
NVIDIA Accelerates Majority of World’s Supercomputers Ali Guerra | usagoldmines.com
Artificial Intelligence Can Be a Superpower for Financial Advisors Gaylord Contreras | usagoldmines....
OneCell Diagnostics bags $16M to help limit cancer reoccurrence using AI Gaylord Contreras | usagold...
Enterprise Productivity Is the Easiest AI Sell Macky Briones | usagoldmines.com
Swiveling Massage Seats, AI Driving Modes, and Pixels Everywhere Gaylord Contreras | usagoldmines.co...
Claroty veterans launch Twine with $12M in Seed funding from Dell and Wiz founders to Gaylord Contre...
Can a fluffy robot really replace a cat or dog? My weird, emotional week with an AI pet | Artificial...
Open Text Corporation (OTEX) Unveils Cloud Editions (CE) 24.4 with AI-Driven Innovations to Enhance ...
Nvidia’s AI chip demand still booming but slowing sales growth worries investors Gaylord Contreras |...
Google’s Gemini AI now has a memory Gaylord Contreras | usagoldmines.com
Better Artificial Intelligence Stock: Nvidia vs. Palantir Gaylord Contreras | usagoldmines.com
Self-learning AI makes college football against the spread, money line, over/under picks for Week 13...
Self-learning AI makes college football against the spread, money line, over/under picks for Week 13...
Google’s Gemini AI now has a memory Gaylord Contreras | usagoldmines.com
Mizzle Partners with InFlux Technologies to Power DePIN Platform with Decentralized Cloud Infrastruc...
AI infrastructure transforming computing and sustainability Ali Guerra | usagoldmines.com
Nvidia rivals focus on building a different kind of chip to power AI products Ali Guerra | usagoldmi...
Meet your own personal AI Jesus in this Swiss church’s confessional Gaylord Contreras | usagoldmines...
China Turns to Silicon Valley to Bolster Homegrown AI Firms Gaylord Contreras | usagoldmines.com
Meta pushes AI bid for UK public sector forward with technology aimed at NHS | Meta Gaylord Contrera...
Microsoft pitches AI ‘agents’ that can perform tasks on their own at Ignite 2024 Gaylord Contreras |...
Physical AI startup BrightAI bootstraps to $80M in revenue Gaylord Contreras | usagoldmines.com
Report: DOJ wants to force Google Chrome sale, Android de-bundling Hallie Frederick | usagoldmines.c...
Sam Altman seeks backers for AI chipmaker to challenge Nvidia: source Gaylord Contreras | usagoldmin...
Meta hires Salesforce’s CEO of AI, Clara Shih, to lead new business AI group Gaylord Contreras | usa...
Expert Warns of AI Chatbot Risks After Teen User’s Suicide Gaylord Contreras | usagoldmines.com
The US Patent and Trademark Office Banned Staff From Using Generative AI Gaylord Contreras | usagold...
Expert believes AI is likely a factor in Marriott slashing jobs Gaylord Contreras | usagoldmines.com
As public perception of AI sours, crowdfunding platforms scramble Gaylord Contreras | usagoldmines.c...
High- Performance Computing as a Service Market Size Will Ali Guerra | usagoldmines.com
TG to become a CoE in Quantum Computing: Min Sridhar Babu Ali Guerra | usagoldmines.com
AI cloning of celebrity voices outpacing the law, experts warn | Artificial intelligence (AI) Gaylor...
Stocks rebound — plus, we’re raising our price target on a transforming AI play Gaylord Contreras | ...
Cowboys vs. Texans betting guide, Monday Night Football odds, props: AI, expert, model, DFS fantasy ...
Marc Benioff ‘blown away’ by Google Gemini AI voice assistant Gaylord Contreras | usagoldmines.com
Meet The New Boss: Artificial Intelligence Gaylord Contreras | usagoldmines.com
These Artificial Intelligence (AI) Stocks Have Soared Since Trump Won the Election, but Should You B...
San Antonio International Airport debuts new parking technology Gaylord Contreras | usagoldmines.com
Ben Affleck tells actors and writers not to worry about AI Gaylord Contreras | usagoldmines.com
The 7 Revolutionary Cloud Computing Trends That Will Define Business Success In 2025 Ali Guerra | us...
Microsoft starts boiling the Copilot frog • The Register Gaylord Contreras | usagoldmines.com
Google Docs now lets you generate AI images directly within documents Gaylord Contreras | usagoldmin...
Self-Evolving Reward Learning aligns LLMs with less human feedback Gaylord Contreras | usagoldmines....
Mobile AI opens new horizons for sustainable business growth in the digital age Gaylord Contreras | ...
Nasoya Introduces Tofie, World’s First AI-Powered Tofu Chatbot Gaylord Contreras | usagoldmines.com
Huawei’s Mate70 to flex high-end chip self-sufficiency Chris Mendez | usagoldmines.com
Using artificial intelligence in education: decision tree learning results in secondary school stude...
Building a Sustainable Future: Cloud Computing in Environmental Science | nasscom Ali Guerra | usago...
Nvidia Faces Risk from Potential Tariffs Amidst AI Boom, Bloomberg Analyst Says Gaylord Contreras | ...
Can AI Speak Culture? | Psychology Today Gaylord Contreras | usagoldmines.com
Are Quantum Computers the Secret Threat to Bitcoin’s Future? Ali Guerra | usagoldmines.com
Human-AI Coevolution Is Said To Be Coming Whether Humanity Likes It Or Not Gaylord Contreras | usago...
Meta and others now allow military to access their AI Gaylord Contreras | usagoldmines.com
My Career Advice As a Google Researcher Working in AI for 20 Years Gaylord Contreras | usagoldmines....
Spark Study Buddy (Challenger): AI algorithm matches pig sounds to their emotions – Young Post Gaylo...
AI Makes Echocardiography Faster, More Accessible Gaylord Contreras | usagoldmines.com
A popular technique to make AI more efficient has drawbacks Gaylord Contreras | usagoldmines.com
Chargers vs. Bengals NFL props, Sunday Night Football picks, AI prediction: Justin Herbert over 230....
Amazon offers free computing power to AI researchers, aiming to challenge Nvidia Ali Guerra | usagol...
AI Firm Genius Group Adopts Bitcoin as Primary Treasury Reserve Asset Gaylord Contreras | usagoldmin...
3 New AI Smart Home Features Arrive With Gemini and Google Nest Gaylord Contreras | usagoldmines.com
The mental health implications of artificial intelligence adoption: the crucial role of self-efficac...
How Artificial Intelligence Is Supercharging Digital Manipulation Gaylord Contreras | usagoldmines.c...
Transform your content creation with AI MagicX Gaylord Contreras | usagoldmines.com
‘Have your bot speak to my bot’: can AI productivity apps turbocharge my life? | Artificial intellig...
Qualcomm Q4 Earnings: Focus On The Long-Term Edge AI Picture (NASDAQ:QCOM) Gaylord Contreras | usago...
I’m a multitasking machine on my laptop — this Intel Lunar Lake change is a dealbreaker Gaylord Cont...
8 ChatGPT productivity tips and tricks Gaylord Contreras | usagoldmines.com
How a Hong Kong start-up’s AI-powered smart bin plans to tackle recycling Gaylord Contreras | usagol...
Does Africa need to embrace AI to keep its music centre stage? Gaylord Contreras | usagoldmines.com
Eyeing $500B AI Server Market by 2028 Amid Workforce Realignment Gaylord Contreras | usagoldmines.co...
OpenAI Has a Warning for Nvidia. Is the AI Bubble Bursting? Gaylord Contreras | usagoldmines.com
Multi-Agent AI Orchestration Shaping Up But Here’s Why It Might Not Be Fully Shipshape Gaylord Contr...
Multi-Agent AI Orchestration Shaping Up But Here’s Why It Might Not Be Fully Shipshape Gaylord Contr...
Fake AI video generators infect Windows, macOS with infostealers Gaylord Contreras | usagoldmines.co...
Phone Provider Deploys “State-of-the-Art AI Granny” to Waste Scammers’ Time Gaylord Contreras | usag...
Biden and Xi agree humans, not AI, should decide on nuclear weapon use | Joe Biden Gaylord Contreras...
Biden and Xi take a first step to limit AI and nuclear decisions : NPR Gaylord Contreras | usagoldmi...
Quantum computing: Boon or bane? Ali Guerra | usagoldmines.com
Google’s AI Search Experiment: “Learn About” Gaylord Contreras | usagoldmines.com
Self-learning AI gives NFL against the spread, over-under, money-line picks for every Week 11, 2024 ...
Alison.ai Closes $13.3M Seed Funding, Aims to Transform Global Ad Campaigns Gaylord Contreras | usag...
Our brains are vector databases — here’s why that’s helpful when using AI Gaylord Contreras | usagol...

Leave a Reply