Skip to content

Top Stories

Top Stories

Primary Menu
  • Breaking News
  • UNIT CONVERTER
  • QR Code Generator
  • SEO META TAG GENERATOR
  • Background Remover Tool
  • Image Enhancer Tool
  • Image Converter Tool
  • Image Compressor Tool
  • Keyword Research Tool
  • Paint Tool
  • About Us
  • Contact Us
  • Privacy Policy
HOME PAGE
  • Home
  • Uncategorized
  • OpenAI launches program to design new ‘domain-specific’ AI benchmarks
  • Uncategorized

OpenAI launches program to design new ‘domain-specific’ AI benchmarks

VedVision HeadLines April 9, 2025
OpenAI launches program to design new ‘domain-specific’ AI benchmarks


OpenAI, like many AI labs, thinks AI benchmarks are broken. It says it wants to fix them through a new program.

Called the OpenAI Pioneers Program, the program will focus on creating evaluations for AI models that “set the bar for what good looks like,” as OpenAI phrased it in a blog post.

“As the pace of AI adoption accelerates across industries, there is a need to understand and improve its impact in the world,” the company continued in its post. “Creating domain-specific evals are one way to better reflect real-world use cases, helping teams assess model performance in practical, high-stakes environments.”

As the recent controversy with the crowdsourced benchmark LM Arena and Meta’s Maverick model illustrate, it’s tough to know, these days, precisely what differentiates one model from another. Many widely-used AI benchmarks measure performance on esoteric tasks, like solving doctorate-level math problems. Others can be gamed, or don’t align well with most people’s preferences.

Through the Pioneers Program, OpenAI hopes to create benchmarks for specific domains like legal, finance, insurance, healthcare, and accounting. The lab says that, in the coming months, it’ll work with “multiple companies” to design tailored benchmarks and eventually share those benchmarks publicly, along with “industry-specific” evaluations.

“The first cohort will focus on startups who will help lay the foundations of the OpenAI Pioneers Program,” OpenAI wrote in the blog post. “We’re selecting a handful of startups for this initial cohort, each working on high-value, applied use cases where AI can drive real-world impact.”

Companies in the program will also have the opportunity to work with OpenAI’s team to create model improvements via reinforcement fine tuning, a technique that optimizes models for a narrow set of tasks, OpenAI says.

The big question is whether the AI community will embrace benchmarks whose creation was funded by OpenAI. OpenAI has supported benchmarking efforts financially before, and designed its own evaluations. But partnering with customers to release AI tests may be seen as an ethical bridge too far.



Source link

Continue Reading

Previous: Trump’s Treasury Secretary Bessent vows to address regulatory roadblocks to blockchain and stablecoin growth
Next: Can DIS Stock Keep Magic Alive Amid Soaring Tariff Costs?

Related News

World economy could get carved up into these 3 trading blocs
  • Uncategorized

World economy could get carved up into these 3 trading blocs

VedVision HeadLines July 6, 2025
Whales Power Bitcoin Cash ($BCH) to 8-Month High as Golden Cross Signals Breakout
  • Uncategorized

Whales Power Bitcoin Cash ($BCH) to 8-Month High as Golden Cross Signals Breakout

VedVision HeadLines July 6, 2025
A Brief History Of Wallet Clustering
  • Uncategorized

A Brief History Of Wallet Clustering

VedVision HeadLines July 6, 2025

Recent Posts

  • Bobby Brazier shares true feelings about BBC EastEnders exit as actor admits ‘it’s my first real job’
  • King Charles downs glass of whisky as he steps out on final day of Scottish tour
  • World economy could get carved up into these 3 trading blocs
  • Kapil Sharma calls Archana Puran Singh ‘Anaconda’, accuses her of locking Navjot Singh Sidhu; her reply wins hearts | Television News
  • Whales Power Bitcoin Cash ($BCH) to 8-Month High as Golden Cross Signals Breakout

Recent Comments

No comments to show.

Archives

  • July 2025
  • June 2025
  • May 2025
  • April 2025

Categories

  • Current Affairs
  • Shopping
  • Uncategorized

You may have missed

Bobby Brazier shares true feelings about BBC EastEnders exit as actor admits ‘it’s my first real job’
  • Current Affairs

Bobby Brazier shares true feelings about BBC EastEnders exit as actor admits ‘it’s my first real job’

VedVision HeadLines July 6, 2025
King Charles downs glass of whisky as he steps out on final day of Scottish tour
  • Current Affairs

King Charles downs glass of whisky as he steps out on final day of Scottish tour

VedVision HeadLines July 6, 2025
World economy could get carved up into these 3 trading blocs
  • Uncategorized

World economy could get carved up into these 3 trading blocs

VedVision HeadLines July 6, 2025
Kapil Sharma calls Archana Puran Singh ‘Anaconda’, accuses her of locking Navjot Singh Sidhu; her reply wins hearts | Television News
  • Current Affairs

Kapil Sharma calls Archana Puran Singh ‘Anaconda’, accuses her of locking Navjot Singh Sidhu; her reply wins hearts | Television News

VedVision HeadLines July 6, 2025
Copyright © All rights reserved. | MoreNews by AF themes.