OpenAI Releases LifeSciBench: Measuring AI Systems' Capabilities in Real-World Scientific Research Scenarios

PANews, June 20 – OpenAI has officially released a new evaluation benchmark, LifeSciBench, designed to measure the capabilities of AI systems in real-world scientific research scenarios. LifeSciBench is based on 750 expert-designed tasks covering 7 categories of research workflows and 7 biology domains. The tasks were sourced from 173 researchers with doctoral degrees and experience in the biotech or pharmaceutical industries. The benchmark emphasizes the assessment of complex scientific research capabilities, including evidence integration, experimental design, data analysis, scientific reasoning, and research communication, rather than isolated factual questions. Over 79% of the tasks involve multi-step reasoning, with an average of approximately 4 reasoning steps per question, and include 1,062 real research-related data attachments (such as papers, figures, sequence data, and structural files).

Share to:

Author: PA一线

This content is for market information only and is not investment advice.

Follow PANews official accounts, navigate bull and bear markets together
PANews APP
Axelar Network Responds to Security Incident: Vulnerability Stemmed from Third-Party Token Contract's 'Infinite Minting' Issue
PANews Newsflash