Anthropic unveils BioMysteryBench to test Claude's bioinformatics skills against human experts, and says Mythos solved ~30% of 23 questions that stumped experts (Anthropic)
Culture Index
Score Breakdown
Relevance: 3/25
Freshness: 25/25
Authority: 25/20
Brand Signal: 11/15
Depth: 4/15
5-Axis Cultural Radar
Anthropic: In this post, Brianna, a researcher on the discovery team, shares results from a recent bioinformatics benchmarking effort.


