Artificial Analysis overhauls its AI Intelligence Index, replacing saturated benchmarks with real-world tests measuring ...
Joining the ranks of a growing number of smaller, powerful reasoning models is MiroThinker 1.5 from MiroMind, with just 30 ...
MiniMax M2 was released in late October this year. The company stated that M2.1 demonstrated significant improvements in capability over its predecessor, especially in multilingual scenarios. Here, it ...
MLCommons today released AILuminate, a new benchmark test for evaluating the safety of large language models. Launched in 2020, MLCommons is an industry consortium backed by several dozen tech firms.
Depending on the hardware you're using, training a large language model of any significant size can take weeks, months, even years to complete. That's no way to do business — nobody has the ...
Following an unfavorable leaked Alder Lake benchmark earlier this week, another benchmark has been leaked through Geekbench. Unlike the previous benchmark, this one was testing processor performance ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results