伊藤信吾(Shingo Ito)
文章被技术社区多位大V转发、推荐。,详情可参考新收录的资料
Just to labour the point: I only optimised for one-shot guesstimating hard maths problems and EQ-Bench. I never looked at IFEval, BBH, GPQA, MuSR, or MMLU-PRO during development. The leaderboard was pure out-of-sample validation.。新收录的资料对此有专业解读
They have six packs - but they're still jumping on and off weight-loss jabs。业内人士推荐新收录的资料作为进阶阅读