星宿小小仙
02-06
作为深度追踪研究阿里的人来看,阿里巴巴确实退步了,已经退化传统企业那个水准了,从这几年发稿的水平及策略就可以明显的感知到!!
全球第一:阿里云宣布通义千问Qwen2.5-Max数学及编程能力登顶最新AI大模型榜单
免责声明:上述内容仅代表发帖人个人观点,不构成本平台的任何投资建议。
分享至
微信
复制链接
精彩评论
放养的小兵张嘎
02-06
放养的小兵张嘎
额 我还长期看好阿里呢 看样是我年轻了
什么也没有了~
APP内打开
发表看法
1
3
{"i18n":{"language":"zh_CN"},"detailType":1,"isChannel":false,"data":{"magic":2,"id":400614547132816,"tweetId":"400614547132816","gmtCreate":1738814041284,"gmtModify":1738814043662,"author":{"id":3474342316200448,"idStr":"3474342316200448","authorId":3474342316200448,"authorIdStr":"3474342316200448","name":"星宿小小仙","avatar":"https://static.tigerbbs.com/66dfe4fd824479ae852bbd63bd7aec17","vip":1,"userType":1,"introduction":"","boolIsFan":false,"boolIsHead":false,"crmLevel":1,"crmLevelSwitch":0,"individualDisplayBadges":[],"fanSize":4,"starInvestorFlag":false},"themes":[],"images":[],"coverImages":[],"html":"<html><head></head><body><p>作为深度追踪研究阿里的人来看,阿里巴巴确实退步了,已经退化传统企业那个水准了,从这几年发稿的水平及策略就可以明显的感知到!!</p></body></html>","htmlText":"<html><head></head><body><p>作为深度追踪研究阿里的人来看,阿里巴巴确实退步了,已经退化传统企业那个水准了,从这几年发稿的水平及策略就可以明显的感知到!!</p></body></html>","text":"作为深度追踪研究阿里的人来看,阿里巴巴确实退步了,已经退化传统企业那个水准了,从这几年发稿的水平及策略就可以明显的感知到!!","highlighted":1,"essential":1,"paper":1,"likeSize":3,"commentSize":1,"repostSize":0,"favoriteSize":0,"link":"https://laohu8.com/post/400614547132816","repostId":2509169409,"repostType":2,"repost":{"id":"2509169409","kind":"news","pubTimestamp":1738753438,"share":"https://www.laohu8.com/m/news/2509169409?lang=&edition=full","pubTime":"2025-02-05 19:03","market":"hk","language":"zh","title":"全球第一:阿里云宣布通义千问Qwen2.5-Max数学及编程能力登顶最新AI大模型榜单","url":"https://stock-news.laohu8.com/highlight/detail?id=2509169409","media":"IT之家","summary":"IT之家 2 月 5 日消息,1 月 29 日新年之际,阿里云公布了其全新的通义千问 Qwen 2.5-Max 超大规模 MoE 模型,号称在多个基准测试中超越 DeepSeek V3 登竞争对手。阿里云今日宣布,Qwen2.5-Max 在 Chatbot Arena 大模型盲测中超越 DeepSeek-V3、Open AI o1-mini 和 Claude-3.5-Sonnet 等模型,以 1332 分位列全球第七名,也是非推理类的中国大模型冠军。同时,Qwen2.5-Max 在数学和编程等单项能力上排名第一,在硬提示方面排名第二。因此,Chatbot Arena LLM Leaderboard 成为业界公认的最公正、最权威榜单之一,也是全球顶级大模型的最重要竞技场。","content":"<html><body><p>IT之家 2 月 5 日消息,1 月 29 日新年之际,阿里云公布了其全新的通义千问 Qwen 2.5-Max 超大规模 MoE 模型,号称在多个基准测试中超越 DeepSeek V3 登竞争对手。</p><p>阿里云今日宣布,Qwen2.5-Max 在 Chatbot Arena 大模型盲测中超越 DeepSeek-V3、Open AI o1-mini 和 Claude-3.5-Sonnet 等模型,以 1332 分位列全球第七名,也是非推理类的中国大模型冠军。</p><p><img src=\"https://x0.ifengimg.com/ucms/2025_06/5E4DAC86AEE0C9276955A5A3F5CD2D9FF8454B5F_size79_w810_h1080.jpg\"/></p><p>同时,Qwen2.5-Max 在数学和编程等单项能力上排名第一,在硬提示(Hard prompts)方面排名第二。</p><p><img src=\"https://x0.ifengimg.com/ucms/2025_06/4F3FC7571BC0BC704FF4FCAA97656E1239E33498_size53_w1080_h584.jpg\"/></p><p>IT之家查询公开资料获悉,Chatbot Arena 是由 LMSYS Org 推出的大模型性能测试平台,目前集成了 190 多种模型。</p><p>该榜单采用匿名方式将大模型两两组队,交给用户进行盲测,用户根据真实对话体验对模型能力进行投票。因此,Chatbot Arena LLM Leaderboard 成为业界公认的最公正、最权威榜单之一,也是全球顶级大模型的最重要竞技场。</p><p><img src=\"https://x0.ifengimg.com/ucms/2025_06/437F7F55B7C4303E0EF8588EBD9811F6FAA8E68A_size49_w1080_h400.jpg\"/></p><p>阿里云表示,在 Arena-Hard、LiveBench、LiveCodeBench、GPQA-Diamond 及 MMLU-Pro 等主流基准测试中,Qwen2.5-Max 比肩 Claude-3.5-Sonnet,并几乎全面超越了 GPT-4o、DeepSeek-V3 及 Llama-3.1-405B。</p><p><img src=\"https://x0.ifengimg.com/ucms/2025_06/E5F43C90A1D52FFCB0239317D7C8C6B70504CC4F_size49_w1080_h614.jpg\"/></p></body></html>","source":"fenghuang_stock","collect":0,"html":"<!DOCTYPE html>\n<html>\n<head>\n<meta http-equiv=\"Content-Type\" content=\"text/html; charset=utf-8\" />\n<meta name=\"viewport\" content=\"width=device-width,initial-scale=1.0,minimum-scale=1.0,maximum-scale=1.0,user-scalable=no\"/>\n<meta name=\"format-detection\" content=\"telephone=no,email=no,address=no\" />\n<title>全球第一:阿里云宣布通义千问Qwen2.5-Max数学及编程能力登顶最新AI大模型榜单</title>\n<style type=\"text/css\">\na,abbr,acronym,address,applet,article,aside,audio,b,big,blockquote,body,canvas,caption,center,cite,code,dd,del,details,dfn,div,dl,dt,\nem,embed,fieldset,figcaption,figure,footer,form,h1,h2,h3,h4,h5,h6,header,hgroup,html,i,iframe,img,ins,kbd,label,legend,li,mark,menu,nav,\nobject,ol,output,p,pre,q,ruby,s,samp,section,small,span,strike,strong,sub,summary,sup,table,tbody,td,tfoot,th,thead,time,tr,tt,u,ul,var,video{ font:inherit;margin:0;padding:0;vertical-align:baseline;border:0 }\nbody{ font-size:16px; line-height:1.5; color:#999; background:transparent; }\n.wrapper{ overflow:hidden;word-break:break-all;padding:10px; }\nh1,h2{ font-weight:normal; line-height:1.35; margin-bottom:.6em; }\nh3,h4,h5,h6{ line-height:1.35; margin-bottom:1em; }\nh1{ font-size:24px; }\nh2{ font-size:20px; }\nh3{ font-size:18px; }\nh4{ font-size:16px; }\nh5{ font-size:14px; }\nh6{ font-size:12px; }\np,ul,ol,blockquote,dl,table{ margin:1.2em 0; }\nul,ol{ margin-left:2em; }\nul{ list-style:disc; }\nol{ list-style:decimal; }\nli,li p{ margin:10px 0;}\nimg{ max-width:100%;display:block;margin:0 auto 1em; }\nblockquote{ color:#B5B2B1; border-left:3px solid #aaa; padding:1em; }\nstrong,b{font-weight:bold;}\nem,i{font-style:italic;}\ntable{ width:100%;border-collapse:collapse;border-spacing:1px;margin:1em 0;font-size:.9em; }\nth,td{ padding:5px;text-align:left;border:1px solid #aaa; }\nth{ font-weight:bold;background:#5d5d5d; }\n.symbol-link{font-weight:bold;}\n/* header{ border-bottom:1px solid #494756; } */\n.title{ margin:0 0 8px;line-height:1.3;color:#ddd; }\n.meta {color:#5e5c6d;font-size:13px;margin:0 0 .5em; }\na{text-decoration:none; color:#2a4b87;}\n.meta .head { display: inline-block; overflow: hidden}\n.head .h-thumb { width: 30px; height: 30px; margin: 0; padding: 0; border-radius: 50%; float: left;}\n.head .h-content { margin: 0; padding: 0 0 0 9px; float: left;}\n.head .h-name {font-size: 13px; color: #eee; margin: 0;}\n.head .h-time {font-size: 11px; color: #7E829C; margin: 0;line-height: 11px;}\n.small {font-size: 12.5px; display: inline-block; transform: scale(0.9); -webkit-transform: scale(0.9); transform-origin: left; -webkit-transform-origin: left;}\n.smaller {font-size: 12.5px; display: inline-block; transform: scale(0.8); -webkit-transform: scale(0.8); transform-origin: left; -webkit-transform-origin: left;}\n.bt-text {font-size: 12px;margin: 1.5em 0 0 0}\n.bt-text p {margin: 0}\n</style>\n</head>\n<body>\n<div class=\"wrapper\">\n<header>\n<h2 class=\"title\">\n全球第一:阿里云宣布通义千问Qwen2.5-Max数学及编程能力登顶最新AI大模型榜单\n</h2>\n\n<h4 class=\"meta\">\n\n\n2025-02-05 19:03 北京时间 <a href=https://tech.ifeng.com/c/8gjLlvGftJl><strong>IT之家</strong></a>\n\n\n</h4>\n\n</header>\n<article>\n<div>\n<p>IT之家 2 月 5 日消息,1 月 29 日新年之际,阿里云公布了其全新的通义千问 Qwen 2.5-Max 超大规模 MoE 模型,号称在多个基准测试中超越 DeepSeek V3 登竞争对手。阿里云今日宣布,Qwen2.5-Max 在 Chatbot Arena 大模型盲测中超越 DeepSeek-V3、Open AI o1-mini 和 Claude-3.5-Sonnet 等模型,以 ...</p>\n\n<a href=\"https://tech.ifeng.com/c/8gjLlvGftJl\">Web Link</a>\n\n</div>\n\n\n</article>\n</div>\n</body>\n</html>\n","type":0,"thumbnail":"","relate_stocks":{"89988":"阿里巴巴-WR","LU0577902538.SGD":"Fullerton Lux Funds - Asia Growth and Income Equities A Acc SGD","IE00B0JY6N72.USD":"PINEBRIDGE GLOBAL EMERGING MARKETS FOCUS EQUITY \"A\" (USD) ACC","LU0871576103.HKD":"HSBC GIF CHINESE EQUITY \"AC\" (HKD) ACC","BABA":"阿里巴巴","GPRO":"GoPro","LU0577902454.USD":"FULLERTON LUX FUNDS - ASIA GROWTH & INCOME EQUITIE \"I\" (USD) ACC","SG9999002562.SGD":"LionGlobal Asia Pacific SGD","LU0463099449.HKD":"SCHRODER ISF CHINA OPPORTUNITIES \"A\" (HKD) ACC","LU0455707207.USD":"FIDELITY FUNDS CHINA INNOVATION \"A\" (USD) INC","BK1591":"就地过年概念","09988":"阿里巴巴-W","SG9999002463.SGD":"LionGlobal China Growth SGD","BK1142":"互联网与直销零售","LU0572944931.SGD":"Janus Henderson Horizon China Opportunities A2 SGD","IE00BMPRXN33.USD":"NEUBERGER BERMAN 5G CONNECTIVITY \"A\" (USD) ACC","IE00BZ08YR35.GBP":"GUINNESS BEST OF CHINA \"C\" (GBP) ACC","LU0259732245.USD":"EASTSPRING INVESTMENTS DRAGON PEACOCK A","LU0499858602.USD":"NINETY ONE GSF ASIA PACIFIC EQUITY OPPORTUNITIES \"A\" (USD) ACC","LU0084288322.USD":"Natixis Asia Equity RD USD","LU0588546209.SGD":"Eastspring Investments - China Equity Fund AS SGD","LU2242644610.SGD":"Fidelity China Innovation A-ACC-SGD","IE00BZ08YT58.USD":"GUINNESS BEST OF CHINA \"C\" (USD) ACC","LU1048596156.SGD":"Blackrock Asian Growth Leaders A2 SGD-H","HK0000320264.USD":"TAIKANG KAITAI CHINA NEW OPPORTUNITIES FUND \"A\" (USD) ACC","SG9999006514.SGD":"United Asia Consumer Fund SGD","LU1568876251.USD":"ALLIANZ CHINA MULTI INCOME PLUS \"AMG\" (USD) INC","LU1048484197.HKD":"ALLIANZ CHINA MULTI INCOME PLUS \"AT\" (HKD) ACC","LU0054450605.USD":"HSBC GIF GLOBAL EMERGING MARKTS EQ \"AD\" INC","IE00B5MMRT66.SGD":"NEUBERGER BERMAN CHINA EQUITY \"A\" (SGDHDG) ACC","LU0456846285.SGD":"JPMorgan Funds - Greater China A (acc) SGD","LU1880383440.USD":"AMUNDI FUNDS CHINA EQUITY \"A2\" (USD) INC","LU0315178854.USD":"EASTSPRING INVESTMENTS ASIAN EQUITY INCOME \"A\" ACC","LU1688375341.USD":"贝莱德中国灵活股票基金","LU1961090484.USD":"ALLIANZ ALL CHINA EQUITY \"A\" (USD) INC","LU0164872284.USD":"HSBC GIF GLOBAL EMERGING MARKETS EQUITY \"A\" (USD) ACC","BK1588":"回港中概股","LU0140636845.USD":"施罗德大中华区股票A Acc","LU0348827113.USD":"ALLIANZ RCM CHINA \"AT\" ACC","HBBD.SI":"Alibaba HK SDR 5to1","LU0359201612.USD":"贝莱德中国基金A2","LU1316542783.SGD":"Janus Henderson Horizon Global Technology Leaders A2 SGD","LU0229945570.USD":"TEMPLETON BRIC \"A\" (USD) ACC","LU1807302812.USD":"UBS (LUX) EQUITY SICAV ALL CHINA \"P\" (USD) ACC","LU0823039010.USD":"AMUNDI FUNDS ASIA EQUITY FOCUS \"A2\" (USD) INC","LU1880398471.USD":"AMUNDI FUNDS GLOBAL EQUITY \"A2\" (USD) ACC","LU0345775950.USD":"NINETY ONE GSF ASIAN EQUITY \"A\" (USD) ACC","LU0228367735.SGD":"Eastspring Investments - Asian Equity Fund AS SGD","LU0890818403.SGD":"JPMorgan Funds - Emerging Markets Dividend A (mth) SGD-H","LU0048580855.USD":"富达大中华区A"},"source_url":"https://tech.ifeng.com/c/8gjLlvGftJl","is_english":false,"share_image_url":"https://static.laohu8.com/e9f99090a1c2ed51c021029395664489","article_id":"2509169409","content_text":"IT之家 2 月 5 日消息,1 月 29 日新年之际,阿里云公布了其全新的通义千问 Qwen 2.5-Max 超大规模 MoE 模型,号称在多个基准测试中超越 DeepSeek V3 登竞争对手。阿里云今日宣布,Qwen2.5-Max 在 Chatbot Arena 大模型盲测中超越 DeepSeek-V3、Open AI o1-mini 和 Claude-3.5-Sonnet 等模型,以 1332 分位列全球第七名,也是非推理类的中国大模型冠军。同时,Qwen2.5-Max 在数学和编程等单项能力上排名第一,在硬提示(Hard prompts)方面排名第二。IT之家查询公开资料获悉,Chatbot Arena 是由 LMSYS Org 推出的大模型性能测试平台,目前集成了 190 多种模型。该榜单采用匿名方式将大模型两两组队,交给用户进行盲测,用户根据真实对话体验对模型能力进行投票。因此,Chatbot Arena LLM Leaderboard 成为业界公认的最公正、最权威榜单之一,也是全球顶级大模型的最重要竞技场。阿里云表示,在 Arena-Hard、LiveBench、LiveCodeBench、GPQA-Diamond 及 MMLU-Pro 等主流基准测试中,Qwen2.5-Max 比肩 Claude-3.5-Sonnet,并几乎全面超越了 GPT-4o、DeepSeek-V3 及 Llama-3.1-405B。","news_type":1},"isVote":1,"tweetType":1,"viewCount":1329,"commentLimit":10,"likeStatus":false,"favoriteStatus":false,"reportStatus":false,"symbols":["BABA","09988"],"verified":2,"subType":0,"readableState":1,"langContent":"CN","currentLanguage":"CN","warmUpFlag":false,"orderFlag":false,"shareable":true,"causeOfNotShareable":"","featuresForAnalytics":[],"commentAndTweetFlag":false,"andRepostAutoSelectedFlag":false,"upFlag":false,"length":119,"optionInvolvedFlag":false,"xxTargetLangEnum":"ZH_CN"},"commentList":[{"id":400648992395640,"commentId":"400648992395640","gmtCreate":1738822313425,"gmtModify":1738822317404,"authorId":3442361004733276,"author":{"id":3442361004733276,"idStr":"3442361004733276","authorId":3442361004733276,"name":"放养的小兵张嘎","avatar":"https://static.tigerbbs.com/a2bbfa9b4b9d9fdf7509f0ac54719e8e","vip":1,"crmLevel":4,"crmLevelSwitch":0,"individualDisplayBadges":[]},"repliedAuthorId":0,"objectId":400614547132816,"objectIdStr":"400614547132816","type":1,"supId":0,"supIdStr":"0","prevId":0,"prevIdStr":"0","content":"额 我还长期看好阿里呢 看样是我年轻了","text":"额 我还长期看好阿里呢 看样是我年轻了","html":"额 我还长期看好阿里呢 看样是我年轻了","likeSize":0,"commentSize":0,"subComments":[],"verified":10,"allocateAmount":0,"commentType":"valid","coins":0,"score":0,"disclaimerType":0}],"isCommentEnd":false,"isTiger":false,"isWeiXinMini":false,"url":"/m/post/400614547132816"}
精彩评论