Posted by: 张旺教授 [☆★★Reputation rank 12★★☆] on 2025-01-26 17:48 · 768 reads · 1 like


"Managers and engineers from Meta’s generative AI group and infrastructure team have started four war rooms to learn how DeepSeek works. Two of the mobilized
groups are trying to understand how High-Flyer lowered the cost of training and running DeepSeek. Meta wants to apply those techniques, a number of which a
technical paper from High-Flyer outlined, to Llama, one of the employees said. ...

A third Meta research group is trying to figure out what data High-Flyer might have used to train its models, according to one of the employees with direct
knowledge.

The fourth war room is considering new techniques for restructuring Meta’s models based on attributes of the DeepSeek models, they said. Meta is considering
launching a version of Llama that, like DeepSeek, would include numerous AI models, each trained to handle different tasks. That way, when a customer asks Llama
to handle a certain task, only some parts of the model would need to work on it. That could make the overall model faster and require less computing power to
operate."
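
The last paragraph of the quote describes what is usually called a mixture-of-experts design: many sub-models, with only a few doing work for any given request. Here is a rough Python sketch of that routing idea, just to make it concrete. It is not DeepSeek's or Meta's actual code. The names (`route_top_k`, `moe_forward`, `make_scaling_expert`), the toy "experts" that only scale their input, and the hand-picked gate vectors are all made up for illustration; a real model would learn the gate and use neural networks as experts.

```python
import math


def softmax(scores):
    """Turn raw gate scores into probabilities (numerically stable)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(gate_scores, k):
    """Pick the k highest-scoring experts and renormalize their weights to sum to 1."""
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, gate_vectors, k=2):
    """Run only the k selected experts on x and mix their outputs.

    gate_vectors is a hypothetical stand-in for a learned gating layer:
    one vector per expert, scored against x by dot product.
    """
    gate_scores = [sum(g * xi for g, xi in zip(gv, x)) for gv in gate_vectors]
    chosen = route_top_k(gate_scores, k)
    output = [0.0] * len(x)
    for idx, weight in chosen:
        y = experts[idx](x)  # experts that were not chosen are never called
        output = [o + weight * yi for o, yi in zip(output, y)]
    return output, chosen


def make_scaling_expert(factor):
    """Toy expert (assumption): multiplies every input element by a constant."""
    return lambda x: [factor * xi for xi in x]


experts = [make_scaling_expert(f) for f in (1.0, 2.0, 3.0, 4.0)]
gate_vectors = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]

output, chosen = moe_forward([2.0, 1.0], experts, gate_vectors, k=2)
print("experts used:", [idx for idx, _ in chosen])
print("output:", output)
```

With four experts and k=2, only half of them run for this input, which is the source of the compute savings the quote mentions. The cost is the extra gating step and the need to keep every expert in memory even when it is idle.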
