レス数が1スレッドの最大レス数(1000件)を超えています。残念ながら投稿することができません。

技術的特異点/シンギュラリティ【総合】避難所 36

985：名無しさん (ｽﾌﾟｰ 7d0d-0cec)：2025/07/13(日) 21:30:07 ID:O/opyVFcSd: 書き起こし:

司会者:
"It's important to realize there are two types of training compute. One is the pre-training compute, that's from Grok-2 to Grok-3. Um, but from Grok-3 to Grok-4, we actually putting a lot of compute in reasoning, in RL."
別の話者:
"Yeah, and just like you said, this is literally the fastest moving field, and Grok-2 is like the high school student by today's standard. (...) By training Grok-2, that was the first time we scaled up like the pre-training. We realized that if you actually do the data ablation really carefully, and the infra, and also the algorithm, we can actually push the pre-training quite a lot by amount of 10x to make the model the best pre-trained based model. And that's why we built Colossus, the world's supercomputer with 100,000 H100s. And then with the best pre-trained model, and we realized if you can collect these verifiable outcome reward, you can actually train this model to start thinking from the first principle, start to reason, correct its own mistakes, and that's where the Grok-3 reasoning comes from. And today we ask the question, what happens if you take expansion of Colossus with all 200,000 GPUs, put all these into RL, 10x more compute than any of the models out there on reinforcement learning, unprecedented scale, what's gonna happen?"

説明:

事前学習とRL: Grok4の開発において、事前学習だけでなく、強化学習(RL)による推論能力の向上に重点を置いたことが述べられています。

Grok2との比較: Grok2はGrok4と比較すると、高校生レベルの能力であると表現されています。これはAIの進化の速さを強調しています。

Colossusスパコン: 大量のGPU（H100を10万台、その後20万台に拡張）を搭載したColossusというスーパーコンピュータを構築し、Grokの学習に使用したことが語られています。

自己修正能力: 強化学習によって、Grokが第一原理から考え、推論し、自らの間違いを修正する能力を獲得したことが説明されています。

RLスケールの拡大: Colossusの拡張により、既存のモデルよりも桁違いに大きな規模で強化学習を実施したことが述べられています。

掲示板管理者へ連絡無料レンタル掲示板