したらばTOP ■掲示板に戻る■ 全部 1-100 最新50 | |
レス数が1スレッドの最大レス数(1000件)を超えています。残念ながら投稿することができません。

技術的特異点/シンギュラリティ【総合】避難所 33

985名無しさん (アウアウ 6d9e-2a3b):2025/04/29(火) 08:00:41 ID:B.SGAPP.Sa
Qwen @Alibaba_Qwen
Introducing Qwen3!

We release and open-weight Qwen3, our latest large language models, including 2 MoE models and 6 dense models, ranging from 0.6B to 235B. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general capabilities, etc., when compared to other top-tier models such as DeepSeek-R1, o1, o3-mini, Grok-3, and Gemini-2.5-Pro. Additionally, the small MoE model, Qwen3-30B-A3B, outcompetes QwQ-32B with 10 times of activated parameters, and even a tiny model like Qwen3-4B can rival the performance of Qwen2.5-72B-Instruct.

For more information, feel free to try them out in Qwen Chat Web (https://chat.qwen.ai/) and APP and visit our GitHub, HF, ModelScope, etc.

Blog: https://qwenlm.github.io/blog/qwen3/
GitHub: https://github.com/QwenLM/Qwen3
Hugging Face: https://huggingface.co/collections/Qwen/qwen3-67dd247413f0e2e4f653967f
ModelScope: https://modelscope.cn/collections/Qwen3-9743180bdc6b48

The post-trained models, such as Qwen3-30B-A3B, along with their pre-trained counterparts (e.g., Qwen3-30B-A3B-Base), are now available on platforms like Hugging Face, ModelScope, and Kaggle. For deployment, we recommend using frameworks like SGLang and vLLM. For local usage, tools such as Ollama, LMStudio, MLX, llama.cpp, and KTransformers are highly recommended. These options ensure that users can easily integrate Qwen3 into their workflows, whether in research, development, or production environments.

Hope you enjoy our new models!
2025年04月29日 06:05

https://x.com/Alibaba_Qwen/status/1916962087676612998

Qwen @Alibaba_Qwen
Qwen3 exhibits scalable and smooth performance improvements that are directly correlated with the computational reasoning budget allocated. This design enables users to configure task-specific budgets with greater ease, achieving a more optimal balance between cost efficiency and
2025年04月29日 06:05

Qwen @Alibaba_Qwen
Qwen3 models are supporting 119 languages and dialects. This extensive multilingual capability opens up new possibilities for international applications, enabling users worldwide to benefit from the power of these models.
2025年04月29日 06:05

Qwen @Alibaba_Qwen
We have optimized the Qwen3 models for coding and agentic capabilities, and also we have strengthened the support of MCP as well. Below we provide examples to show how Qwen3 thinks and interacts with the environment.
2025年04月29日 06:05

Qwen @Alibaba_Qwen

2025年04月29日 06:21




掲示板管理者へ連絡 無料レンタル掲示板