Models

Large Model

  1. OpenAI o1: https://openai.com/index/introducing-openai-o1-preview/
  2. GPT-4o: https://openai.com/index/hello-gpt-4o/
  3. Claude 3.5: https://docs.anthropic.com/zh-CN/docs/intro-to-claude#claude-3-5
  4. Qwen: https://tongyi.aliyun.com/
  5. ERNIE: https://wenxin.baidu.com/wenxin
  6. NVIDIA Cosmos: https://www.nvidia.com/en-us/ai/cosmos/
  7. KTransformers: https://kvcache-ai.github.io/ktransformers/

Benchmark

  1. AGI-Eval: https://agi-eval.cn/mvp/home?sourcePage=aihub.cn
  2. AI-Ceping: https://ai-ceping.com/
  3. WebWalker: https://alibaba-nlp.github.io/WebWalker/

Continue reading Datasets