Skip to content

Pull requests: jingyaogong/minimind

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

新增注释,解释 Attention Trainer 细节
#478 opened Aug 15, 2025 by zhenyu-02 Loading…
Mega
#442 opened Jun 26, 2025 by yangnianboy Loading…
接续训练
#424 opened May 28, 2025 by zisu09 Loading…
升腾NPU适配
#409 opened May 16, 2025 by adenzhou1350 Loading…
完善注释及训练脚本
#394 opened May 8, 2025 by 0x0059 Loading…
修改 serve_openai_api.py 的默认参数
#385 opened Apr 30, 2025 by screnwei Loading…
Hotfix/issues 382
#383 opened Apr 29, 2025 by screnwei Loading…
Update eval_model.py
#377 opened Apr 26, 2025 by howard0su Loading…
sft should use pretrain model
#376 opened Apr 26, 2025 by zachzwy Loading…
完善 README 中关于加载已有模型的说明
#354 opened Apr 20, 2025 by llxxbb Loading…
chore: auto detect mps for pre train
#323 opened Apr 5, 2025 by zwpaper Loading…
Add Load ckpt
#317 opened Apr 3, 2025 by LH-and-FPGA Loading…
Little typo of readme
#262 opened Mar 9, 2025 by SeanHH86 Loading…
[feat] add interactive notebook
#214 opened Feb 23, 2025 by Nijikadesu Loading…
add smart gradient accumulation
#204 opened Feb 21, 2025 by powermano Loading…
Add ckp_dir and tokenizer path
#189 opened Feb 18, 2025 by xunuohope1107 Loading…
Update requirements.txt
#77 opened Oct 28, 2024 by LIE-24 Loading…
Auto tokenizer name path fix
#59 opened Oct 3, 2024 by krmst Loading…
Update requirements
#54 opened Oct 1, 2024 by krmst Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.