Doge-CheckPoint Collection A series of checkpoint weights that can continue training on new datasets without training spikes. • 3 items • Updated 11 days ago • 1
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 6 items • Updated 8 days ago • 13
Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture Paper • 2412.11834 • Published Dec 16, 2024 • 7
Cheems: Wonderful Matrices More Efficient and More Effective Architecture Paper • 2407.16958 • Published Jul 24, 2024 • 3