Project Swallow Asahidata
community
AI & ML interests
None defined yet.
Recent Activity
Taishi-N324 authored a paper 2 days ago: On the Optimal Reasoning Length for RL-Trained Language Models
Taishi-N324 authored a paper 4 months ago: MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
kazukifujii authored a paper 6 months ago: Balancing Speed and Stability: The Trade-offs of FP8 vs. BF16 Training in LLMs
Team members: 8
Models: 0 (none public yet)
Datasets: 0 (none public yet)