Building on HF
130.6 TFLOPS
Nova Devs / Arthur
arthu1
0 followers · 2 following
AI & ML interests
11 years old. Built a custom architecture (MoE + YaRN). Trained on a $1 GPU budget. 50 downloads in 24 hours.
Recent Activity
Updated a model about 6 hours ago: arthu1/wind-arc-1-5-preview
Replied to Crownelius's post about 8 hours ago:
[DAY TWO] PROJECT CROWFEATHER - 5/1/2026

Que sera, what will he be? Step 47,500 of 100,000. Loss hovering around 2.76 on 6.2B tokens. Throughput steady at 87k per second on the A100. Not a GH200, but she gets it done.

Still haven't named him. Scamp has a rascally charm. Quentin sounds like he'd wear a bow tie and think hard before speaking. Taking votes.

Phase two is what's keeping me up. Datasets everywhere and I can't pick. I'm fusing Google and DeepSeek's ideas: Gemma 4's alternating sliding and global attention, DeepSeek V4's Muon optimizer and WSD scheduler, Gemma 2's logit soft cap, and PaLM's z-loss. Sounds like peanut butter on a hamburger, but the loss curve says it works. Tribe_v2 has real potential but needs more scaffolding than a barn raising before I throw it in.

One thing's certain though. This model's gonna be a thinker. Not a Wikipedia parrot. Something that chews before it answers.

Finally got a use for my less popular datasets too. Some Opus-4.5-Writing-Style for polish. A few rows of Human-Archtypes-25k to see what personality bubbles up. Could be a poet, could be a grump. Either beats a flimsy fine-tune.

The bank's after my credit card. Until then, full steam. Next model gets graphs. I swear.

-Shane
Published a model 1 day ago: arthu1/wind-edge-1.6-sft
arthu1's models (14), sorted by recently updated
arthu1/wind-arc-1-5-preview • Updated about 6 hours ago
arthu1/wind-edge-1.6-sft • Updated 1 day ago • 14
arthu1/REPO_NAME • Updated 1 day ago
arthu1/wind-edge-1-6-final-beta • Updated 10 days ago
arthu1/wind-lite-1-6-0326-beta1 • Updated 10 days ago • 50
arthu1/wind-arc-1-6-beta • 4B • Updated Mar 25 • 10
arthu1/north-star-1 • Updated Mar 10
arthu1/north-air-1 • Updated Mar 10
arthu1/north-tokenizer • Updated Mar 10
arthu1/starlight-mini • Text Generation • 8B • Updated Jan 31 • 5
arthu1/astrocoder-star-merge • 8B • Updated Dec 12, 2025 • 1
arthu1/astrocoder-star • Text Generation • Updated Dec 12, 2025 • 2
arthu1/algorix • 8B • Updated Dec 11, 2025 • 4
arthu1/snowy • Updated Sep 17, 2025