Nova Devs / Arthur's picture

Building on HF

Nova Devs / Arthur

arthu1

·

AI & ML interests

11 year old Built custom architecture (MoE + YaRN) Trained on $1 GPU budget 50 downloads in 24 hours

Recent Activity

updated a model about 5 hours ago

arthu1/wind-arc-1-5-preview

repliedto Crownelius's post about 7 hours ago

[DAY TWO] PROJECT CROWFEATHER - 5/1/2026 Que sera, what will he be? Step 47,500 of 100,000. Loss hovering around 2.76 on 6.2B tokens. Throughput steady at 87k per second on the A100. Not a GH200, but she gets it done. Still haven't named him. Scamp has a rascally charm. Quentin sounds like he'd wear a bow tie and think hard before speaking. Taking votes. Phase two is what's keeping me up. Datasets everywhere and I can't pick. I'm fusing Google and DeepSeek's ideas: Gemma 4's alternating sliding and global attention, DeepSeek V4's Muon optimizer and WSD scheduler, Gemma 2's logit soft cap, and PaLM's z-loss. Sounds like peanut butter on a hamburger, but the loss curve says it works. Tribe_v2 has real potential but needs more scaffolding than a barn raising before I throw it in. One thing's certain though. This model's gonna be a thinker. Not a Wikipedia parrot. Something that chews before it answers. Finally got a use for my less popular datasets too. Some Opus-4.5-Writing-Style for polish. A few rows of Human-Archtypes-25k to see what personality bubbles up. Could be a poet, could be a grump. Either beats a flimsy fine-tune. The bank's after my credit card. Until then, full steam. Next model gets graphs. I swear. -Shane

published a model 1 day ago

arthu1/wind-edge-1.6-sft

View all activity

Organizations

Collections 3

View 3 collections

spaces 25

North Star AI

Chat with a Christian‑worldview AI assistant

Algorix Coding IDE

ml-intern sandbox

North Air API 3

North Air API 4

North Air API 7

models 14

arthu1/wind-arc-1-5-preview

Updated about 5 hours ago

arthu1/wind-edge-1.6-sft

Updated 1 day ago • 14

arthu1/REPO_NAME

Updated 1 day ago

arthu1/wind-edge-1-6-final-beta

Updated 10 days ago

arthu1/wind-lite-1-6-0326-beta1

Updated 10 days ago • 50

arthu1/wind-arc-1-6-beta

4B • Updated Mar 25 • 10

arthu1/north-star-1

arthu1/north-air-1

arthu1/north-tokenizer

arthu1/starlight-mini

Text Generation • 8B • Updated Jan 31 • 5

datasets 1

arthu1/north-star-paper

Updated Mar 10 • 11