News
Newest
Ask
Show
Jobs
Open on GitHub
FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels
(arxiv.org)
15 points | by
PaulHoule
10 hours ago
1 comments
Reubend
4 hours ago
Paper looks great. No GitHub link that I can find though. Maybe I'll take a crack at an implementation if I've got some extra free time.
1 comments