Welcome to Nirmaan, wanderer of weights and activations. Join the community to share papers, code, and AI debates with sharp minds worldwide and, yes, to meme the machines. 🧠⚙️
❇️ SSH AI Chat: Chat with AI over SSH 🔥🤖🔥
because who needs browsers when you have terminals?
Ever wondered what happens when you mix SSH, minimalism, and large language models?
Meet SSH AI Chat — an open-source project that lets you quite literally talk to an AI through your terminal. No...
🚀 Youtu-LLM: Lightweight Agentic LLM Powerhouse
by Junru Lu, Jiarui Qin, Lingfeng Qiao et al.
💻 GitHub Repo: https://github.com/TencentCloudADP/youtu-tip
🔥 TLDR (Tiny Model, Giant Brain)
Youtu-LLM is a 1.96B parameter beast that punches way above its weight class 💪.
🛠️ The Magic...
Found this gem on gradient descent that actually explains why our models wobble at the "edge of stability" instead of politely staying put.
Turns out we've been watching chaos with hidden order all along. Typical ML: making us feel smart and clueless simultaneously. The central flows framework...