About r/LocalLLM
Subreddit to discuss locally run large language models.
The community at a glance
r/LocalLLM is a Subreddit for NLP Developers with roughly 176K members. It has been around since 2023. It uses a forum format for communication. On the Hive Index it ranks #4 in the NLP communities list.
Roughly 104K members have joined in the past year. Popular discussion topics include Llm, Local, and Ai. Common discussion themes are Solution Requests and Advice Requests. Product recommendations often mention model, llm, and gpu.
Community Topics
Community Features
This community has a forum
Subreddit Analysis
via GummySearchMember growth over time
All time (yearly)
- 2024: 13K members
- 2025: 86K members
- 2026: 74K members
Past year (monthly)
- Jul: 5K members
- Aug: 7K members
- Sep: 4K members
- Oct: 3K members
- Nov: 6K members
- Dec: 5K members
- Jan: 7K members
- Feb: 10K members
- Mar: 11K members
- Apr: 17K members
- May: 17K members
- Jun: 11K members
Themes
- Solution Requests12 posts in the past month
- I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding?
- Looking to buy an RTX 5090 for local "Vibe Coding" using Claude Code / Open Code with Qwen 3.6 35B-A3B. Need real-world feedback!
- 640GB VRAM recommendations?
#112Solution RequestsI have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding? · Looking to buy an RTX 5090 for local "Vibe Coding" using Claude Code / Open Code with Qwen 3.6 35B-A3B. Need real-world feedback! · 640GB VRAM recommendations? - Advice Requests8 posts in the past month
- I moved 20% of our production LLM traffic to Chinese open weights models for 6 weeks. Here is the actual cost, quality, and data residency breakdown
- We open-sourced our desktop app for local LLM users, feedback welcome
- Hoping for some guidance, as complete novice to AI and Tech in general
#28Advice RequestsI moved 20% of our production LLM traffic to Chinese open weights models for 6 weeks. Here is the actual cost, quality, and data residency breakdown · We open-sourced our desktop app for local LLM users, feedback welcome · Hoping for some guidance, as complete novice to AI and Tech in general - Ideas2 posts in the past month
- If I were running any AI sub getting flooded with "I've made this" posts I would make it a rule the creation must have a clear and valid unique-selling point.
- Benchmarking 18 local LLMs on a single hard task: build a complete product landing page, then render it and verify the JavaScript actually works
#32IdeasIf I were running any AI sub getting flooded with "I've made this" posts I would make it a rule the creation must have a clear and valid unique-selling point. · Benchmarking 18 local LLMs on a single hard task: build a complete product landing page, then render it and verify the JavaScript actually works - Money Talk1 post in the past month
- Are Companies moving to local LLMs for coding to avoid paying millions to Anthropic and OpenAI?
#41Money TalkAre Companies moving to local LLMs for coding to avoid paying millions to Anthropic and OpenAI? - Opportunities1 post in the past month
- Local AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark
#51OpportunitiesLocal AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark
Topics
- Llm130 posts in the past month
- My local llm machine
- What Are You Actually Using Local LLMs For?
- I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding?
#1130LlmMy local llm machine · What Are You Actually Using Local LLMs For? · I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding? - Local128 posts in the past month
- My local llm machine
- What Are You Actually Using Local LLMs For?
- Uncensored LLM models for local use
#2128LocalMy local llm machine · What Are You Actually Using Local LLMs For? · Uncensored LLM models for local use - Ai104 posts in the past month
- Local AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark
- Asked to build a local AI setup for a company with ~50k budget. Where would you start?
- Tools or techniques for AI memory management?
#3104AiLocal AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark · Asked to build a local AI setup for a company with ~50k budget. Where would you start? · Tools or techniques for AI memory management? - Model91 posts in the past month
- Smartest model to replace Claude Code - 100GB/200GB VRAM available
- Uncensored LLM models for local use
- Experimental POC: streaming a ~160GB DeepSeek-V4-Flash-class MoE model on an 8GB VRAM laptop
#491ModelSmartest model to replace Claude Code - 100GB/200GB VRAM available · Uncensored LLM models for local use · Experimental POC: streaming a ~160GB DeepSeek-V4-Flash-class MoE model on an 8GB VRAM laptop - Coding56 posts in the past month
- I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding?
- Local AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark
- Are Companies moving to local LLMs for coding to avoid paying millions to Anthropic and OpenAI?
#556CodingI have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding? · Local AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark · Are Companies moving to local LLMs for coding to avoid paying millions to Anthropic and OpenAI?
Flair
- Question75 posts in the past month
- What Are You Actually Using Local LLMs For?
- Smartest model to replace Claude Code - 100GB/200GB VRAM available
- I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding?
#175QuestionWhat Are You Actually Using Local LLMs For? · Smartest model to replace Claude Code - 100GB/200GB VRAM available · I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding? - Discussion68 posts in the past month
- ZAI said "hold my beer" and dropped a MIT licensed flagship the day after the Fable/Mythos shutdown
- how are they gonna stop us next?
- Push it to prod immediately
#268DiscussionZAI said "hold my beer" and dropped a MIT licensed flagship the day after the Fable/Mythos shutdown · how are they gonna stop us next? · Push it to prod immediately - Project24 posts in the past month
- Created a small website to check what can run on your hardware to find new models you can run easily
- i finetuned llama to deny
- Experimental POC: streaming a ~160GB DeepSeek-V4-Flash-class MoE model on an 8GB VRAM laptop
#324ProjectCreated a small website to check what can run on your hardware to find new models you can run easily · i finetuned llama to deny · Experimental POC: streaming a ~160GB DeepSeek-V4-Flash-class MoE model on an 8GB VRAM laptop - News10 posts in the past month
- This is why we need local models
- Intel ending development of BigDL: An open-source AI/LLM effort getting axed
- As per README the continuedev/continue project is dead · Issue #12629 · continuedev/continue
#410NewsThis is why we need local models · Intel ending development of BigDL: An open-source AI/LLM effort getting axed · As per README the continuedev/continue project is dead · Issue #12629 · continuedev/continue - Model8 posts in the past month
- Mac Studio M3 Ultra 256GB + Qwen 3 235B = 18.57 tok/sec
- Gemma 4 Quadruple Release, 12B, 12B QAT, 26B-A4B QAT and 31B QAT Uncensored Heretics!
- State of Local AI #1. In lieu of Fable ban. Here’s the best LLMs of the week to run on your hardware
#58ModelMac Studio M3 Ultra 256GB + Qwen 3 235B = 18.57 tok/sec · Gemma 4 Quadruple Release, 12B, 12B QAT, 26B-A4B QAT and 31B QAT Uncensored Heretics! · State of Local AI #1. In lieu of Fable ban. Here’s the best LLMs of the week to run on your hardware
Product recommendations
- model16 posts in the past month
- Best models for 8x3090
- What the best model to run on m1 pro, 16gb ram for coders?
- best model for laptop and ram?
#116modelBest models for 8x3090 · What the best model to run on m1 pro, 16gb ram for coders? · best model for laptop and ram? - llm9 posts in the past month
- Best ultra low budget GPU for 70B and best LLM for my purpose
- Best LocalLLM for scientific theories and conversations?
- Best LLM to run locally on LM Studio (4GB VRAM) for extracting credit card statement PDFs into CSV/Excel?
#29llmBest ultra low budget GPU for 70B and best LLM for my purpose · Best LocalLLM for scientific theories and conversations? · Best LLM to run locally on LM Studio (4GB VRAM) for extracting credit card statement PDFs into CSV/Excel? - gpu4 posts in the past month
- Best ultra low budget GPU for 70B and best LLM for my purpose
- GPU recommendation for best possible LLM/AI/VR with 3000+€ budget
- Best Used Card For Running LLMS
#34gpuBest ultra low budget GPU for 70B and best LLM for my purpose · GPU recommendation for best possible LLM/AI/VR with 3000+€ budget · Best Used Card For Running LLMS
Community Reviews
Frequently asked questions
- Who is r/LocalLLM for?
- Best for NLP Developers enthusiasts looking for a Reddit-based community with forum discussion.
- Is r/LocalLLM free to join?
- This listing is not marked as paid-only. Access rules and any fees are decided by the community.
- How many members does r/LocalLLM have?
- Roughly 176K members, based on figures reported by the community or its host. Member counts are approximate and change over time.
- What platform is r/LocalLLM on?
- r/LocalLLM runs on Reddit. Reddit communities (or "subreddits") are forum-based groups on the popular social news aggregation, web content rating, and discussion website Reddit. Reddit is commonly known as "the front page of the internet". Users submit content to the site such as links, text posts, and images, which are then voted up or down and discussed by other members. From investing Reddit communities, to professional ones, to ones just for laughs, you're likely to find a community for you on Reddit.
- What topics does r/LocalLLM cover?
- On the Hive Index, r/LocalLLM is organized under NLP Developers.
- How do I join r/LocalLLM?
- You can join r/LocalLLM by clicking this link, or pressing the "Go to community" button above.
- What are the NLP communities like?
- Join NLP communities online to discuss and learn about the latest developments in language learning technology. You'll be able to chat with like-minded engineers and data scientists about the newest machine learning methods, from NLP to neural networks.
Monthly Stats
- 7
- Views
- 6
- Visitors
- 1
- Referrals
Similar Communities
8r/LanguageTechnology
This sub will focus on theory, careers, and applications of NLP (Natural Language Processing), which includes anything from Regex & Text Analytics to Transformers & LLMs. Language learning & copy/pasted ChatGPT conversations are outside the scope of the sub - please read the rules for more clarification.
OneAI
OneAI provides APIs to apply AI to text. Summarize conversations, categorize articles and detect user emotions. Join the AI dev community on discord!
Merlinn Community
Merlinn Community is a Slack community that exists around our open-source product, Merlinn. The project is an AI on-call developer helping to investigate production incidents.
Hugging Face Discuss
Official community for discussion of Hugging Face transformers, datasets, and NLP models
r/NLP
Neuro-Linguistic Programming (NLP) is an approach to communication, personal development, and psychotherapy created by Richard Bandler and John Grinder.
fast.ai Forum
Community for fast.ai courses, practical deep learning, and NLP applications