r/LocalLLM

Community Overview

About r/LocalLLM

Subreddit to discuss locally run large language models.

The community at a glance

r/LocalLLM is a Subreddit for NLP Developers with roughly 176K members. It has been around since 2023. It uses a forum format for communication. On the Hive Index it ranks #4 in the NLP communities list.

Roughly 104K members have joined in the past year. Popular discussion topics include Llm, Local, and Ai. Common discussion themes are Solution Requests and Advice Requests. Product recommendations often mention model, llm, and gpu.

On Reddit
Established 2023
176K Members

Community Features

This community has a forum

Subreddit Analysis

via GummySearch
Yearly: +104K members
Growth: +146.1% / year

Member growth over time

All time (yearly)

  • 2024: 13K members
  • 2025: 86K members
  • 2026: 74K members

Past year (monthly)

  • Jul: 5K members
  • Aug: 7K members
  • Sep: 4K members
  • Oct: 3K members
  • Nov: 6K members
  • Dec: 5K members
  • Jan: 7K members
  • Feb: 10K members
  • Mar: 11K members
  • Apr: 17K members
  • May: 17K members
  • Jun: 11K members

Themes

  • Solution Requests
    12 posts in the past month
    1. I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding?
    2. Looking to buy an RTX 5090 for local "Vibe Coding" using Claude Code / Open Code with Qwen 3.6 35B-A3B. Need real-world feedback!
    3. 640GB VRAM recommendations?
    #1
    Solution Requests
    I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding? · Looking to buy an RTX 5090 for local "Vibe Coding" using Claude Code / Open Code with Qwen 3.6 35B-A3B. Need real-world feedback! · 640GB VRAM recommendations?
    12
  • Advice Requests
    8 posts in the past month
    1. I moved 20% of our production LLM traffic to Chinese open weights models for 6 weeks. Here is the actual cost, quality, and data residency breakdown
    2. We open-sourced our desktop app for local LLM users, feedback welcome
    3. Hoping for some guidance, as complete novice to AI and Tech in general
    #2
    Advice Requests
    I moved 20% of our production LLM traffic to Chinese open weights models for 6 weeks. Here is the actual cost, quality, and data residency breakdown · We open-sourced our desktop app for local LLM users, feedback welcome · Hoping for some guidance, as complete novice to AI and Tech in general
    8
  • Ideas
    2 posts in the past month
    1. If I were running any AI sub getting flooded with "I've made this" posts I would make it a rule the creation must have a clear and valid unique-selling point.
    2. Benchmarking 18 local LLMs on a single hard task: build a complete product landing page, then render it and verify the JavaScript actually works
    #3
    Ideas
    If I were running any AI sub getting flooded with "I've made this" posts I would make it a rule the creation must have a clear and valid unique-selling point. · Benchmarking 18 local LLMs on a single hard task: build a complete product landing page, then render it and verify the JavaScript actually works
    2
  • Money Talk
    1 post in the past month
    1. Are Companies moving to local LLMs for coding to avoid paying millions to Anthropic and OpenAI?
    #4
    Money Talk
    Are Companies moving to local LLMs for coding to avoid paying millions to Anthropic and OpenAI?
    1
  • Opportunities
    1 post in the past month
    1. Local AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark
    #5
    Opportunities
    Local AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark
    1

Topics

  • Llm
    130 posts in the past month
    1. My local llm machine
    2. What Are You Actually Using Local LLMs For?
    3. I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding?
    #1
    Llm
    My local llm machine · What Are You Actually Using Local LLMs For? · I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding?
    130
  • Local
    128 posts in the past month
    1. My local llm machine
    2. What Are You Actually Using Local LLMs For?
    3. Uncensored LLM models for local use
    #2
    Local
    My local llm machine · What Are You Actually Using Local LLMs For? · Uncensored LLM models for local use
    128
  • Ai
    104 posts in the past month
    1. Local AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark
    2. Asked to build a local AI setup for a company with ~50k budget. Where would you start?
    3. Tools or techniques for AI memory management?
    #3
    Ai
    Local AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark · Asked to build a local AI setup for a company with ~50k budget. Where would you start? · Tools or techniques for AI memory management?
    104
  • Model
    91 posts in the past month
    1. Smartest model to replace Claude Code - 100GB/200GB VRAM available
    2. Uncensored LLM models for local use
    3. Experimental POC: streaming a ~160GB DeepSeek-V4-Flash-class MoE model on an 8GB VRAM laptop
    #4
    Model
    Smartest model to replace Claude Code - 100GB/200GB VRAM available · Uncensored LLM models for local use · Experimental POC: streaming a ~160GB DeepSeek-V4-Flash-class MoE model on an 8GB VRAM laptop
    91
  • Coding
    56 posts in the past month
    1. I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding?
    2. Local AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark
    3. Are Companies moving to local LLMs for coding to avoid paying millions to Anthropic and OpenAI?
    #5
    Coding
    I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding? · Local AI Coding with Qwen 3.6 27B on NVIDIA DGX Spark · Are Companies moving to local LLMs for coding to avoid paying millions to Anthropic and OpenAI?
    56

Flair

  • Question
    75 posts in the past month
    1. What Are You Actually Using Local LLMs For?
    2. Smartest model to replace Claude Code - 100GB/200GB VRAM available
    3. I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding?
    #1
    Question
    What Are You Actually Using Local LLMs For? · Smartest model to replace Claude Code - 100GB/200GB VRAM available · I have a 5k budget for a personal LLM server. What are the best options and what performance can I expect compared to commercial models for coding?
    75
  • Discussion
    68 posts in the past month
    1. ZAI said "hold my beer" and dropped a MIT licensed flagship the day after the Fable/Mythos shutdown
    2. how are they gonna stop us next?
    3. Push it to prod immediately
    #2
    Discussion
    ZAI said "hold my beer" and dropped a MIT licensed flagship the day after the Fable/Mythos shutdown · how are they gonna stop us next? · Push it to prod immediately
    68
  • Project
    24 posts in the past month
    1. Created a small website to check what can run on your hardware to find new models you can run easily
    2. i finetuned llama to deny
    3. Experimental POC: streaming a ~160GB DeepSeek-V4-Flash-class MoE model on an 8GB VRAM laptop
    #3
    Project
    Created a small website to check what can run on your hardware to find new models you can run easily · i finetuned llama to deny · Experimental POC: streaming a ~160GB DeepSeek-V4-Flash-class MoE model on an 8GB VRAM laptop
    24
  • News
    10 posts in the past month
    1. This is why we need local models
    2. Intel ending development of BigDL: An open-source AI/LLM effort getting axed
    3. As per README the continuedev/continue project is dead · Issue #12629 · continuedev/continue
    #4
    News
    This is why we need local models · Intel ending development of BigDL: An open-source AI/LLM effort getting axed · As per README the continuedev/continue project is dead · Issue #12629 · continuedev/continue
    10
  • Model
    8 posts in the past month
    1. Mac Studio M3 Ultra 256GB + Qwen 3 235B = 18.57 tok/sec
    2. Gemma 4 Quadruple Release, 12B, 12B QAT, 26B-A4B QAT and 31B QAT Uncensored Heretics!
    3. State of Local AI #1. In lieu of Fable ban. Here’s the best LLMs of the week to run on your hardware
    #5
    Model
    Mac Studio M3 Ultra 256GB + Qwen 3 235B = 18.57 tok/sec · Gemma 4 Quadruple Release, 12B, 12B QAT, 26B-A4B QAT and 31B QAT Uncensored Heretics! · State of Local AI #1. In lieu of Fable ban. Here’s the best LLMs of the week to run on your hardware
    8

Product recommendations

  • model
    16 posts in the past month
    1. Best models for 8x3090
    2. What the best model to run on m1 pro, 16gb ram for coders?
    3. best model for laptop and ram?
    #1
    model
    Best models for 8x3090 · What the best model to run on m1 pro, 16gb ram for coders? · best model for laptop and ram?
    16
  • llm
    9 posts in the past month
    1. Best ultra low budget GPU for 70B and best LLM for my purpose
    2. Best LocalLLM for scientific theories and conversations?
    3. Best LLM to run locally on LM Studio (4GB VRAM) for extracting credit card statement PDFs into CSV/Excel?
    #2
    llm
    Best ultra low budget GPU for 70B and best LLM for my purpose · Best LocalLLM for scientific theories and conversations? · Best LLM to run locally on LM Studio (4GB VRAM) for extracting credit card statement PDFs into CSV/Excel?
    9
  • gpu
    4 posts in the past month
    1. Best ultra low budget GPU for 70B and best LLM for my purpose
    2. GPU recommendation for best possible LLM/AI/VR with 3000+€ budget
    3. Best Used Card For Running LLMS
    #3
    gpu
    Best ultra low budget GPU for 70B and best LLM for my purpose · GPU recommendation for best possible LLM/AI/VR with 3000+€ budget · Best Used Card For Running LLMS
    4

Frequently asked questions

Who is r/LocalLLM for?
Best for NLP Developers enthusiasts looking for a Reddit-based community with forum discussion.
Is r/LocalLLM free to join?
This listing is not marked as paid-only. Access rules and any fees are decided by the community.
How many members does r/LocalLLM have?
Roughly 176K members, based on figures reported by the community or its host. Member counts are approximate and change over time.
What platform is r/LocalLLM on?
r/LocalLLM runs on Reddit. Reddit communities (or "subreddits") are forum-based groups on the popular social news aggregation, web content rating, and discussion website Reddit. Reddit is commonly known as "the front page of the internet". Users submit content to the site such as links, text posts, and images, which are then voted up or down and discussed by other members. From investing Reddit communities, to professional ones, to ones just for laughs, you're likely to find a community for you on Reddit.
What topics does r/LocalLLM cover?
On the Hive Index, r/LocalLLM is organized under NLP Developers.
How do I join r/LocalLLM?
You can join r/LocalLLM by clicking this link, or pressing the "Go to community" button above.
What are the NLP communities like?
Join NLP communities online to discuss and learn about the latest developments in language learning technology. You'll be able to chat with like-minded engineers and data scientists about the newest machine learning methods, from NLP to neural networks.