I’d check this thread for deepseek R1, you probably could ask @ubergarm some specificquestions after you read through it
If you want to dip your toes, you can always try a smaller model on a gaming gpu. For every 1 GB of vram, you can run 1B parameters
Self plug for referencing how parameters and Quantization works
1 Like