I have a pilot project for a LLM/chat application, which will only need to ingest about 10gb of data. I’ve been building a reference list of sources, watching videos, books, and courses. I’m looking for recommendations on tooling and workstation environment. Strong preference for open source and on premises solutions.