Project Description
I need a specialist to turn my Windows-based PC into a robust, private AI workstation. The core stack will revolve around Python and must run Llama locally through AnythingLLM while remaining flexible enough to load additional models as they mature.
Key capabilities I’m after:
• Decades-worth of data ingestion: PDFs, Word files, full Office docs and .eml email archives.
• Context retention so I can resume a conversation or analysis session days later without re-feeding content.
• Open WebUI access for convenient, browser-based interaction.
• Clean restart behaviour, automated backups, and a path to scale out if I later bolt on extra GPUs or move to a small on-prem cluster.
I already have the hardware; you bring the know-how to configure, script, and document the build so I can maintain it myself afterward. Please outline your proposed toolchain, any vector stores or databases you recommend for long-term memory, and how you’ll validate stability under load.