← back
DevOps for Scalable Infra for startup

DevOps for Scalable Infra for startup

Pending
💰 USD 500–2500 👤 Unknown 🕒 14d ago status: new
Linux Cloud Computing GPGPU Amazon Web Services Infrastructure Architecture DevOps Containerization LLM Prompt Engineering Retrieval-Augmented Generation (RAG)
I need experienced devOps to find solution for scalable solution for INFRA. 1.Setup automatically AWS instance once backend is reaching limit 2.Find cheap way for whisper server 3.Create infra to be able to scale in future for multi regions EU/ASIA + claudflare 4. Check and remove problems with db latency or performance 5. Scalable architekture for RAG +LLM + audio: I need solution for LLM (selectd by client)+ RAG deployed on own server(recommended by freelancer) with automatic scalable to 1000 or more converations the same time. Instances /pods should be added and removed automatically to save costs(for now only online dedicated serwers /clauds) later hibdrid of GPU server on premis + online servers Currenly additional information aboout users we have in postgresql only , we want to give user option to talk with RAG data and LLM model System also should count usages, store inforamtion when conversation started and finished in our database. If there is better solution recommended to talk wih the data I am open for it . In future I would like to add sending voice to this server and getting it back (except text). Please share price,timeplan for all things included to correct current infra and your experiance
↗ View on Freelancer