Project Description
I’m ready to build a production-grade AI chat system that can take over repetitive Q&A duties and other conversational tasks inside my product. The goal is full task automation: users should be able to ask a question, receive an accurate answer that is thoughtful, gentle, and emotionally supportive, and never notice a hand-off to a human.
Here is what I need you to do:
• Design and implement the core large-language-model pipeline (GPT-4, Claude, or another strong model of your choice).
• Integrate retrieval-augmented generation so the bot can pull from my existing knowledge base and keep answers grounded.
• Orchestrate the prompts, embeddings, and vector search (LangChain, LlamaIndex, Pinecone or similar) for speed and reliability.
• Wrap the model in a clean, well-documented API that I can drop into a web or mobile front end.
• Optimize cost by combining different models.
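To make the retrieval and cost points above concrete, here is a minimal sketch of how a grounded prompt and a model choice could be assembled. Everything in it is an assumption for illustration: the knowledge-base snippets, the model labels "small-model" and "large-model", the 0.3 threshold, and the word-overlap scoring, which stands in for real embeddings and a vector store. The routing rule (cheaper model when retrieval is confident, stronger model otherwise) is one possible policy, not a requirement.

```python
import math
import re
from collections import Counter

# Hypothetical snippets; a real build would load these from the existing knowledge base.
KNOWLEDGE_BASE = [
    "To reset your password, open Settings and choose Reset Password.",
    "Refunds are processed within five business days of the request.",
    "You can export your data as CSV from the Reports page.",
]

def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def cosine(a, b):
    """Cosine similarity between two word-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(question, k=2):
    """Return the k best (score, snippet) pairs; a stand-in for vector search."""
    q = Counter(tokenize(question))
    scored = [(cosine(q, Counter(tokenize(doc))), doc) for doc in KNOWLEDGE_BASE]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]

def choose_model(top_score, threshold=0.3):
    """Assumed routing policy: confident retrieval goes to the cheaper model."""
    return "small-model" if top_score >= threshold else "large-model"

def build_request(question):
    """Assemble a grounded prompt and pick a model label for it."""
    hits = retrieve(question)
    context = "\n".join(f"- {doc}" for score, doc in hits if score > 0)
    prompt = (
        "Answer using only the context below. "
        "If the context is not enough, say so.\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
    return {"model": choose_model(hits[0][0]), "prompt": prompt}

if __name__ == "__main__":
    request = build_request("How do I reset my password?")
    print(request["model"])
    print(request["prompt"])
```

The returned dictionary would then be handed to whichever model provider is chosen; that call is left out here because the provider is still open.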
Acceptance criteria
– The chat responds in under two seconds for 95% of queries.
– Hallucination rate is demonstrably below 3% on a held-out test set we will define together.
– A deployment script delivers a containerised build I can spin up on AWS or GCP with a single command.
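The first two criteria can be checked mechanically once latencies and review labels exist. The sketch below shows one way to do it, assuming latencies are recorded in seconds and each test-set answer has been labelled as hallucinated or not by a reviewer. The function names are hypothetical, and the nearest-rank percentile is one of several common definitions; the exact method is something to agree on alongside the test set.

```python
import math

def percentile_nearest_rank(values, pct):
    """Nearest-rank percentile: the smallest value with at least pct% of values at or below it."""
    ordered = sorted(values)
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]

def meets_acceptance(latencies_s, hallucinated_flags, max_p95_s=2.0, max_rate=0.03):
    """Check the latency and hallucination criteria against measured data.

    latencies_s: response times in seconds, one per query.
    hallucinated_flags: booleans, True where a reviewer marked the answer as hallucinated.
    """
    p95 = percentile_nearest_rank(latencies_s, 95)
    rate = sum(hallucinated_flags) / len(hallucinated_flags)
    return {
        "p95_s": p95,
        "hallucination_rate": rate,
        "passed": p95 < max_p95_s and rate < max_rate,
    }

if __name__ == "__main__":
    # Made-up numbers purely to exercise the functions.
    latencies = [0.5] * 95 + [3.0] * 5
    flags = [True] * 2 + [False] * 98
    print(meets_acceptance(latencies, flags))
```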
When you apply, focus on your relevant experience building similar LLM-driven chat or support tools. Links, demos, or concise write-ups are ideal; long generic proposals won’t help. Let’s automate this conversation flow and free my team from repetitive answers.