Description:
We are looking for a GenAI Engineer with expertise in designing, developing, and deploying Generative AI applications. The ideal candidate will have Knowledge in full-stack development, AI/ML models, and cloud deployment. You will be responsible for building scalable AI-powered applications, integrating large language models (LLMs), and optimizing performance across both frontend and backend systems.
Key Responsibilities:
- Develop and deploy GenAI-powered applications using LLMs, NLP, and multimodal AI models.
- Design, build, and maintain scalable full-stack applications (frontend and backend).
- Integrate APIs from OpenAI, Hugging Face, Google Gemini, and other AI providers.
- Optimize LLM inference, fine-tuning, and prompt engineering for performance and accuracy.
- Work with vector databases (e.g., Pinecone, Weaviate, FAISS) for efficient retrieval-augmented generation (RAG).
- Implement MLOps best practices for model deployment and monitoring.
- Collaborate with product managers and designers to build user-centric AI applications.
Required Skills:
- AI/ML Expertise: Experience with LLMs, transformers, embeddings, fine-tuning models.
- Experience with Retrieval-Augmented Generation (RAG) and Hybrid Search.
- Familiarity with LangChain, LlamaIndex, and other GenAI toolkits.
- Experience deploying AI-powered chatbots, assistants, or creative content generation tools.
- Full Stack Development: Proficiency in React.js, Next.js, Node.js, Python (FastAPI, Flask, Django) will be a plus
- API Development: Hands-on experience with RESTful and GraphQL APIs.
- Testing & Performance Optimization: Knowledge of unit testing, A/B testing, and AI model evaluation.