Responsibility for implementation and deployment of Agentic/ Gen AI frameworks at scale
Strong in programming - Python and C++
Previous experience of working on Computer Vision projects and VLM /VLAM models.
Practical experience of working with Transformer Arch. and End to End Deep neural networks
Full stack AI / ML development experience
Design, build & maintain efficient and reliable Agentic / Generative AI code leveraging pipelines
Hosting and deployment knowledge in GCP along with advanced engineering concepts to build user friendly UI interface for easy adoption.
Requirements:
8+ overall years of experience (Agentic AI, VLM, VLAM and LLM) with significant exposure in Development, Design, Architecture, scaling and hosting in cloud.
Must Have –
Solutioning experience with Python and FAST API, Agentic Ai frameworks, VLMs, VLAMs, Open source LLM’s and Code based LLM models at scale with - Langchain / Ollama, vector embeddings, Memory Management etc.,
Practical experience in implementing Explainable and ethical AI models
Practical experience in frameworks like RAG/ CAG/ Agentic RAG etc.,
Experience in cloud hosting either AWS or Azure or GCP.
Experience in ML-OPS - Implement a feedback mechanism to continually improve the model over time through feedback loop and monitoring end KPI’s in production.
Experience with Quantization and Kubernetes or docker
Good to have
gRPC implementation to expose the API’s on a server for easy usage and good user interface
Streamlit front end creation
Experience with SAFe framework deliveries.
Qualifikationen
Candidate education background and experience:
Degree in Computer science or AI/ML Engineering
Strong in Programming concepts, frameworks, deployment, and customization of Agentic models, Vision models and Vision language action Models