Job Seekers, Please send resumes to resumes@hireitpeople.comMust Have Skills:
- Machine Learning Operations
- Kubernetes (K8s) for MLP Ops
- AI/ML, Jupiter Notebook, and Jenkins
- The role is for Big Data Engineer with MLP Ops SRE expertise with 7+ years of role experience
- A solid understanding of AI/ML, Jupiter Notebook, and Jenkins is essential for this role.
- The associate should also have a basic understanding of Kubernetes (K8s) and experience with Kubeflow for MLOps.
- Person will be responsible for end-to-end machine learning lifecycle on our in-house Kubernetes (K8s) cluster
- Ensuring the stability and availability of production services is a key responsibility.
- Handle incident resolution when they occur. Maintain a culture of continuous learning and improvement in the incident resolution process.
- The role involves developing best practices for operations.
- The individual will be expected to create and maintain documentation as needed.
- Associate need to work as per roaster which may include weekend support.
- This role includes on-call duties to handle any urgent issues that occur outside of regular business hours.
- Associate need to work with team member across different geographical location.
- The role involves close collaboration with multiple teams to jointly resolve any major production issues.
Minimum Years of Experience: 8-10 years
Top 3 responsibilities you would expect the Subcon to shoulder and execute:- Support, Analyze
- Solution implementation
- Testing