Job Details

SRE
Lead
Full time
Apr 27

SRE Lead

RUB 530,000

SRE Lead for MLOps infrastructure development. Salary: 500,000 - 530,000 RUB gross. Develops a unified MLOps ecosystem for model development, deployment, and AI-powered document processing.

We are developing a unified MLOps ecosystem for the bank. This includes environments for model development/training/inference pipelines, model execution/delivery pipelines, non-model services (Feature Store, AutoML, AlfaPredict), A/B testing, RAG/LLMOps, and AI-powered document processing systems.

Experience in SRE/DevOps for 3+ years (Docker, Helm, Jenkins/GitLab CI, Python). Experience in ML/MLOps for 1+ year (Airflow, JupyterHub, Coder, ArgoWF, MLflow, Seldon, CUDA, KServe). Experience administering Kubernetes for 2+ years. Experience with Hadoop, Spark, Kafka, ELK. Personal qualities: Proactivity and initiative in refactoring suggestions. Full immersion in the infrastructure and team. Teamwork, willingness to help colleagues and users. Ability to see the big picture of the expected result, rather than just solving a specific task.

Employment under the Labor Code of the Russian Federation or as an individual entrepreneur in an accredited IT company. Voluntary medical insurance with dental coverage. Discounts on foreign language learning from Skyeng. Discounts on fitness from Xfit. Discounts on cinema from KARO. Work equipment provided.

Kubernetes
ArgoWF
Seldon
CUDA
KServe
Coder
Hadoop
Python
Spark
Helm
JupyterHub
Docker
MLflow
Airflow
MLOps
ELK
Gitlab CI
Jenkins
Kafka

Don't miss a single job

Subscribe to our Telegram channel

Subscribe

Similar jobs

SRE Engineer at Yandex Crowd

SRE Engineer at Yandex Crowd. Responsibilities include designing and implementing IaC, automating deployment and infrastructure management, monitoring, and ensuring system reliability. Requires knowledge of Terraform, Grafana, Docker, Kubernetes, Python, Bash, Java, and network protocols.

Y
Yandex Crowd

SRE Engineer at Yandex Afisha

Yandex Afisha is looking for an SRE engineer or infrastructure developer with operational experience. Responsibilities include improving product reliability, optimizing infrastructure, and automating delivery. Requires experience with configuration management systems and orchestrators, team development in Python or Go, CI/CD, and UNIX systems.

Russia
Я
Яндекс Афиша