Machine Learning Platform Engineer
Descrizione dell'offerta
Subito.it is Italy’s leading platform for buying and selling both second‑hand and new items — a true marketplace that connects millions of people to exchange goods and services. Our mission is to make the buying and selling experience simple, fast, and secure, while promoting circular economy principles and creating new business opportunities for everyone.
About the Role
We are looking for a highly skilled ML Platform Engineer to bridge the gap between machine learning research and production‑grade engineering. In this role you will be architecting and maintaining the core infrastructure that powers our entire ML lifecycle—from experimentation and orchestration to deployment and monitoring.
Job Responsibilities
- Infrastructure as Code: Design, build, and manage scalable ML infrastructure using Kubernetes, AWS services and Terraform.
- Workflow Orchestration: Build and maintain reusable components for our ML pipelines using Kubeflow to standardise our workflows and boost the team productivity.
- Model lifecycle Tracking: Manage tools to monitor the full lifecycle of a ML service: experimentation (MLflow), versioning, data quality, drifting, etc.
- Model Serving: Develop robust serving frameworks for deployment, monitoring and scaling our real time inference capabilities.
- Tooling & Integration: Create internal Python libraries and CLI tools that simplify how Data Scientists interact with the platform.
- Performance Optimization: Optimize resource allocation (GPU/CPU) within Kubernetes clusters to balance performance and cloud costs.
Job Requirements
- Programming: Mastery of Python (design patterns, clean architecture, asynchronous programming…) and Docker for local development.
- Orchestration: Deep experience with Kubernetes (K8s), including Helm charts, operators, and cluster management.
- MLOps Stack: Hands‑on experience with ML orchestration tools (ideally Kubeflow) and registries (MLflow).
- ML Frameworks: Proficiency with at least one major framework, such as PyTorch or TensorFlow, and an understanding of their distributed training requirements.
- CI/CD: Experience building automated pipelines (GitHub Actions, GitLab CI, or Jenkins) specifically for ML artifacts.
- Data Layers: Familiarity with feature stores and data processing tools (eg. Spark, Dask, or Pandas).
Nice to have
- Experience with Infrastructure as Code (Terraform or Pulumi).
- Knowledge of monitoring stacks like Prometheus and Grafana.
- Contributions to open‑source MLOps or CNCF projects.
Benefits
- Flexible working hours and smart working policy.
- Competitive total remuneration based on experience.
- Work with OKRs goal‑setting framework.
- Healthcare plan, welfare plan and wellbeing initiatives.
- Dynamic and supportive work environment.
- Growth and training opportunities.
Subito is an equal‑opportunity employer. We welcome people for who they are, regardless of origin, appearance, or background. Our workplace, like our platform, is for everyone. Diversity in all its forms — gender, age, nationality, culture, religion, or sexual orientation — inspires creativity and helps us grow together. This job post is open to all genders, ages, and nationalities, in accordance with Italian Laws 903/77 and 125/91, and Legislative Decrees 215/03 and 216/03.
#J-18808-Ljbffr