-
Boston, Massachusetts
-
Posted Date: 05/20/2025
-
Consulting
-
$0-$0
Job Description
This role is critical to ensuring AI services run reliably, securely, and efficiently—supporting continuous improvement, automation, and incident resolution.
Key Responsibilities:
- Monitor and manage AI system and data pipeline performance and uptime
- Triage, troubleshoot, and resolve incidents with minimal disruption
- Maintain operational tasks such as model updates, patching, and data refreshes
- Analyze performance metrics and optimize system resource usage
- Automate repetitive tasks using scripting and orchestration tools
- Ensure documentation, runbooks, and recovery procedures are current and complete
- Support change management processes for AI and data infrastructure
- Collaborate with cross-functional teams to maintain secure and compliant operations
- 3+ years in IT operations, with at least 1 year supporting AI/ML systems and data pipelines
- Experience with MLOps tools and concepts, including model deployment and lifecycle management
- Hands-on with Apache Airflow, Prefect, Spark, Kafka, and cloud data services (AWS, Azure, or GCP)
- Strong troubleshooting and incident response skills in production environments
- Familiarity with system monitoring, data quality management, and automation workflows
- Understanding of CI/CD pipelines, containerization, and infrastructure as code
- Knowledge of security best practices for AI operations and data handling
- Bachelor's degree in Computer Science, IT, or a related field (or equivalent experience)
- Excellent communication, documentation, and cross-functional collaboration skills
- Technical certifications in cloud platforms, MLOps, or data engineering
- Experience in higher education or similarly complex organizational environments
- Familiarity with IT service management (e.g., ITIL) and compliance frameworks
Apply Now
Recruiter
Clayton Minnich
clayton.minnich@avidtr.com
Job ID: JN -052025-17251
Find Jobs Faster
Login or sign up to create customized job alerts to be notified first.
Share this job