Senior MLOps Engineer AWS-Focused ML Infrastructure

Keysight Technologies

We are expanding our engineering team with a dedicated MLOps Engineer specializing in AWS to support the deployment, scaling, and operationalization of machine learning solutions across our manufacturing and semiconductor analytics platforms. This role will serve as a critical bridge between our Machine Learning Engineers—focused on Generative AI and classical ML—and production environments, ensuring seamless, reliable, and efficient ML workflows.

You will collaborate closely with the Senior Machine Learning Engineer (GenAI Platform) and the Machine Learning Engineer (Classical ML and Predictive Analytics) to automate pipelines, monitor model performance, and manage infrastructure for high-stakes applications like test plan generation, anomaly detection, predictive maintenance, and market intelligence. In our AWS-centric ecosystem, you will leverage best-in-class tools to enable rapid iteration while maintaining compliance, security, and cost efficiency in regulated industrial settings.

This position is perfect for a mid-level professional with a passion for DevOps in ML contexts, who excels at turning complex models into robust, production-ready systems.

Key Responsibilities

  • Design, implement, and maintain end-to-end MLOps pipelines on AWS, including CI/CD automation for model training, validation, deployment, and retraining, using services like SageMaker, CodePipeline, CodeBuild, and Step Functions.
  • Support the Generative AI platform by operationalizing AWS Bedrock workflows, including RAG pipelines, vector databases (e.g., via OpenSearch or Pinecone integrations), Lambda functions, and agentic systems—ensuring scalability for large-scale data processing like historical test plans and news article summarization.
  • Enable classical ML initiatives by deploying and monitoring models built with XGBoost, Scikit-learn, and NLP architectures (e.g., RNNs/LSTMs) on AWS infrastructure, incorporating drift detection for anomaly tracking in sensor data and competitor pricing monitoring.
  • Manage infrastructure as code (IaC) using Terraform or CloudFormation to provision and optimize AWS resources, such as EC2 instances, S3 buckets, EMR for Apache Spark-based processing (supporting our PMA product), and ECS/EKS for containerized deployments.
  • Implement comprehensive monitoring, logging, and alerting systems with CloudWatch, X-Ray, and third-party tools (e.g., Prometheus/Grafana integrations) to track model performance, detect anomalies, handle concept drift, and ensure high availability for customer-facing tools like Q&A chatbots and predictive maintenance advisors.
  • Collaborate in an Agile environment with ML engineers, data scientists, and SRE teams to conduct A/B testing, version models, automate rollbacks, and optimize costs through auto-scaling and spot instances.
  • Enforce security and compliance best practices, including IAM roles, VPC configurations, data encryption, and audit logging, to safeguard sensitive manufacturing data and meet industry standards.
  • Troubleshoot production issues, perform root-cause analysis, and drive continuous improvements in ML operations, staying ahead of AWS innovations to enhance platform reliability and efficiency.

Job Qualifications

Must-have qualifications

  • Bachelor's or Master's degree in Computer Science, Engineering, Information Systems, or a related technical field.
  • 3–5 years of experience in MLOps, DevOps, or cloud engineering roles, with a proven track record of deploying and managing ML models in production environments.
  • Deep expertise in AWS services for ML and data workflows, including SageMaker (real-time endpoints, inference components, multi-instance/multi-variant deployments), Bedrock (provisioned throughput, cross-Region inference profiles for scaling & resilience), EMR (for Spark-based PMA workloads), Lambda, S3, ECR, and orchestration tools like Step Functions or Airflow.
  • Proven experience with Amazon Elastic Container Registry (ECR): building, scanning for vulnerabilities, tagging, versioning, and pushing custom Docker images for inference containers (including Bring-Your-Own-Container patterns for custom ML frameworks, vLLM, or deep learning environments); managing ECR lifecycle policies, replication across regions, and secure access via IAM roles.
  • Strong proficiency in EC2-based ML deployments and infrastructure: selecting optimal instance types (e.g., ml.g family for GPU-heavy GenAI inference, g5/g6 for newer accelerators), configuring Auto Scaling Groups, managing spot instances for cost optimization, and handling EC2 fleets for custom hosting when SageMaker/Bedrock abstractions are insufficient.
  • Expertise in load balancing & scaling for ML inference: configuring and troubleshooting Application Load Balancers (ALB) or Network Load Balancers (NLB) integrated with SageMaker endpoints or ECS/EKS tasks; implementing SageMaker's built-in routing strategies (e.g., least outstanding requests for latency optimization); setting up auto-scaling policies (target tracking on CPU utilization, invocations per instance, or custom CloudWatch metrics); using cross-Region inference profiles in Bedrock for burst handling and global resilience; and ensuring high availability through multi-AZ deployments with a minimum instance count of 2.
  • Demonstrated ability to resolve common deployment issues in production ML environments, including cold-start latency in serverless/containerized inference, container pull failures from ECR, IAM permission misconfigurations causing access denied errors, model artifact corruption or version mismatches post-deployment, endpoint update failures (recovering without downtime using blue/green or canary strategies), drift/throttling in high-concurrency scenarios (e.g., 429 errors in Bedrock), and unhealthy instance recovery, with debugging via CloudWatch Logs, X-Ray traces, and SageMaker Model Monitor alerts.
  • Proficiency in IaC tools such as Terraform or CloudFormation to provision and optimize AWS resources (e.g., ECR repositories, EC2 fleets, ALBs, SageMaker endpoints, and auto-scaling configurations) in a repeatable, auditable manner.
  • Strong scripting and programming skills in Python (with libraries like Boto3), along with experience in CI/CD pipelines using Jenkins, GitHub Actions, or AWS CodePipeline — with specific focus on automated ECR image builds, model artifact promotion, and safe endpoint updates.
  • Familiarity with monitoring and observability stacks (e.g., CloudWatch, ELK Stack) and ML-specific tools for versioning (e.g., MLflow) and experiment tracking.
  • Experience in Agile methodologies, with hands-on participation in sprints, code reviews, and cross-functional problem-solving.
  • Solid understanding of ML concepts, including model drift, bias detection, and serving patterns, to effectively support both GenAI and classical ML teams.

Strongly preferred

  • Fluency in English.
  • Prior exposure to manufacturing, semiconductor, or industrial IoT domains, where data reliability and low-latency inference are critical.
  • Certifications such as AWS Certified Machine Learning – Specialty, AWS Certified DevOps Engineer, or equivalent.
  • Experience with hybrid ML setups, integrating on-premises data with cloud services, or handling large-scale NLP/numerical data pipelines.
  • Knowledge of security frameworks like SOC 2 or ISO 27001, and tools for automated testing of ML infrastructure.
  • Prior experience troubleshooting and optimizing SageMaker multi-instance/multi-variant endpoints (including traffic shifting, shadow testing, and A/B deployments) and Bedrock inference profiles (Priority/Flex tiers, cross-Region routing for throughput and cost balancing).
  • Hands-on work with EC2 Auto Scaling in ML contexts, including handling GPU instance availability constraints, spot interruption recovery, and cost-effective scaling for bursty inference workloads.
  • Familiarity with advanced deployment patterns such as blue/green deployments, canary rollouts, and rollback automation to minimize production impact during model updates.

If you are a pragmatic, AWS-savvy engineer excited about operationalizing cutting-edge ML in mission-critical industries, this role offers the opportunity to build resilient systems that directly impact our company's innovation and customer outcomes. Join a dynamic team committed to excellence, with ample room for growth and technical leadership.

Vacancy posted 4 days ago