AI Engineer

Vacancy Overview

Application Open:

Full-Time

Job Purpose:

The AI Engineer deliver high-quality outcomes in a fast-paced environment. The role will design and implement robust, efficient deep learning (DL) techniques, contribute to open-source projects, and develop AI applications. This role involves working at the cutting-edge of Deep Learning, Machine Learning, and Software Engineering. Additionally, the role will lead AI projects, optimize distributed training, job scheduling, and hyperparameter tuning for large DL/CV/NLP models, and improve system efficiency and performance. Collaborating with system architects, team leaders, and engineers, you will build innovative products focused on accuracy, interpretability, and scalability. You will own tasks with measurable results, aligning timelines for rapid performance evaluation. This role is an opportunity to innovate and drive impactful AI solutions for MBZUAI.

Key Responsibilities:

Innovation and Development:

  • Work at the cutting-edge of Deep Learning, Machine Learning, and Software Engineering, leveraging a diverse range of skills to drive innovation.
  • Contribute high-quality code and lead initiatives in developing AI projects and applications.
  • Continuously improve the efficiency, robustness, and cost-effectiveness of existing modules and applications through distributed training, job scheduling, and hyperparameter tuning for large DL/CV/NLP models and multi-stage pipelines.

Collaboration and Product Development:

  • Collaborate closely with system architects, team leaders, technical product managers, and engineers to build innovative products with a focus on accuracy, interpretability, and performance of machine learning systems.
  • Work together with cross-functional teams to align technical solutions with business goals and user needs.

Ownership and Execution:

  • Define and own tasks with measurable outcomes, ensuring alignment with team leaders for rapid evaluation of performance efficiency.
  • Design and implement scalable, high-performance solutions for training and deploying machine learning models.

Optimization and Troubleshooting:

  • Optimize data pipelines and workflows to ensure seamless data ingestion, processing, and model training.
  • Troubleshoot and resolve technical challenges related to machine learning systems and infrastructure.

Technical Leadership and Mentorship:

  • Mentor junior team members and promote best practices in software engineering and machine learning.
  • Drive innovation by experimenting with new algorithms, tools, and frameworks to improve model performance and scalability.

Monitoring and Documentation:

  • Ensure robust monitoring, logging, and alerting mechanisms for machine learning systems in production.
  • Document technical processes, architectures, and decisions to ensure knowledge sharing and maintainability.

Continuous Learning and Improvement:

  • Stay updated with the latest advancements in AI/ML technologies and integrate them into existing workflows.
  • Continuously evaluate and adopt new tools and methodologies to enhance system performance and efficiency.

Other Duties:

  • Perform all other duties as reasonably directed by the line manager that are commensurate with these functional objectives.

Academic Qualifications:

  • Master’s degree in computer science or applied mathematics or physics, with a specialization in Machine Learning, Deep Learning, Natural Language, Processing, Computer Vision, Robotics, Software Engineering, Human-Computer Interaction, or related fields.
  • A postgraduate degree will be preferred.

Professional Experience:

Essential

  • Strong knowledge and hands-on experience with systems for ML model training, job scheduling, and hyperparameter tuning.
  • 3+ years of practical experience with deep learning frameworks such as PyTorch and TensorFlow.
  • 3+ years of experience in AI platform and application development on Cloud/Cluster environments, leveraging popular AI accelerator architectures like CPU, GPU, and IPU.
  • 5+ years of solid programming expertise in one or more of the following: Java, C++, Python, or equivalent.
  • Excellent English communication skills, a collaborative attitude, and the ability to work effectively with engineers at all levels.
  • Experience with source control systems, build tools, and continuous integration pipelines.
  • Hardworking, self-motivated, detail-oriented, and proven ability to meet tight deadlines.

Preferred

  • A PhD degree, with 2+ years of equivalent practice or research experience, will be preferred.
  • Proficient in computer architecture, high-performance programming, parallel programming, and distributed training/inference.
  • Skilled in optimizing scientific or computationally intensive application codes, libraries, or compilers.
  • Experienced in using deep learning frameworks such as PyTorch, TensorFlow, Caffe, MXNet, or Ollama to adapt to various AI processors and build applications.
  • Demonstrated expertise across multiple projects covering a broad scope in Computer Vision (CV), Natural Language Processing (NLP), and Machine Learning (ML).
  • Alternatively, experience in Robotics projects involving SLAM (Simultaneous Localization and Mapping), control systems, and navigation development.
  • Experience in higher education or research institutions, with an understanding of core research facility operations.
  • Proficiency in data analytics for process optimization and continuous improvement.
  • Working proficiency in additional languages as a plus.

Apply Now:

Please enable JavaScript in your browser to complete this form.
Click or drag a file to this area to upload.
Click or drag a file to this area to upload.