Application Open:
Full-Time
Job Purpose:
The Bioinformatics Engineer to help build a next-generation AI-driven data platform for biological sciences. The role will design robust data pipelines, support machine learning tasks, and guide teams on data mining and processing. Collaborating with top academics, you will create a plug-and-play data platform, optimize data ingestion, and identify trends in bioinformatics. Strong problem-solving and communication skills are essential to troubleshoot issues and present findings effectively. This role offers the chance to evaluate cutting-edge technologies, embrace challenges, and drive MBZUAI innovation in bioinformatics and AI.
Key Responsibilities:
Collaboration and Innovation:
- Collaborate with top academicians in artificial intelligence and bioinformatics to develop the world’s first plug-and-play data platform for biological sciences.
- Stay updated on the latest trends in bioinformatics and explore new areas for innovation and improvement.
Data Analysis and Algorithm Support:
- Design systems to identify linkages between datasets and support algorithm engineers in machine learning tasks.
- Develop and implement algorithms for genomic, proteomic, and other biological data analysis.
- Collaborate with researchers to design experiments and interpret biological data using computational tools.
Team Support and Guidance:
- Provide guidance to cross-functional teams on data mining, processing, and analysis tasks.
- Support data engineers in building pipelines to ingest data from diverse sources and optimize search and sorting mechanisms.
Problem Solving and Troubleshooting:
- Demonstrate excellent problem-solving skills to troubleshoot complex issues efficiently and effectively.
- Ensure data quality and integrity by implementing robust validation and quality control processes.
Communication and Stakeholder Engagement:
- Communicate findings and insights clearly to stakeholders and team members through presentations and reports.
- Participate in the publication of research findings in scientific journals and conferences.
Technology Evaluation and Implementation:
- Evaluate and recommend technology solutions, considering cost-benefit trade-offs and long-term scalability.
- Contribute to the development of databases and tools for storing, querying, and analyzing biological data.
Workflow Optimization and Documentation:
- Continuously improve workflows and tools to enhance efficiency and accuracy in data analysis.
- Document methodologies, workflows, and results to ensure reproducibility and knowledge sharing.
Leadership and Mentorship:
- Mentor junior team members and promote best practices in bioinformatics and data analysis.
Adaptability and Initiative:
- Embrace challenges and step out of your comfort zone to drive innovation and solve cutting-edge problems.
- Adapt to evolving project requirements and contribute to multiple projects simultaneously.
Interdisciplinary Collaboration:
- Work with interdisciplinary teams to integrate bioinformatics solutions into broader research and product development workflows.
Other Duties:
- Perform all other duties as reasonably directed by the line manager that are commensurate with these functional objectives.
Academic Qualifications:
- Bachelor’s degree in Bioinformatics, Bioengineering, Computational Biology or a related field.
- Another postgraduate degree will be preferred.
Professional Experience:
Essential
- Fundamental understanding of genomics and bioinformatics concepts, such as nucleotide pairing, protein sequences and structures, and sequence alignment.
- Proficient in using bioinformatics databases, including but not limited to PDB, StringDB, UniProt, and DepMap.
- Deep understanding of various data formats and the ability to link data across multiple sources.
- Skilled in command-line tools and working within a Linux environment.
- Familiar with data scraping tools and common Linux commands.
- Hands-on coding experience in machine learning and deep learning.
- Strong programming skills in languages such as Bash, Python, R, and Perl.
- Excellent English communication skills, with a collaborative attitude and the ability to work effectively with engineers at all levels.
- Strong sense of responsibility, with the ability to support emergent operational issues to ensure smooth team development and deployment.
- Skilled in technical documentation, risk assessments, and ensuring continuity with minimal disruptions.
Preferred
- Fresh graduate or less than 5 years of industry experience as software developer.
- Experience with git and both code and data version management tools
- Familiarity with graph databases, Neo4j, ArangoDB, Apache Giraph etc.
- Experience in data mining and data processing frameworks.
- Previously worked with image and text data.
- Creative and innovative approach to problem-solving Experience in higher education or research institutions, with an understanding of core research facility operations.
- Proficiency in data analytics for process optimization and continuous improvement.
- Strong English proficiency, with fluency in additional languages as a plus.