AI Engineer

Job Description

We’re looking for a talented AI Engineer to join our team focused on implementing and scaling large language models (LLMs) and generative AI systems. In this role, you will bridge the gap between cutting-edge research and practical applications, turning innovative AI concepts into robust, efficient, and production-ready systems. You will work closely with our research team and data engineers to build and optimize AI solutions that drive our company's products and services.

Key Responsibilities

  • Implement and optimize large language models and generative AI systems for production environments
  • Collaborate with researchers to translate research prototypes into scalable, efficient implementations
  • Design and develop AI infrastructure components for model training, fine-tuning, and inference
  • Optimize AI models for performance, latency, and resource utilization
  • Implement systems for model evaluation, monitoring, and continuous improvement
  • Develop APIs and integration points for AI services within our product ecosystem
  • Troubleshoot complex issues in AI systems and implement solutions
  • Contribute to the development of internal tools and frameworks for AI development
  • Stay current with emerging techniques in AI engineering and LLM deployment
  • Collaborate with data engineers to ensure proper data flow for AI systems
  • Implement safety measures, content filtering, and responsible AI practices

Requirements

Required Skills & Qualifications

  • Bachelor's or Master's degree in Computer Science, Engineering, or related technical field
  • 3+ years of hands-on experience implementing and optimizing machine learning models
  • Strong programming skills in Python and related ML frameworks (PyTorch, TensorFlow)
  • Experience with deploying and scaling AI models in production environments
  • Familiarity with large language models, transformer architectures, and generative AI
  • Knowledge of cloud platforms (AWS, GCP, Azure) and containerization technologies
  • Understanding of software engineering best practices (version control, CI/CD, testing)
  • Experience with ML engineering tools and platforms (MLflow, Kubeflow, etc.)
  • Strong problem-solving skills and attention to detail
  • Ability to collaborate effectively in cross-functional teams

Preferred Qualifications

  • Experience with fine-tuning and prompt engineering for large language models
  • Knowledge of distributed computing and large-scale model training
  • Familiarity with model optimization techniques (quantization, pruning, distillation)
  • Experience with real-time inference systems and low-latency AI services
  • Understanding of AI ethics, bias mitigation, and responsible AI development
  • Experience with model serving platforms (TorchServe, TensorFlow Serving, Triton)
  • Knowledge of vector databases and similarity search for LLM applications
  • Experience with reinforcement learning and RLHF techniques
  • Familiarity with front-end technologies for AI application interfaces



Benefits

Compensation

iGenius offers a competitive compensation structure, including salary, performance-based bonuses, and additional components based on experience. All roles include comprehensive benefits as part of the total compensation package.

About iGenius

iGenius is a deep-tech company specialized in the development of Artificial Intelligence solutions for companies operating in highly regulated industries, including financial services, government, or heavy industry. iGenius’ main product, Unicorn, offers tailored solutions for companies looking to integrate AI safely and effectively, mainly through two proprietary Large Language Models (LLMs). Italia 10B, is a multi-language model optimized for regulated sectors and elevated computational efficiency, while Colosseum 355B, built with latest-generation NVIDIA technology, is fit for mission-critical use cases. In addition to Unicorn, iGenius’ product offer includes Crystal, an AI agent for Decision Intelligence that analyzes business data in natural language and accurately supports strategic, insight-driven decision-making. In December 2024, iGenius joined forces with NVIDIA to build Colosseum – one of the largest AI supercomputers in the world – to support the deployment of its models with unrivaled speed, performance, and efficiency. 

Active in both Europe and the United States, iGenius is one of the leading AI unicorns in the European landscape, and  has attracted Fortune 500 companies, including Allianz and Intesa Sanpaolo. This led Gartner to recognize iGenius as a “Cool Vendor” in the  AI Core Technologies category, as well as mention the company in its Market Guide for Conversational AI. 

Please review our Privacy Policy here https://bit.ly/2XAy1gj .