Job Description

We’re looking for a talented AI Engineer to join our team focused on implementing and scaling large language models (LLMs) and generative AI systems. In this role, you will bridge the gap between cutting-edge research and practical applications, turning innovative AI concepts into robust, efficient, and production-ready systems. You will work closely with our research team and data engineers to build and optimize AI solutions that drive our company's products and services.

Key Responsibilities

Implement and optimize large language models and generative AI systems for production environments
Collaborate with researchers to translate research prototypes into scalable, efficient implementations
Design and develop AI infrastructure components for model training, fine-tuning, and inference
Optimize AI models for performance, latency, and resource utilization
Implement systems for model evaluation, monitoring, and continuous improvement
Develop APIs and integration points for AI services within our product ecosystem
Troubleshoot complex issues in AI systems and implement solutions
Contribute to the development of internal tools and frameworks for AI development
Stay current with emerging techniques in AI engineering and LLM deployment
Collaborate with data engineers to ensure proper data flow for AI systems
Implement safety measures, content filtering, and responsible AI practices

Requirements

Required Skills & Qualifications

Bachelor's or Master's degree in Computer Science, Engineering, or related technical field
3+ years of hands-on experience implementing and optimizing machine learning models
Strong programming skills in Python and related ML frameworks (PyTorch, TensorFlow)
Experience with deploying and scaling AI models in production environments
Familiarity with large language models, transformer architectures, and generative AI
Knowledge of cloud platforms (AWS, GCP, Azure) and containerization technologies
Understanding of software engineering best practices (version control, CI/CD, testing)
Experience with ML engineering tools and platforms (MLflow, Kubeflow, etc.)
Strong problem-solving skills and attention to detail
Ability to collaborate effectively in cross-functional teams

Preferred Qualifications

Experience with fine-tuning and prompt engineering for large language models
Knowledge of distributed computing and large-scale model training
Familiarity with model optimization techniques (quantization, pruning, distillation)
Experience with real-time inference systems and low-latency AI services
Understanding of AI ethics, bias mitigation, and responsible AI development
Experience with model serving platforms (TorchServe, TensorFlow Serving, Triton)
Knowledge of vector databases and similarity search for LLM applications
Experience with reinforcement learning and RLHF techniques
Familiarity with front-end technologies for AI application interfaces

Benefits

Compensation

iGenius offers a competitive compensation structure, including salary, performance-based bonuses, and additional components based on experience. All roles include comprehensive benefits as part of the total compensation package.

About iGenius

iGenius is a deep-tech company specializing in the development of Artificial Intelligence solutions for companies operating in highly regulated industries, including financial services, government, and heavy industry. iGenius’ main product, Unicorn, offers tailored solutions for companies looking to integrate AI safely and effectively, mainly through two proprietary Large Language Models (LLMs). Italia 10B is a multi-language model optimized for regulated sectors and high computational efficiency, while Colosseum 355B, built with latest-generation NVIDIA technology, is suited to mission-critical use cases. In addition to Unicorn, iGenius’ product offering includes Crystal, an AI agent for Decision Intelligence that analyzes business data in natural language and accurately supports strategic, insight-driven decision-making. In December 2024, iGenius joined forces with NVIDIA to build Colosseum, one of the largest AI supercomputers in the world, to support the deployment of its models with unrivaled speed, performance, and efficiency.

Active in both Europe and the United States, iGenius is one of the leading AI unicorns in the European landscape and has attracted Fortune 500 companies, including Allianz and Intesa Sanpaolo. This led Gartner to recognize iGenius as a “Cool Vendor” in the AI Core Technologies category and to mention the company in its Market Guide for Conversational AI.

Please review our Privacy Policy here: https://bit.ly/2XAy1gj

AI Engineer

India

Software development

19 hours ago