[Remote] AI/ML Research Engineer, LLM Post-Training & Evaluation

Remote Full-time Live

Note: The job is a remote job and is open to candidates in USA. Innodata Inc. is a leading data engineering company specializing in AI technology solutions. They are seeking an AI/ML Research Engineer to design and optimize training and evaluation systems for large language models, working closely with various technical stakeholders to ensure robust model improvements.

Responsibilities

Lead or co-lead technically complex ML engineering projects from initial customer discussions through implementation and delivery
Design, build, and improve LLM training and post-training pipelines, including data ingestion, preprocessing, fine-tuning, evaluation, and experiment tracking
Implement and optimize evaluation systems for LLMs and multimodal models, including offline benchmarks and task-specific test harnesses
Integrate human-in-the-loop and AI-augmented evaluation signals into model development workflows
Build robust infrastructure and tooling for reproducible experimentation, metrics logging, and regression monitoring
Diagnose model behavior and pipeline failures, including data issues, training instability, metric inconsistencies, and evaluation drift
Collaborate with Language Data Scientists and Applied Research Scientists to translate evaluation frameworks into executable systems
Work closely with customer technical stakeholders to understand goals, constraints, and success criteria; propose and implement technically sound solutions
Contribute to internal research and platform development, including benchmark frameworks, evaluation tooling, and post-training workflow improvements
Contribute to best practices and standards for LLM training, evaluation, and quality assurance across projects
Mentor junior engineers and contribute to technical design reviews, documentation, and engineering rigor across the team

Skills

BS/MS/PhD in Computer Science, Machine Learning, AI, Applied Mathematics, or a related quantitative technical field (MS/PhD preferred)
2-3 years of relevant industry or research engineering experience in ML/AI systems
Hands-on experience with LLM training / fine-tuning / post-training, including at least one of: supervised fine-tuning (SFT), preference optimization (e.g., DPO or related methods), RLHF / RLAIF-style workflows, task- or domain-adaptation of foundation models
Strong programming skills in Python and experience building production-quality ML code
Experience with modern ML frameworks (e.g., PyTorch, JAX, TensorFlow) and model libraries/tooling (e.g., Hugging Face ecosystem, vLLM, distributed training stacks)
Experience designing and implementing evaluation pipelines for LLM/ML systems, including metrics computation, dataset handling, and experiment comparisons
Strong understanding of data pipelines and ML systems engineering, including reproducibility, observability, and debugging
Experience with large-scale distributed ML systems and performance optimization for training/evaluation workloads (GPU/accelerator environments preferred)
Experience with large-scale data processing and workflow orchestration in support of model training/evaluation
Ability to collaborate directly with technical stakeholders including research scientists, ML engineers, data engineers, and customer technical leads
Strong written and verbal communication skills, including the ability to explain complex technical tradeoffs to both technical and non-technical audiences
Experience with multimodal model training/evaluation (text + image/audio/video)
Experience with long-context evaluation and/or model adaptation for long-context tasks
Experience with agentic or multi-turn evaluation harnesses, tool-use simulation, or interactive environment testing
Experience working in customer-facing technical consulting, solutions engineering, or applied research delivery
Familiarity with LLM safety, alignment, robustness, or red-teaming evaluation approaches
Contributions to open-source ML/LLM tooling or published technical work in relevant areas

Company Overview

(NASDAQ: INOD) Innodata is a global data engineering company. We believe that data and AI are inextricably linked. It was founded in 1988, and is headquartered in Hackensack, New Jersey, USA, with a workforce of 5001-10000 employees. Its website is http://www.innodata.com.

Company H1B Sponsorship

Innodata Inc. has a track record of offering H1B sponsorships, with 2 in 2024. Please note that this does not guarantee sponsorship for this specific role.

Apply tot his job Apply To this Job

Apply

[Remote] AI/ML Research Engineer, LLM Post-Training & Evaluation

Responsibilities

On the same wavelength

Experienced Customer Service Representative – Live Chat Support Specialist (FULLY REMOTE)

Entry-Level Live Chat Support Specialist – Remote Position with Full Flexibility, Competitive Hourly Rate, and Opportunities for Growth at Taskzeno

Remote Customer Chat Support - E-commerce Stores

AI/LLM Evaluation & Alignment Software Engineer

Quality Assurance Test Engineer, Remote, ID, US

Platform Engineer/Performance Tester

Loan Officer - Remote Sales Professional (Iowa)

Remote HUD Loan Processor

Remote Freight and Logistics Analyst

[Remote] Sr. Logistics Analyst, Domestic

Experienced Tutor for HSI STEM Computer Information Systems (CIS) IT/Cybersecurity - Remote Opportunity

Education Technology Specialist

Experienced Customer Service Representative – Work From Home Opportunity with arenaflex

Director of Sales Training

Technical Account Manager - Remote, United States

Experienced Remote Customer Support Representative – Deliver Exceptional Customer Experiences from the Comfort of Your Home

Software Engineer - Java/Python

Nursing Online Instruction Specialist - Contract with Benefits: Shaping the Future of Nursing Education through Innovative Online Learning Experiences

Open Rank Professor - Center for Remote Patient and Participant Monitoring

Experienced Full Stack Customer Support Specialist – Live Chat & Data Entry