Generalist Evaluator Expert

Remote Full-time Live

Mercor is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. It is ideal for those who enjoy distilling complex concepts into well-crafted text.

* * *

### Job Details:

- Design and Optimize Prompts: Create detailed prompts with multiple constraints and instructions.
- Define and Document Evaluation Standards: Establish high-level expectations for correct responses in general consumer contexts, and develop comprehensive rubrics.
- Conduct Model Testing and Grading: Run prompts through models and assess preliminary outputs against expectations.
- Support Benchmarking and Quality Assurance: Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability before integration into official benchmarks.

### Minimum Qualifications:

- BS or BA from a reputable institution, completed or in progress.
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- US or Canada based.

### Preferred Qualifications:

- Experience in teaching or research.

### Application & Onboarding Process:

- Complete an AI-led interview, which should take around 15 minutes.
- Complete a 45-minute written assessment that will guide you through writing rubrics.
- If selected, you will be invited to work on the project.

### More Details About This Role:

- This is a remote and asynchronous role: work on your own schedule.
- Expect to contribute at least 20 hours per week.
- Expect a commitment of around 1 month.
- You’ll be working in a structured project environment with clear goals and tools.
* * *

### About [Mercor](https://mercor.com/):

- Our team is based in San Francisco, CA
- We [specialize](https://www.forbes.com/sites/johnwerner/2024/03/20/this-ai-startup-wants-to-create-jobs-not-take-them-away/) in recruiting experts for top AI labs
- Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey
