[Remote] LLM - AI Quality Analyst - English

Remote Full-time Live

Note: This is a remote position open to candidates in the USA. PGC Digital (America) Inc is a CMMI Level 3 company seeking an AI Quality Analyst to evaluate a new personalization feature for Gemini. The role involves assessing the quality of AI responses based on personal interactions and experiences, and requires a blend of creativity and analytical skills.

Responsibilities

  • Designing and executing multi-turn conversational prompts (typically 1-5 turns) that require the AI to utilize your personal information and experiences
  • Evaluating model responses based on your intent from the starting prompt, checking if the personalization was appropriately applied
  • Analyzing responses for Grounding issues, ensuring claims about you are supported by evidence and not flawed inferences or hallucinations
  • Assessing Integration quality to ensure personal data is woven naturally into the response without robotic "overnarrating"
  • Rigorously evaluating and stack-ranking two model responses side-by-side (SxS) to determine which is overall more helpful, easy to use, and enjoyable
  • Writing clear, defensible rationales for your comparisons, explicitly referencing where issues or positive aspects occurred in the conversation
  • Extracting and verifying "Debug Info" from the model to confirm that chat summaries and data sources were properly utilized
  • Maintaining strict data hygiene by deleting evaluation conversations to prevent them from polluting your future chat history

Skills

  • English Proficiency: Ability to read and write in English with a high degree of competence, as English is the focus language for this project
  • Personal Account Usage: Willingness to use your primary personal Google account (not a testing account) and enable personal data sources for a genuine assessment
  • Schedule Flexibility: Full-time availability in your local time zone is required. We are staffing a global, 24-hour operations team
  • Exceptional Analytical Thinking: Demonstrated ability to evaluate nuanced and ambiguous AI responses, specifically assessing personalization quality
  • Creative Prompt Engineering: Experience in designing creative, multi-turn starting prompts based on personal context to thoroughly test the model's capabilities
  • Strong Evaluation Acumen: Understanding of personalization concepts, including the ability to identify incorrect personalization, poor inferences, and forced connections
  • Meticulous Attention to Detail: The ability to review Side-by-Side (SxS) model responses and spot subtle differences in naturalness and overnarrating
  • Excellent Written Communication: Superior ability to write clear, concise, and structured rationales for model rankings, explicitly referencing specific turn numbers
  • Feedback: Ability to provide constructive feedback and detailed annotations
  • Communication: Excellent communication and collaboration skills
  • Independence: Self-motivated and able to work independently in a remote setting
  • Technical Setup: Desktop or laptop setup with a good internet connection
  • BS/BA degree or equivalent experience in a relevant field (e.g., Policy, Law, Ethics, Linguistics, Journalism, Computer Science, or a related analytical field)
  • Experience in data annotation, AI quality evaluation, content moderation, or a related role is strongly preferred

Company Overview

  • PGC (America) Inc represents the North American arm of PGC Digital, the global brand identity of PradeepIT Global Consulting PTE Ltd, a Singapore-headquartered IT services and consulting company delivering intelligent enterprise transformation worldwide. It was founded in 2016 and has a workforce of 501-1000 employees. Its website is https://us.pgcdigital.ai.

