[Remote] Senior Software Engineer - Applied AI
Note: The job is a remote job and is open to candidates in USA. Blue Cross and Blue Shield of Nebraska is a mission-driven organization dedicated to championing the health and well-being of its members and communities. The role focuses on designing and building a multi-agent orchestration layer for AI systems to enhance member experiences and streamline interactions.
Responsibilities
- Design and build the LangGraph-based system that classifies member intent and routes to the right agent
- The standardized pattern that makes any agent pluggable: input schema, output schema, confidence signaling, latency SLA, fallback behavior
- Multi-turn conversations that maintain full member context across agent handoffs
- Clinical safety boundaries, PHI handling at the LLM layer, and response normalization ensuring every agent response arrives in a consistent tone and within compliance constraints regardless of which agent generated it
- Benefits documents, formularies, and policy content retrieval via Azure AI Search
- Versioned prompt registry, structured output contracts, few-shot pattern design
Skills
- Bachelor's degree and 4-7 years of experience in software engineering
- Experience with agentic AI in production—not LLM wrappers, not single-agent chatbot demos, not RAG tutorials. Systems with multiple agents, explicit state management, and real users depending on the answers being correct
- 2–3 years building and operating production LLM or agentic AI systems
- You can describe specific decisions you made in a multi-agent system: state management approach, how you handled agent failures mid-conversation, how you kept the system governable as agents were added
- LangGraph, AutoGen, or custom graph-based orchestration used in production, not just evaluated. You know where it breaks down and how you worked around it
- Debugged retrieval quality problems, iterated on chunking strategies, dealt with stale knowledge bases in a live system with real users
- Built evaluation systems from scratch: golden datasets, automated regression, domain-appropriate accuracy metrics. Your eval suite caught a production regression before users reported it
- Experience in healthcare, insurance, fintech, or legal domains—AI systems where a wrong answer has consequences, and you designed the safeguards that make accuracy non-negotiable
- Strong knowledge of Python. Work in an existing codebase and extend it with the AI layer
- Azure OpenAI, Azure AI Search, Azure Container Apps are the target stack
- Experience working in an Agile/Scrum environment
- Desire to think in a creative, lateral way both conceptually and in practical application
- You must be obsessed with addressing customer needs
- Strong communication and interpersonal skills
- Possess emotional intelligence
Benefits
- Health insurance
Company Overview
Company H1B Sponsorship