Principal Lustre Engineer, CSS Global SaaS & Apps Delivery, Federal
Job Description:
Principal Lustre Engineer
Location: Telecommuting/remote from within the US
Note: US citizenship is required because the role requires access to government systems. Any level of clearance is a plus.
Are you interested in building High-Performance Computing (HPC) Systems infrastructure for the cloud?
Oracle's Cloud Infrastructure team is building new Lustre Storage Services that operate at high scale in a broadly distributed multi-tenant cloud environment. Our customers run their businesses on our cloud, and our mission is to provide them with best-in-class Lustre storage capabilities in conjunction with our other compute, storage, networking, database, and security offerings.
We're looking for hands-on engineers with a passion for solving difficult problems in distributed systems, virtualized infrastructure, and highly available services. Joining Oracle will give you the opportunity to design and build innovative new systems from the ground up and operate services at scale. Engineers at every level can have significant technical and business impact while delivering critical enterprise-level features.
As a Senior Member of Technical Staff, you will work as part of a highly collaborative team to build new features for Lustre Storage while operating and growing the current service offering. You should understand Lustre systems and have a strong knowledge of software architecture. You should value simplicity and scale, work comfortably in a collaborative, agile environment, and be excited to learn. Duties and tasks are varied and require independent judgment. The HPC Systems Architect will design and implement high-performance scientific computing infrastructure. In support of this work, you will perform software development tasks associated with designing, developing, and debugging software applications.
Career Level - IC4
Responsibilities:
- Assists architects from Research Informatics and the Principal HPC Systems Architect in assessing and developing strategic architecture and expansion roadmaps for the HPC and storage infrastructure.
- Works with professionals on other teams in the design, development, and implementation of HPC computational and research storage solutions, including HPC technology proof-of-concepts, pilots, and the technology adoption lifecycle.
- Identifies, develops, and implements standards and operating procedures for solutions and systems consistent with best practices.
- Documents current and future state architecture roadmaps and reference architectures.
- Provides clear written and spoken communications to customers, teams, and vendors.
- Keeps abreast of new and emerging technologies and remains open to their potential applicability.
- Performs other duties as assigned or directed to meet the goals and objectives of the department and institution.
Qualifications:
- At least two years of experience in the design, development, implementation, integration, and administration of applicable HPC systems is required. This experience must include Lustre.
- 5+ years of experience developing commercial software in a distributed environment.
- Strong knowledge of Terraform, Go, Python or C++.
- Data structures, algorithms, operating systems, and distributed systems fundamentals.
- Understanding of databases, storage, and persistence technologies.
- Troubleshooting, debugging and performance tuning skills.
- Experience designing and operating large-scale compute infrastructure.
- BS or MS degree or equivalent experience relevant to functional area.
- Ability to do limited travel.