Senior Systems / IT Operations Engineer II
Other Jobs To Apply
No other job posts for this day.
Job Description Job Summary We're looking for a Senior Systems / IT Operations Engineer II to play a pivotal role in maintaining, developing, and implementing strategic IT initiatives that directly impact our critical manufacturing operations across a large, multisite footprint. This role involves leading highly complex, strategic, and high-risk assignments with clear short-to-mid-term outcomes. You'll utilize your independent judgment and collaborative agile team skills to devise creative solutions, continuously refining our operational frameworks to ensure optimal performance and reliability. Job Responsibilities
- Leads the daily activities for an assigned work group that performs activities to deploy, operate, scale, tune and troubleshoot server systems, website, app infrastructure and/or other systems.
- Leads team of systems engineers responsible for monitoring and evaluating networked or other systems; analyzing technical requirements, deploying software platforms, resolving service problems and performance analysis and tuning for server systems and associated infrastructure.
- Ensures fixes to ensure continuous operation. Sets engineering project schedules, defines project parameters and tasks and monitors project tasks. Keeps the vision of the projects, leading others toward their completion.
- Performs as subject matter advisor to functional area and may become actively involved, as required, to meet schedules and resolve problems highly complex in nature.
- Participates in the development of technical/business approaches and new or enhanced technical tools.
- Develops protocols to support established standards. May develop the technical "vision" and design specifications of assigned projects and may lead in implementation.
- Ensures that defects in assigned projects or programs are tracked and summarizes and presents findings to management.
- Interacts with team, peers and/or managers in other work groups / teams / vendors / customers to share information and improve workflow processes.
- May make needed adjustments to short-term priorities. Recommends modifications to the daily operations of the assigned work group / team.
- Utilizes experience in order to identify problematic relationships. Anticipates, prevents and eliminates problems and roadblocks before they occur.
- Provides subject matter expertise to less experienced team members. May participate in teaching and training members of the work team.
- Bachelor's degree and at least 2 years of experience in a technical role OR High school Diploma / GED and at least 5 years of experience in a technical role.
- Experience in Technology infrastructure, such as Infrastructure, Systems, Databases, or similar
- Experience designing/configuring/administrating complex technical solutions to fix business problems/issues
- Experience establishing & maintaining relationships with individuals at all levels of the organization, in the business community & with vendors.
- Experience identifying operational issues and recommending and implementing strategies to resolve problems.
- Experience mentoring less experienced team members.
- At least 1 year of direct leadership, indirect leadership and/or cross-functional team leadership.
- Willing to travel up to/at least 10% of the time for business purposes (within state and out of state).
- Bachelor's degree in Computer Science, Data Science, Information Technology, Mathematics, or a related technical field.
- Deep knowledge and hands-on experience with enterprise-level IT infrastructure, including advanced administration of Windows and Linux servers in physical (ie; Dell, HP, ESXi) and virtualized environments (e.g., VSphere, Hyper-V), complex database systems (e.g., MS SQL Server, Oracle), and PLC based devices.
- Expertise in enterprise networking concepts and diagnostic troubleshooting, including TCP/IP, routing protocols, VLANs, VPNs, firewalls, and network performance analysis across local and wide-area networks.
- Proficiency in task automation with scripting languages (e.g., Python, PowerShell, Bash) for data analysis, log parsing, with a possibility of developing or restructuring custom diagnostic tools.
- Proven ability to leverage ticketing systems (Remedy, JIRA, ServiceNOW) for managing complex issues and leading root cause analysis (RCA) efforts grounded in forensic evidence gathering in a large-scale, distributed environment.
- Extensive experience utilizing IT monitoring, logging, and observability platforms (e.g., Splunk, ELK Stack, Prometheus, Grafana, SolarWinds, Nagios, Zabbix, SCOM, DynaTrace) for proactive issue detection, performance baseline analysis, and evidence-based deep-dive troubleshooting.
- Proven ability to extract, transform, and utilize data from large-scale data platforms such as Microsoft Fabric, Databricks, Snowflake, Google BigQuery, or Amazon Redshift for in-depth analysis and diagnostic purposes.
- Advanced proficiency in using network diagnostic and packet analysis tools (e.g., Wireshark, tcpdump, Nmap) and system performance profiling tools (Tivoli, SCOM, PerfMon, ProcMon, SysInternals) to identify and isolate anomalies, adopting a forensic approach to data extraction and analysis.
- Exceptional analytical and investigative skills, with a proven ability to diagnose and resolve highly complex, intermittent, and interdependent issues spanning multiple IT domains (network, server, application, database, endpoint) in a five-nines uptime environment.
- Significant experience supporting and maintaining IT infrastructure that spans multiple geographical locations and connects to thousands of varied endpoints, ensuring high availability and optimal performance through evidence-driven solutions.
- Familiarity with endpoint management solutions (e.g., SCCM, Intune, Kaseya, Bomgar) and their role in a large enterprise.
- Experience in manufacturing automation, assembly, and robotic systems, particularly with data collection and application for problem-solving.
- Experience developing design specifications, test plans, and protocols for IT solutions and system improvements based on empirical data and facts.
- This position requires approximately 20-30% travel within the United States, which includes visits to various sites for system deployments, troubleshooting, and strategic planning.