GoodVibeCode
Jobs/IT Operations Technical Lead
AX

IT Operations Technical Lead

Axle

$150K–170K/yrFrederick, Maryland, United StatesFull-time10+ yearsOn-site
Posted 13 hours ago· America/New_York· Frederick, Maryland, United States

About This Role

(ID: 2026-1538) Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science, software engineering, and program management, we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH). Benefits We Offer: 100% Medical, Dental & Vision Coverage for Employees Paid Time Off and Paid Holidays 401K match up to 5% Educational Benefits for Career Growth Employee Referral Bonus Flexible Spending Accounts: Healthcare (FSA) Parking Reimbursement Account (PRK) Dependent Care Assistant Program (DCAP) Transportation Reimbursement Account (TRN) Responsibilities: Lead and manage IT operations aligned with ITIL processes including Incident, Problem, Change, and Release Management Provide hands-on leadership in managing Linux and Windows environments across cloud and on-premises infrastructure Own and drive incident response, root cause analysis, and service restoration for mission-critical systems Design, build, and maintain golden images, patching strategies, and system hardening standards Lead patch management and vulnerability remediation programs ensuring compliance and system integrity Develop and implement automation solutions using modern approaches including Vibe Coding (AI-assisted development) to accelerate operational efficiency and reduce toil Support and optimize infrastructure for AI/ML workloads, including provisioning, scaling, and performance tuning Manage and maintain GPU-enabled environments and instances for high-performance computing and machine learning use cases Oversee and optimize infrastructure monitoring, logging, alerting, and observability frameworks Manage and mentor a team of systems engineers; provide technical guidance and performance oversight Collaborate with architecture, security, and development teams to improve reliability, scalability, and operational efficiency Support hybrid environments including cloud platforms and on-premise data centers Ensure proper documentation, runbooks, SOPs, and operational readiness Stay abreast of new technologies in your areas but not limited to US Federal Standards, NIST Publications, cloud computing & deployment, site reliability engineering, security standards and compliance best practices etc. Requirements: Must have 5+ years of experience leading operations team with hands-on experience in driving operational process improvements and technological advancements. Proven experience implementing and operating within ITIL frameworks Must have 10+ years of hands-on Unix/Linux experience that includes specific technical experience with CentOS / Red Hat systems administration support for large scale distributed environments Hands-on experience with incident management, patching, system hardening, and production support Experience building and maintaining golden images and standardized environments Strong scripting/automation skills (e.g., Python, Bash, PowerShell or similar) Experience with configuration management and automation tools (Ansible, Terraform, Puppet, Chef, or similar) Strong understanding of networking fundamentals (DNS, TCP/IP, firewalls, load balancing) Experience with monitoring and logging tools (e.g., Nagios, Splunk, ELK, Prometheus, Grafana) Must have Cloud Build-Out or Migration experience in at least one of the following providers Amazon AWS, Google GCP and Microsoft Azure Must have 2+ years with CI/CD and automation tools such as Terraform, Ansible, Chef, Puppet, Jenkins, GitHub Experience supporting AI/ML workloads or data-intensive platforms Familiarity with GPU-based compute environments (e.g., NVIDIA GPU instances) Must be willing to learn new technologies, adopt and adapt to emerging technologies or needs from a project to a project Knowledge of security best practices and compliance frameworks such as NIST 800-53, FedRAMP, FISMA etc. are preferred Certifications such as ITIL, Linux, AWS, Azure, or Kubernetes (CKA/CKAD) is preferred Networking certifications (CCNA/CCNP) are a plus Disclaimer: The above description is meant to illustrate the general nature of work and level of effort being performed by individuals assigned to this position or job description. This is not restricted as a complete list of all skills, responsibilities, duties, and/or assignments required. Individuals may be required to perform duties outside of their position, job description or responsibilities as needed. The diversity of Axle’s employees is a tremendous asset. We are firmly committed to providing equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age, race, gender, religion, national origin, disability, marital status, covered veteran status, sexual orientation, status with respect to public assistance, and other characteristics protected under state, federal, or local law and to deter those who aid, abet, or induce discrimination or coerce others to discriminate. Accessibility: If you need an accommodation as part of the employment process please contact: careers@axleinfo.com This role has a market-competitive salary with an anticipated base compensation range listed below. Actual salaries will vary depending on a candidate’s experience, qualifications, skills, and location. Salary Range $150,000—$170,000 USD

Responsibilities

Lead and manage IT operations, including incident, problem, and change management, while overseeing Linux and Windows environments. Develop automation solutions and optimize infrastructure for AI/ML workloads and high-performance computing.

Requirements

Requires over 10 years of hands-on Unix/Linux experience and at least 5 years in a leadership role managing operations teams. Candidates must have experience with ITIL frameworks, cloud platforms, and automation tools.

Benefits

Medical coverageDental coverageVision coveragePaid time offPaid holidays401k matchEducational benefitsEmployee referral bonusFlexible spending accountsParking reimbursementDependent care assistant programTransportation reimbursement

Skills & Tags

Linux administrationWindows administrationITILIncident managementAutomationPythonBashPowerShellAnsibleTerraformCloud computingNetworkingObservabilityAI/ML infrastructureGPU managementSecurity compliance

Keywords

IT OperationsLinuxWindowsITILCloudAWSGCPAzureAutomationPythonBashPowerShellAnsibleTerraformNetworkingObservabilityAI/MLGPUNISTFedRAMPFISMACentOSRed HatCI/CDJenkinsGitHubSplunkPrometheusGrafanaSite Reliability Engineering

Categories

TechnologyManagement & LeadershipSoftwareEngineeringSecurity & Safety

Source: greenhouse