Description
Description
We are seeking an experienced System Administrator Lead with a deep understanding of high availability systems to oversee and manage the Windows and VDI IT infrastructure with a combination of bare metal, virtual machine capability, and other mission-critical environments. As the leader of a team of 20+ IT professionals, you will ensure high availability for our business-critical applications and systems. You will lead and mentor your team, manage complex infrastructure projects, and ensure that the organization's systems are highly available, secure, and perform optimally.
Key Responsibilities:
- Manage and maintain Windows Server 2019 environments, ensuring all systems are configured for high availability and security.
- Implement and manage Active Directory (AD), Group Policies, DNS, DHCP, and other critical Windows services in HA configurations.
- Ensure that all critical systems are highly available and maintain redundancy for key infrastructure services.
- Oversee VMware vSphere and vCenter for virtual machine (VM) provisioning, migration, and HA management.
- Implement and manage vMotion, HA clusters, Distributed Resource Scheduler (DRS), and Storage vMotion to maintain high availability of virtualized systems.
- Ensure Nutanix hyper-converged infrastructure is designed for high availability, ensuring storage, compute, and networking components are redundant and resilient.
- Oversee the VDI (Virtual Desktop Infrastructure) solutions (such as VMware Horizon), ensuring highly available desktops and applications for end-users.
- Implement load balancing and disaster recovery strategies for virtual desktop environments.
- Ensure SharePoint environments are highly available, with redundant farm configurations and backup/recovery processes that support HA goals.
- Implement high-availability configurations for SharePoint services, including SQL databases and web front-end servers.
- Oversee the implementation of IIS (Internet Information Services) for web hosting, ensuring that all critical web applications are designed for high availability.
- Configure and manage load balancing across web servers to ensure high availability for mission-critical websites and applications.
- Oversee the management of SQL Server databases, SQL clustering, and log shipping for high availability and disaster recovery.
- Ensure HA configurations are aligned with organizational needs for minimal downtime and maximum data integrity.
- Manage replication, backup, and disaster recovery for SQL environments to support critical business operations.
- Work closely with senior management and other IT teams to align HA strategies with business objectives and ensure all systems are optimized for availability.
- Generate detailed reports on system health, availability, and performance for executive leadership.
- Communicate infrastructure changes, downtime, or outages with clarity and transparency to business stakeholders.
Qualifications
Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience) with 5 years experience, or additional experience can be used in lieu of a degree.
- Certifications in HA technologies and related infrastructure management such as:
- Microsoft Certified: Windows Server 2019 (highly desirable)
- VMware Certified Professional (VCP) (highly desirable)
- Microsoft Certified: Azure or other cloud certifications (highly desirable)
- SQL Server certifications, with a focus on high availability and disaster recovery configurations
- Nutanix Certified Professional (NCP)
- ITIL or other service management frameworks (optional)
- Excellent leadership and team management skills, with the ability to drive performance in a diverse and distributed team.
- Strong troubleshooting and problem-solving abilities in complex, high-availability environments.
- Ability to communicate complex technical issues related to HA systems in simple terms to non-technical stakeholders.
- Ability to thrive in a fast-paced, high-pressure environment with tight deadlines.
- At least 7+ years of experience in system administration, with a focus on high availability systems.
- A minimum of 3 years of leadership experience managing a team of system administrators, with a focus on HA architecture and disaster recovery.
- Proven experience with designing, implementing, and managing high-availability infrastructure across Windows, VMware, Nutanix, and SQL environments.
- Active secret with the ability to obtain a TS/SCI eligibility clearance
- Certification: Security+CE
Target salary range: $120,001 - $160,000. The estimate displayed represents the typical salary range for this position based on experience and other factors.
SAIC accepts applications on an ongoing basis and there is no deadline.
Covid Policy: SAIC does not require COVID-19 vaccinations or boosters. Customer site vaccination requirements must be followed when work is performed at a customer site.
Apply on company website