Infrastructure Architect - Hybrid

Company: CyberThink Inc.

Location: Lansing, MI 48911

Job Description:
As an Infrastructure Architect, you will support high-performance computing (HPC) systems and enterprise storage environments, ensuring security, scalability, and high availability. This role involves administering Linux-based systems, managing compute clusters with Slurm, supporting data and storage infrastructure (NAS, SAN), automating processes using Bash and Python, and maintaining robust backup, recovery, and disaster recovery solutions. The ideal candidate will demonstrate strong problem-solving abilities, scripting expertise, and experience supporting secure and compliant computing infrastructures.

Key Responsibilities:
  • Provide administration and support for Linux systems including Ubuntu CLI, networking, security, and system monitoring.
  • Design, install, configure, and troubleshoot the Slurm workload manager for job scheduling in HPC environments.
  • Develop and maintain automation scripts using Bash and Python, supporting workflow pipelines (e.g., Nextflow).
  • Implement and support data management infrastructure including storage, backup, and disaster recovery.
  • Manage NAS and SAN storage systems, including Qumulo storage and rsync strategies, with a focus on performance and scalability.
  • Perform configuration management using tools like Ansible, Puppet, and Chef for automated deployments.
  • Provide support for virtualized environments and high-speed storage networks (e.g., Mellanox switches).
  • Assist in managing cloud compute and storage systems, including provisioning and troubleshooting resources.
  • Coordinate with labs and client staff to support access to computing and storage resources.
  • Contribute to system and application failover setup and disaster recovery planning and testing.

Required Skills, Experiences, Education, and Competencies:
  • 10+ years of experience in Linux system administration including security, networking, and troubleshooting.
  • 10+ years of experience with Bash and Python scripting and automation pipeline tools (e.g., Nextflow).
  • 10+ years of experience managing the Slurm workload manager, including installation and job debugging.
  • 10+ years of experience with storage technologies including Qumulo NAS, rsync, and mount configurations.
  • Experience with configuration management tools such as Ansible, Puppet, and Chef.
  • Strong background in database administration (e.g., PostgreSQL, SQL Server, MySQL, Oracle).
  • 10+ years of experience in deploying and maintaining HPC systems and selecting suitable hardware/software.
  • Solid understanding of enterprise storage, cloud computing (compute/storage), and security best practices.
  • Experience troubleshooting complex systems, interpreting logs (e.g., IIS, Dynatrace), and optimizing performance.
  • Familiarity with tools and concepts such as conda, Docker, Singularity, HL7 messaging, Cloudflare, and Forcepoint policies.

The hourly range for roles of this nature is $60.00 to $80.00/hr. Rates are heavily dependent on skills, experience, location, and industry.

cyberThink is an Equal Opportunity Employer.
