Cloud System support Advisor

Apply Now

Company: Bell Canada

Location: Montreal, QC H1A 0A1

Description:

Our team is advancing how Canadians connect with each other and the world, and developing breakthrough technology plays a vital role in making our purpose a reality. Contribute your experiences, talents and perspectives as we develop innovative digital solutions and best-in-class networks together. We know youll feel a sense of meaningful connection and belonging within our team. Then, with our commitment to environmental, social and governance initiatives, you can feel good about your greater impact helping people as they connect, work, learn and play.

Be at the forefront of shaping the best digital connections and next-generation technology in Canada. Youll join the largest, award-winning, high-tech team in our country, working with the brightest minds across many industries.

Bring your ideas and skills as we grow cutting-edge fibre and 5G networks, develop advanced products and services to run on these networks and then enable the delivery of content from our top media properties and services ensuring that our customers can stay entertained and connected anytime, anywhere.

Summary

As a System support Advisor for our datacenter engineering team, you will be architecting, implementing, and managing the infrastructure necessary to support cloud workloads on physical servers.

Key Responsibilities
  • Monitoring health and performance of cloud computing physical servers (i.e. HPE, DELL, Supermicro, others).
  • Diagnosing and resolving software issues, hardware failures, kernel panics, system crashes, and performance degradation.
  • Lead and support a team of two technical support specialists through daily calls in an agile environment.
  • Leveraging scripting languages (e.g., Bash, Python) and automation tools (e.g., Ansible, Puppet) to automate repetitive tasks, streamline operations, and ensure consistency across bare metal systems.
  • Assessing the need for software and firmware updates based on vendor recommendations, security patches, and performance enhancements.
  • Hardening the security of bare-metal servers through configuration management, access controls, and encryption mechanisms.
  • Ensuring that bare-metal deployments comply with regulatory requirements and internal governance policies.
  • Maintaining comprehensive documentation of system configurations, procedures, and troubleshooting steps to facilitate knowledge sharing and continuity of operations.
  • Work closely with vendors and other internal and external parties to deliver an efficient support solution.
  • Contribute to problem resolution a part of cross functional teams using Agile/Scrum, Lean methodologies
Critical Qualifications
  • Degree in Computer Science or Information Science
  • 5 to 10 years of technical and operation experience in IT
  • Strong knowledge of physical server hardware and data center management
  • Experience building and maintain Ansible playbooks (including Ansible tower)
  • Server hardware and firmware management tools:
  • HPe OneView
  • HPeOneView Global Dashboard
  • DellEMC OpenManage
  • DellEMC SupportAssist Enterprise
  • Advanced Linux knowledge (medium to high level)
  • Python/API (intermediate to advance)
  • Strong problem-solving skills and ability to work under pressure.
  • Strong time management skills and work ethic to manage multiple accountabilities.
  • Ability to build relationships and work effectively with internal players and vendors
  • Embrace change within the same team
  • Available to work 24/7 on call (rotating schedule)
Preferred Qualifications
  • Agile/ DevOps / Lean experience
  • Monitoring Products: Zabbix,Grafana, Prometheus
  • Database: MySQL
  • DMTF RedFish
  • Bilingual French and English (written and verbal)
  • Familiarity with virtualization and containerization (e.g., VMware, OpenShift, OpenStack).
  • Storage in general
  • Network knowledge (medium)
  • CI/CD pipeline
  • Version Control Systems
  • Cloud Skills


Similar Jobs