DevOps Engineer
Apply NowCompany: Clifyx
Location: Atlanta, GA 30349
Description:
Job Description for L3 System Engineer (PlatOps)
I. Introduction
Need to provide the following Support Services:
Manage environments and provide L2/L3 support.
Plan, Implement, Manage and Consult for Infrastructure Services.
Activities to Cover:
Incident, Task, Change and Problem management through Service Now and Jira.
Keep the environments up to date & ensure none of the web services are disrupted.
Build knowledge base as and when necessary.
Support of Monitoring Apps: Circonus, Zenoss Core, Zabbix stated in terms of
Modifying alerts to appropriate threshold.
Adding new alerts as requested.
Remove deprecated alerts.
II. Scope of work
1. Linux OS Management (Redhat, CentOS, Ubuntu):
As part of OS management, L2 and L3 level of activities would be supported on the areas mentioned below.
Administration, Installation, Configuration & Trouble Shooting of Servers including VM's
OS start/shutdown/reboot
OS deployment, configuration, integration & decommissioning
Service start/stop/restart
Apply OS patches based OEM releases & internal guidelines
Apply security patches based on CERT alerts & internal security guidelines
File system and disk space management
Log rotation, movement & retrieval
Ensure uptime of all servers & services
Capacity management
CPU
Memory
IOPS
Storage
2. Virtualization Platform Support
Red Hat Virtualization / KVM IO 1.0 2.0 administration
VMware VMs administration
Aerosol VM provisioning
Argo / Joyent VM provisioning
3. Hardware Installation and Configuration (Remote Access Management)
Remote Access Management, Installation, Configuration and Troubleshooting of iLO, iDRAC, DELL OME, Serial Console for listed Servers.
Dell Power Edge Series
Client ProLiant Series
Compaq ProLiant Series
IBM System X Series
IBM eServer Series
Sun Microsystem
4. Configuration Management and Build Integration (L3 Support)
Chef
Puppet
Unity
Bamboo
5. Storage and Storage Switch Installation, Configuration, Monitoring (L3 Support)
NetApp Storage
Hitachi Storage
Xiotech Storage
Cisco Switch
Brocade Switch
III. Tools Management
Need to manage Client Infra Tools as listed.
Topic Tools
Ticket Management JIRA
Service Now
Alert and Communication Tools Slack
Xmatters
Skype
VMWare Vcenter / Vsphere
Configuration MGMT Tools CHEF
Puppet
Unity
Inventory and Reporting Tools
FileMaker Pro Inventory
Analyzer Report
IDB Argo Report
Aerosol Utilization Report
Version Control Management Tool GITHUB
Bit Bucket
Build/CI Tool Bamboo
Monitoring Tools Circonus
Zabbix
HiTrack HDS
Observium
OME
Consoles AWS
AZURE
OpenStack
Programming Tools
(Added Advantage) Ruby
Perl
IV. Knowledge Base
Create standard operating procedures on demand basis such as
Server commissioning & decommissioning
Web server deployments & Integration
OS security updates and patching procedures
CERT advisory related security updates
Puppet, Chef architecture & configuration details
File system management & Housekeeping
V. Value Adds
Innovate and Identify Opportunities for Automation
RCA and Permanent Remediation of Problems
Implement Continuous Integration
Implement Continuous Delivery
I. Introduction
Need to provide the following Support Services:
Manage environments and provide L2/L3 support.
Plan, Implement, Manage and Consult for Infrastructure Services.
Activities to Cover:
Incident, Task, Change and Problem management through Service Now and Jira.
Keep the environments up to date & ensure none of the web services are disrupted.
Build knowledge base as and when necessary.
Support of Monitoring Apps: Circonus, Zenoss Core, Zabbix stated in terms of
Modifying alerts to appropriate threshold.
Adding new alerts as requested.
Remove deprecated alerts.
II. Scope of work
1. Linux OS Management (Redhat, CentOS, Ubuntu):
As part of OS management, L2 and L3 level of activities would be supported on the areas mentioned below.
Administration, Installation, Configuration & Trouble Shooting of Servers including VM's
OS start/shutdown/reboot
OS deployment, configuration, integration & decommissioning
Service start/stop/restart
Apply OS patches based OEM releases & internal guidelines
Apply security patches based on CERT alerts & internal security guidelines
File system and disk space management
Log rotation, movement & retrieval
Ensure uptime of all servers & services
Capacity management
CPU
Memory
IOPS
Storage
2. Virtualization Platform Support
Red Hat Virtualization / KVM IO 1.0 2.0 administration
VMware VMs administration
Aerosol VM provisioning
Argo / Joyent VM provisioning
3. Hardware Installation and Configuration (Remote Access Management)
Remote Access Management, Installation, Configuration and Troubleshooting of iLO, iDRAC, DELL OME, Serial Console for listed Servers.
Dell Power Edge Series
Client ProLiant Series
Compaq ProLiant Series
IBM System X Series
IBM eServer Series
Sun Microsystem
4. Configuration Management and Build Integration (L3 Support)
Chef
Puppet
Unity
Bamboo
5. Storage and Storage Switch Installation, Configuration, Monitoring (L3 Support)
NetApp Storage
Hitachi Storage
Xiotech Storage
Cisco Switch
Brocade Switch
III. Tools Management
Need to manage Client Infra Tools as listed.
Topic Tools
Ticket Management JIRA
Service Now
Alert and Communication Tools Slack
Xmatters
Skype
VMWare Vcenter / Vsphere
Configuration MGMT Tools CHEF
Puppet
Unity
Inventory and Reporting Tools
FileMaker Pro Inventory
Analyzer Report
IDB Argo Report
Aerosol Utilization Report
Version Control Management Tool GITHUB
Bit Bucket
Build/CI Tool Bamboo
Monitoring Tools Circonus
Zabbix
HiTrack HDS
Observium
OME
Consoles AWS
AZURE
OpenStack
Programming Tools
(Added Advantage) Ruby
Perl
IV. Knowledge Base
Create standard operating procedures on demand basis such as
Server commissioning & decommissioning
Web server deployments & Integration
OS security updates and patching procedures
CERT advisory related security updates
Puppet, Chef architecture & configuration details
File system management & Housekeeping
V. Value Adds
Innovate and Identify Opportunities for Automation
RCA and Permanent Remediation of Problems
Implement Continuous Integration
Implement Continuous Delivery