AWS Cloud Operations Specialist/Engineer

Apply Now

Company: Edward Jones

Location: Princeton, NJ 08540

Description:

Job Title: AWS Cloud Operations Specialist/Engineer
Job Location: Princeton, NJ 08540
Job Duration: 12 + Months

Call Notes:
Client is setting up an AWS environment that they will manage themselves.
Working across the managed service environment and the environment they are setting up on their own.
Operationalize managing the environment themselves.
Ingesting events that come across, security issues, health performance monitoring, changes to the environment.
Cnoc cloud network operations center.
Responding to alerts and are they noise or need action is there a process for taking action?
If not build a process for handling those.
Monitoring health and performance.
Monitoring capacity, over or under provisioned?
Responding to events, following existing processes, developing new processes and monitoring.
Scripting experience and systems administration experience for maintaining servers.
They expect to be much more cloud native in their new environment so systems administration experience is key to debug performance issues is big.
Rotation shift, not 24/7 shifts, but there is an on call rotation.
Every 6 weeks your on call for a week.
Certifications as solutions architect is great.
Scripting language is python and bash are the most relevant.
They use git for managing source code and configuration.
Understanding a devops pipeline important.

Description:
The Operations Engineer provides a wide variety of systems administration and Engineering support functions.
Operation tasks are conducted primarily in AWS public cloud, with some work in traditional data centers.
You will be part of a back-end systems support team for our growing portfolio of cloud-based software applications.
You will use infrastructure monitoring tools and respond to alerts to continuously improve the stability of our systems.
Emphasis will be placed on operational duties, tasks will include automating and maintaining cloud services, supporting application development teams on the cloud, performing security and performance related compliance and monitoring tasks, and conducting research and POCs to bring enhancements to the environments.
The role will support troubleshooting incidents and change requests as part of the Cloud Network Operations Center.
There will also be tasks associated with onboarding and supporting application teams to enterprise DevOps CI/CD infrastructure tool sets.
The role will contribute to the documentation of run books, guidelines, and best practices.
Participate in support rotation schedule with off hours support.
Deploy and support automated AWS cloud-based tools and environments in support of application teams.
Analyze and response to incidents and problems including the development of automated monitoring and remediation to maintain uptime and expected service levels.
This includes cloud infrastructure, applications, middleware, and other 3rd party software.
Analyze and resolve problems associated with the operating systems and middleware, for example Redhat Linux, JBoss, Apache, Tomcat, Windows Server, IIS, etc.
Manage, configure, respond and resolve AWS Security alerts including vulnerabilities and patch management.
Design, generate and interpret operational reports related to system health status, capacity management and system performance management.
Determine root cause for incidents, correlate recurring incidents to systemic problems, and drive towards resolution.
Contribute to the build-out of cloud infrastructure, for example, working with services such as load balancers, gateways, firewalls, subnets, security groups, and storage options.
Use scripting and automation tools to increase efficiency, performance, and cost reductions, for example CloudFormation,Terraform, Unix Shell, Python, PowerShell, Ansible, etc.
Participate in the development of Systems Engineering departmental architecture, standards and guidelines.
Work closely with application teams following Agile methods and principles.
Contribute and collaborate to design, document, and publish Engineering standards, principles, guidelines and best practices.
Seek opportunities to increase efficiency through research and investigation, application team input, automation options, POCs, etc.

Required:
Experience with core AWS services like EC2, S3, SNS, Lambda, CloudWatch and CloudTrail.
Experience in the design, development, and implementation of AWS-based infrastructure solutions using AWS APIs, and Python with boto3.
Strong scripting experience in Python and PowerShell/Bash.
Windows and Linux system administration: OS, middleware, application layer
Server, network, and storage performance benchmarking and optimization.
In-depth understanding of the operational dependencies of applications, networks, systems, security, and policy.
Experience with cloud orchestrations tools like AWS CloudFormation and/or Terraform, with an emphasis on creating modular architecture.
Experience with AWS IAM.
Proficient in using Git branching, push/pull requests, and advanced Git workflows.

Preferred:
Experience with Jenkins, Ansible or similar tools.
Experience with application build technologies.
Demonstrated knowledge of DevOps principles. Hands-on experience required.
Strong networking knowledge, preferably with DNS, subnets, routing, security groups, whitelisting, firewalls and various networking infrastructure.
CDK, Control Tower, AWS Control Tower Customization Solution
Experience in containerization and orchestration using Docker, Kubernetes, or Fargate/EKS/ECS.
Familiar with analytics and log aggregation tools such as Splunk or Microsoft BI
Required Skills :
Basic Qualification :
Additional Skills :
Background Check :Yes
Drug Screen :Yes
Candidate must be your W2 Employee :No
Interview Process :
Additional Keywords :
Degree Requirements :
Certification Requirement :
Minimum Experience (In Years) :
Travel Requirements :

Similar Jobs