Engineer

Apply Now

Company: Tata Consultancy Services

Location: Arlington, VA 22204

Description:

Skill: Platform Engineer, APIGEE

  • Cloud Platforms: AWS, Azure, GCP.
  • Infrastructure as Code (IaC): Terraform, Ansible, CloudFormation.
  • Automation & Scripting: Python, Bash, PowerShell, YAML.
  • Containerization & Orchestration: Docker, Kubernetes, Helm.
  • Operating Systems: Linux, Windows, Unix.
  • Monitoring & Observability: Datadog, Splunk, Prometheus, Grafana, ELK Stack.
  • CI/CD & DevOps: Jenkins, GitLab CI/CD, GitHub Actions, ArgoCD.
  • Networking & Security: VPN, Firewalls, IAM, Zero Trust.
  • Database & Storage: Cassandra, PostgreSQL, Redis, S3.
  • Logging & Alerting: Grafana, Prometheus, Splunk.


Roles & Responsibilities:

  • Platform Engineering.
  • Infrastructure Management.
  • Site Reliability Engineering (SRE).
  • Cloud Architecture.
  • High Availability & Scalability.
  • Performance Optimization.
  • Incident Management.
  • Disaster Recovery & Business.
  • Platform Automation & Workflow Orchestration.
  • Microservices & API Management.
  • Capacity Planning & Resource Optimization.
  • Hybrid Cloud & Multi-Cloud Strategy.
  • Enterprise IT Support & Troubleshooting.


Soft Skills:

  • Cross-functional Collaboration.
  • Technical Documentation & Knowledge Sharing.
  • Stakeholder Communication.
  • Problem-Solving & Critical Thinking.
  • Project Management (Agile, Scrum, Kanban).


Key Responsibilities:

Platform Development & Engineering:
  • Designing and implementing scalable, reliable, and secure IT platforms.
  • Developing automation tools for infrastructure provisioning, monitoring, and maintenance.
  • Ensuring high availability and performance of platforms.
  • Managing cloud (AWS, Azure, GCP) and on-premises infrastructure.


Infrastructure & Operations Management:

  • Overseeing server, storage, and network operations.
  • Managing operating systems (Linux, Windows) and virtualization platforms (VMware, Kubernetes).
  • Implementing infrastructure as code (IaC) using Terraform, Ansible, etc.
  • Handling incident response and troubleshooting platform-related issues.


Security & Compliance:

  • Enforcing security policies and access controls.
  • Implementing security best practices for platform infrastructure.
  • Monitoring for vulnerabilities and applying patches/updates.


Observability & Performance Optimization:

  • Setting up monitoring, logging, and alerting tools (Datadog, Splunk, Prometheus, etc.).
  • Conducting performance tuning and capacity planning.
  • Analyzing system metrics to optimize platform efficiency.


Automation & DevOps Integration:

  • Building CI/CD pipelines for seamless software deployment.
  • Automating workflows to improve development and operations efficiency.
  • Supporting microservices architecture and container orchestration (Docker, Kubernetes).


Governance & Lifecycle Management:

  • Managing platform versioning, upgrades, and end-of-life processes.
  • Standardizing best practices for infrastructure and application deployment.
  • Documenting platform archite cture and operational procedures.


Collaboration & Support:

  • Working closely with software development, security, and IT operations teams.
  • Supporting enterprise applications and ensuring platform stability.
  • Providing technical guidance and training for teams using the platform.


Salary Range - $100,000-$120,000 a year

#LI-NR3

Similar Jobs